Facebook not scraping correct specific data - debugging

I've had this problem in the past where FB scrapes the wrong data, but changing the thumbnail image name has always fixed the problem. However, today I'm unable to get the same result. The error is this:
[Debugger screenshot]
I've cleared my browser cache, renamed the image, inserted code at the top of the blog post with the image link, and scraped a dozen times in the debugger. My company uses Placester for our web platform (if that means anything to anyone), so I don't have quite as much freedom as with other sites. Can anyone help me with this, but put it in layman's terms?
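One way to see what a scraper is likely to pick up is to list the Open Graph meta tags in the page source, since Facebook's scraper generally takes the preview image from og:image. The sketch below is only an illustration using Python's standard library: the sample HTML and the example.com image URL are made up, and whether Placester lets you edit these tags is not something this thread confirms.

```python
from html.parser import HTMLParser


class OpenGraphParser(HTMLParser):
    """Collect <meta property="og:..."> tags found in a page's HTML."""

    def __init__(self):
        super().__init__()
        self.tags = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        prop = attr.get("property") or ""
        if prop.startswith("og:"):
            # Keep the first value seen for each property.
            self.tags.setdefault(prop, attr.get("content"))


def open_graph_tags(html):
    """Return a dict such as {"og:image": "..."} for the given HTML source."""
    parser = OpenGraphParser()
    parser.feed(html)
    return parser.tags


# Hypothetical page source; paste your own post's HTML here instead.
sample = """
<html><head>
<meta property="og:title" content="My blog post">
<meta property="og:image" content="https://example.com/images/thumb-v2.jpg">
</head><body>...</body></html>
"""

print(open_graph_tags(sample))
```

If og:image is missing or points at an old file, that would be consistent with the debugger showing the wrong thumbnail, though other causes are possible.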


Liferay 7: Is it possible to get an image from a linked site into a sitemap automatically?

Liferay 7.3.2.
I'm using a sitemap to create link boxes that link to other pages in the website. Links, names, and everything else are basic stuff, but I have no idea if or how I could get images for the boxes.
Asset publisher would've been my first choice, as I've done similar stuff before with them, but I can't find a way to add pages to the publisher, so I'm not sure if it's viable here. Of course, I could create an asset publisher that would ask for a name, image and a link, but this should be automatic to avoid extra hassle.
A sitemap allows for automation if you just map the page correctly, but getting the image would still be a problem.
I haven't really put much thought into a navigation menu, as there will be many pages with link boxes and it would be really hard to control. The image would be a problem there as well.
The optimal solution would be to get the image from the page the link takes the user to, but if that's impossible, I wonder if there's a way to give a page an image that could be shown here.
Got it working with a bit of looking around! This can help with same kind of problems others might be facing, so I'll gladly answer this myself!
From the Admin panel, go to Settings and then Additional Fields.
Select Page and add a field to it. After filling in the details, you should be able to see this field in the page settings for every page.
Go to the page settings and fill out your info. In my case, I created a text field that will be used for image URLs.
Finally, in your Application Display Template, you can read the field by writing:
${entry.getExpandoBridge().getAttribute("Extra field")}, "Extra field" being the name of your field. Remember to add conditional statements if necessary!

Why has hosting images in Google Sites from Google Drive stopped working?

Yesterday I created a simple "quote of the day" script for my Google site. The script reads the quote from a Google sheet. I also wanted each quote to go with a companion image, which I host in my Google Drive (the image IDs are also stored in the sheet).
I followed the instructions explained here, and everything seemed to work fine. The permissions for the images were "everyone with the link within my institution" (this is supposed to work in a website on Google Apps for Education).
It was a success, because the images did display correctly.
Today I opened that page again, and the images have disappeared.
I know that image hosting was discontinued in 2016, but I think it was a different way of hosting (no permalink). Indeed, as I mentioned, yesterday I could see the images in the website.
Also, when I type the "permalink" into the Chrome address bar I get "403. That’s an error. We're sorry, but you do not have access to this page. That’s all we know." This happens even though I'm the owner of the file and I'm logged into the correct Google account. It also happens irrespective of the browser and/or machine I use (I tried with my phone too).
I also tried what is explained here, although I'm not sure I understand it completely. There must be a problem though, because instead of the image I get the alt text (NA).
Is the "permalink" method still supposed to work?
Could this be a problem of excessive traffic? I haven't even published the page yet, I only did some testing...
What is the best way of displaying images from google drive into google sites, without resorting to third party services?
Thanks a lot
Francesco
You already answered your question. Hosting on Google Drive has been stopped.
However, you can try the workaround from this SO post where you'll use the URL:
https://drive.google.com/uc?export=view&id={IMAGE_FILE_ID}
It worked fine when used with the <img src="{IMAGE_URL}"> HTML tag. That's the closest thing to what you're trying to do.
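As a rough illustration of the workaround, the URL pattern quoted above can be assembled from a file ID and dropped into an image tag. This is a minimal Python sketch, not the original poster's script: the function names and the FILE_ID_FROM_SHEET placeholder are invented for the example, and whether the image actually displays still depends on the file's sharing settings.

```python
from html import escape
from urllib.parse import urlencode


def drive_image_url(file_id):
    """Build the direct-view URL pattern quoted in the answer above."""
    query = urlencode({"export": "view", "id": file_id})
    return "https://drive.google.com/uc?" + query


def img_tag(file_id, alt=""):
    """Wrap the URL in an <img> tag, escaping it for use in an HTML attribute."""
    return '<img src="{}" alt="{}">'.format(
        escape(drive_image_url(file_id)), escape(alt)
    )


# "FILE_ID_FROM_SHEET" is a placeholder, not a real Drive file ID.
print(drive_image_url("FILE_ID_FROM_SHEET"))
print(img_tag("FILE_ID_FROM_SHEET", alt="Quote of the day"))
```

Inside the HTML attribute the ampersand is written as &amp;amp;, which browsers decode back to a plain & when they request the image.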
You can also try Firebase Storage where you host your images on the cloud and display them on your web page.
This is at least a partial answer to the problem I tried to explain, which I'm summarizing below:
I was hosting images for a page in my google site on my google drive, via the syntax
https://drive.google.com/uc?export=view&id={IMAGE_FILE_ID}
the sharing settings of the individual images were "anyone with the link within [my institution]";
this worked for one single day and then stopped working: the images stopped showing, and I could not figure out why. All this happened a few days ago (Oct 21-22, 2017), and hence cannot possibly have to do with Google discontinuing a previous service, which happened in 2016;
this happened even though I was logged into my institution's account (and hence was the owner of the images).
I found out that the problem appears to be in the sharing settings. Apparently, the images only display if their sharing settings are "public on the web" or "anyone with the link". Instead "anyone with the link within [my institution]", or more restrictive settings won't work. The first settings were mentioned in the webpage I linked, but I thought it would also work with "anyone with the link within [my institution]" (and, for one day, it did).
I also tried by setting the permissions of the parent folder. In that case only "public on the web" would work. With anything more restrictive than that the images won't display.
Although I carefully double-checked all this, the permissions behavior seems to be a bit erratic.
As I mentioned, on the first day (Oct 21) the images would display even with "anyone with the link within [my institution]". I could swear on my life that, before the systematic tests I just finished carrying out, everything was working with the folder set to "anyone with the link within [my institution]". Now that setting won't work.
I know, I said "it appears" a bunch of times. This is because all of this is just from my point of view, so far. I still have to share the link of the page I'm preparing. As soon as I can, I'm going to ask a colleague to take a look at it.
What I want anyway is for the page displaying the images to be visible only to people within my institution, for the time being.
However, now I'm at least able to see the images hosted in drive, in my website.

How to Change Search Text that Google shows for my Joomla Site

Hi, I have built a Joomla site that has been live for a couple of months and is working completely fine. One thing I came across last week is that when I search Google for 'MyCompany', it brings up the link for my site, but underneath the link, rather than a brief text about the company, all I see is the names of the banner images that have been used on the site.
Unfortunately I have never really delved into the SEO side of the website, but I would really like to know what I need to do on the site to get this fixed. I would really appreciate it if somebody could please advise.

Website with a very slow load time on every page

http://www.puppykisses.org/
I made a WordPress page for a client, and for some reason it is taking over a minute to load the page. The only thing I could think of as the problem is the number of photos that he inserted into the slider at the top of the home page. It looks like all those pictures need to load before anything else shows up. But then I click on Contact or any other page that has no real images to speak of, and the problem is still there. Just wondering if anyone can point me in the right direction to fix this. Thanks!
As @David said, it's the initial request (the source for the page) that is giving you the issues. This means it is unlikely to be an issue with hosting, and most likely an issue with your code. I would go through any plugins you have installed and disable them one by one, then slowly start commenting out your own custom dynamic code bit by bit, until you see what is taking so incredibly long. Then rewrite or excise that code from the site.
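To make the disable-one-plugin-at-a-time approach measurable, it may help to time the initial HTML request on its own, without images or scripts. The following is a small sketch under stated assumptions: time_to_first_byte is a name made up for this example, and the opener parameter exists only so the timing logic can be exercised without touching the network.

```python
import time
import urllib.request


def time_to_first_byte(url, opener=urllib.request.urlopen):
    """Return (seconds until the first body byte, seconds until fully read)."""
    start = time.perf_counter()
    with opener(url) as response:
        response.read(1)  # blocks until the server starts sending the body
        first_byte = time.perf_counter() - start
        response.read()  # drain the rest of the HTML document
        total = time.perf_counter() - start
    return first_byte, total


# Example call (performs a real HTTP request, so it is left commented out):
# ttfb, total = time_to_first_byte("http://www.puppykisses.org/")
# print("first byte: {:.2f}s, full HTML: {:.2f}s".format(ttfb, total))
```

Running the example call before and after disabling each plugin gives a number to compare. If the first value is large, the delay is in generating the page on the server rather than in downloading images.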
Start With the Basics
Keep the number of WordPress plugins you use to a minimum
Get a Proper Hosting Provider
Remove Unnecessary Code From WordPress Header -> http://goo.gl/yfRcF
Use Firebug and click the Network tab to check the loading time of each file
Check Suggestion how to improve website speed -> http://goo.gl/FtiX3
Install WP Super Cache plugin -> http://wordpress.org/extend/plugins/wp-super-cache/
*If you use a gallery, try to load image thumbnails rather than the full-size images

About search engines: how do they take screenshots of web sites?

This may be a dumb question, but I really have no idea and I'm utterly curious! So please bear with me.
What I know is that search engines just read the HTML and words in a site. They usually ignore CSS, or part of it. They arguably cannot read images. Or can they?
If they really cannot read those, or choose to ignore them, then my question is how they make a screenshot, which is the page presented just the way the CSS lays it out, images included.
If they do not read CSS or images, and they also do not open the page on a screen the way a human being does, how do they make the screenshot?
Thanks!
Are you referring to Google's new screenshot feature, or their old cache feature? Your question is talking about screenshots and doesn't mention the cache at all, but your comments on your question seem to imply that you're referring to the cache, not the screenshots.
In the case of the screenshots:
You are correct in that search engines usually only read the HTML and text on a website, because that's all they need. But that doesn't mean they can't.
When they want to take a screenshot of a site, they'll just do exactly what a normal browser does when a user visits the site. Download the website, the CSS, the images, and everything else, and render it with the rendering engine of a web browser, such as WebKit.
In the case of the cache:
The search engine usually just stores the HTML without/before parsing it. It sends the saved HTML to your browser, and your browser pulls all the other stuff in the page (images, etc) from the original website. The search engine isn't reading anything, it's just saving the page verbatim (well, with minor changes, namely URL rewriting), and giving it to your browser.
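The URL rewriting mentioned for the cache case can be pictured as turning relative links into absolute ones, so that a saved copy served from another host still pulls images and CSS from the original website. This is a simplified Python sketch and not how any particular search engine implements it; the regular expression only handles double-quoted src and href attributes, and the sample page and example.com address are invented.

```python
import re
from urllib.parse import urljoin

# Matches src="..." or href="..." attributes (double-quoted only, for brevity).
_URL_ATTR = re.compile(r'\b(src|href)="([^"]*)"')


def rewrite_urls(html, page_url):
    """Make relative src/href values absolute so a saved copy still loads assets."""

    def absolutize(match):
        attr, value = match.group(1), match.group(2)
        return '{}="{}"'.format(attr, urljoin(page_url, value))

    return _URL_ATTR.sub(absolutize, html)


# Hypothetical saved page and the address it was fetched from.
saved = (
    '<link href="/style.css">'
    '<img src="images/logo.png">'
    '<a href="https://other.example/">x</a>'
)
print(rewrite_urls(saved, "http://example.com/blog/post.html"))
```

Links that are already absolute pass through unchanged, while the stylesheet and image paths are resolved against the original page address.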
There are apps that take screenshots of pages as they would be displayed in a chosen browser.
Browsershots is an example of an online service that does this.
Here are some links and projects of webpage thumbnail generator:
Build your own website thumbnail generator with Django (Python)
Zubrag Website Thumb Generator (PHP)
Maybe I'm not understanding your question, but...
You seem to be using "read an image" to mean loading the data from the image into the search engine. This the search engine does do (including CSS). When people say search engines ignore images, they mean that the engine doesn't see them as meaningful searchable data. In other words, if I make an image that has the word "Hello" on it, you and I "read" it in the sense that we see and understand that the image contains a word. A search engine typically will not attempt to do this. It will, however, "read" the image into its storage if it wants to be able to present it to a user at a later time.
Search engines don't use the CSS and image content for indexing, but they can store them on their servers to make a cached version of the site.
In the case of Google, I think they store only text files, so HTML, CSS, maybe JavaScript, but no images.
