I'm trying to automatically process the results of a google similar image search. I've noticed that some results come back with "x-raw-image:///<48 bytes or so of data>" in the URL for the image. I think this is happening when the search is extracting an image from a PDF document. Does anyone know what that data is? Is it anything that is usable outside of google? I was hoping that it might be similar to a "data:image" reference, with the image data embedded, but that might not be the case.
Thank you.
ETA: Here is a random example I found that I hope works for everyone:
https://www.google.com/imgres?imgurl=x-raw-image%3A%2F%2F%2F79da01aca79b556defc81ee719442327379e067a26c5c5c9ec104fc39fb70177&imgrefurl=https%3A%2F%2Farxiv.org%2Fpdf%2F0706.2165&tbnid=bVWbp6wB2nmBiM&vet=12ahUKEwiD06O97dXuAhWUUM0KHXp7Co0QMygCegQIARAy..i&docid=nD--BiC0InIh_M&w=1600&h=2071&q=physics%20papers%20diagram%20pdf%20x-raw-image&client=firefox-b-1-d&ved=2ahUKEwiD06O97dXuAhWUUM0KHXp7Co0QMygCegQIARAy
This is the frame you get when you click on one of the results in an image search.
Related
Backstory to the below issue:
I'm using the jQuery plugin Cropit to produce an image which I get in data URL form (the user uploads an image and Cropit allows them to manipulate it, when the user is happy, Cropit exports the final image).
This data URL is attached to the product (this is a Shopify website) via Shopify properties (in a similar way you would attach text for an engraved product) and then when the order is created, I have an app listening for new orders and I pull the data URL from the order.
From testing, I can confirm that the data URL is wrong / corrupted / broken at the time the order is placed and not being broken in transit.
Original Question
I have a bit of a weird situation and I can't find any similar situations online.
I'm being sent an image in data URL format (from Shopify if it's relevant, I have written a private app and their webhook is sending me an image)
The image is in a data URL format that starts with, as an example,
.....
The problem I am having is sometimes (and it's maybe less than 10% of the time) when I get the image and try to print it, it's missing the bottom chunk of the image. In a PDF, it considers the image corrupt, and in a web browser, it just sees the bottom of the image as transparent, however much is missing.
This is what it looks like in Inspect Element on Google Chrome when you hover over the image URL (image has been purpled out for anonymity)
My question is, does anyone know why?
We can't find a correlation with browser or device type. And I'm not sure if it's because part of the data URL is somehow missing (maybe a character limit, because it's a really long string!) or if it's the type of image. Might possibly be something going wrong in the upload process?
Is anyone able to shed any light? It's such a weird issue I'm not even sure what to google!
And just to confirm, the image absolutely has to be sent in this format for a whole series of reasons, mainly Shopify restrictions so I can't send the image in file format.
I was copying image links from google and I'm seeing more and more urls like this. What exactly is going on here and why are developers doing this? Heres a example.

That is the image, stored as a base64 encoded string.
So they're not giving you a link to the image, and are instead giving you the image data directly.
Copy and paste everything after the first comma into e.g. https://www.base64decode.org/ and you'll see the picture.
This article gives a very detailed breakdown on why someone might choose to do so. One of the main reasons is to cut down the number of requests to your webserver. (clicking that link did not make a request to google, as you had already downloaded the data)
this image is just encoded using base64.
why people do this ? depend on the project. This way your page will be a bit heavier, but your image will be inside the page. If you use an URL, to load your image the browser will need to make a call.
Maybe the best call to do that, is this way you don't depend on another website to save and keep your image
Some time ago (about a year and half) I found an image on google, having the following url in the search engine archive:
http://t0.gstatic.com/images?q=tbn:ANd9GcQhMUKiipOilBrW3xjrD7FCuQ4M9jCEWyQXmv0d3-pgHIVeYHJwTHvkW-1_yg
Now you can see it can't be found (1x1 pixel image means that).
Is there a way to decode the tbn value and recover the original image name or query or anything helpful to find another copy of the same image?
You can't. That's the whole point of it. When a user requests an image removal or Google decides that for some reason an image must be removed, then the image is being deleted from the Google cache, so you can't find the image because it got deleted.
Edit: I understand that you reaaally want to get back that image but it is deleted from the Google cache. There is no way to recover it using info in the tbn value.
If you know someone who may have recently requested that image through his browser, you can search with the url in their browser's cache, using this method.
I am interested in how google have done it's image search using an image. I am doing a project on image comparison and searching using images, and considering how impressive google image search using image seems to be, I am curious what they have done.
I also wonder how much they do actually image comparing and how much they use like keywords around images etc.
I've looked around, this question has been asked before but i would like to know if it was possible to do so, maybe without using an API, or using another search-engine.
I would like to check the web for images, by searching with an image.
Is there any way to do so, if so can anybody point me in the right direction, i just can't seem to get started.
You can do this currently with Google. Drag and drop the image into the search field.
Just go to:
http://images.google.com
and drop your image over the search field.
Google used to have some images below the search field that you could drag and drop, as examples.