I know Google Image Search is very powerful thanks to its face recognition algorithm.
Is there any way to process or alter images so that Google cannot search for or detect them?
I tried converting images (from famous movies such as Batman) to black and white, reducing the resolution, and taking a "print-screen" of each scene from the movies, but none of this worked. Google still found all of them.
I assume that Google behaves the same way for text and image crawling.
The easiest way should be to define a robots.txt for your site.
This file can look like this:
User-agent: *
Disallow: /images/
With this in place, Google will not crawl anything in or below /images/. Besides Google, every other well-behaved search engine crawler will also honor your robots.txt.
See details about this file here.
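If you want to check what such a file actually blocks before deploying it, here is a minimal sketch using Python's standard urllib.robotparser. The example.com URLs and the is_allowed helper are made up for illustration, and the parser only approximates how a well-behaved crawler reads the rules.

```python
from urllib.robotparser import RobotFileParser

# The same rules as in the robots.txt shown above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /images/
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical URLs, only used to exercise the rules.
    for url in (
        "https://example.com/index.html",
        "https://example.com/images/batman.jpg",
    ):
        print(url, is_allowed(ROBOTS_TXT, "Googlebot", url))
```

Anything under /images/ should come back as not allowed, while the rest of the site stays crawlable.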
Google had a beautiful API which you can use to search for large images, but unfortunately they decided to disable it. Now you can use their "custom search engine", but it doesn't get even close to what that old API could do. For a start, the results you get are not the same as if you search in the common search page with your browser, and you can't specify the size of the images you are searching for.
Is there any programmatic way to get a list of the URLs of the images I would find on the regular Google search page, size included?
You can scrape the Google image search results and parse out the links to the images. The urllib2 library in Python can help you here.
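A rough sketch of the parsing half, assuming you already have the HTML of a results page as a string (in Python 3, urllib2 became urllib.request). The markup of Google's result pages is not documented and may change, so this just collects every img src it finds; ImageLinkParser and extract_image_urls are names I made up for the example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImageLinkParser(HTMLParser):
    """Collects the src attribute of every <img> tag it sees."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.image_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src:
            # Resolve relative links against the page URL.
            self.image_urls.append(urljoin(self.base_url, src))


def extract_image_urls(html, base_url):
    """Return absolute URLs for all <img> tags in html, in document order."""
    parser = ImageLinkParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.image_urls


if __name__ == "__main__":
    # Hypothetical page content standing in for a fetched results page.
    page = '<html><body><img src="/thumbs/a.jpg"><img alt="no src"></body></html>'
    print(extract_image_urls(page, "https://example.com/search"))
```

You would still need to fetch the page yourself and filter out logos and other non-result images.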
I'm trying to find out whether there is a way to do Google similar-image searches via an API.
I know the image search API is deprecated, but is it still usable?
https://developers.google.com/image-search/
Also... It seems that you can do image searches with the custom search api but I can't seem to work out if a similar image search is possible.
http://thenextweb.com/dd/2012/02/14/googles-custom-search-api-now-supports-image-only-results/
Any leads or advice on working this out would be appreciated.
Thanks!
If you have a URL for a hosted image (using Dropbox, imgur, etc), the answer at https://stackoverflow.com/a/15134958/116891 shows you how to find similar images. Basically,
http://images.google.com/searchbyimage?image_url=YOUR_HOSTED_URL
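One detail that is easy to miss: the hosted URL has to be percent-encoded when it goes into the query string, otherwise any ? or & inside it gets mixed up with the outer parameters. A small sketch with the standard library (the search_by_image_url helper and the example URL are only for illustration):

```python
from urllib.parse import urlencode

SEARCH_BY_IMAGE = "http://images.google.com/searchbyimage"


def search_by_image_url(hosted_image_url):
    """Build the search-by-image link for an image that is already hosted."""
    # urlencode percent-encodes the value so it survives as one parameter.
    return SEARCH_BY_IMAGE + "?" + urlencode({"image_url": hosted_image_url})


if __name__ == "__main__":
    # Hypothetical hosted image.
    print(search_by_image_url("https://example.com/a b.jpg"))
```

This only builds the link; the result is still a normal HTML page meant for a browser.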
The image search API is deprecated.
But I needed the similar-image results in JSON format.
So I tried Google CSE, but it does not support searching for similar images.
It just displays image search results on a custom domain.
Another method I tried is the URL approach:
http://images.google.com/searchbyimage?image_url=YOUR_HOSTED_URL
But this is not the solution I need, because it can only be used in the browser. I need JSON.
In the end, I decided to use Google's Vision API.
It is very simple.
https://cloud.google.com/vision/
You can try it at the top of that page.
First, open the URL.
Second, upload your image file under "Try API".
Third, click the "JSON" tab in the results.
You will see JSON describing similar images.
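To get the same JSON from code rather than from the demo page, the rough idea is to POST a web detection request to the images:annotate endpoint. Below is a sketch of the request body and of pulling the similar-image URLs out of the response, based on my understanding of the REST API. Passing the image by URL instead of uploading it is my assumption, the helper names are made up, and authentication (API key or credentials) and the actual HTTP call are left out.

```python
import json

# REST endpoint for the Vision API annotate call (authentication not shown).
ANNOTATE_ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_web_detection_request(image_url, max_results=10):
    """Build the JSON body asking for web detection on a hosted image."""
    return {
        "requests": [
            {
                "image": {"source": {"imageUri": image_url}},
                "features": [
                    {"type": "WEB_DETECTION", "maxResults": max_results}
                ],
            }
        ]
    }


def similar_image_urls(response):
    """Collect the visually similar image URLs from a parsed response."""
    urls = []
    for result in response.get("responses", []):
        web = result.get("webDetection", {})
        for image in web.get("visuallySimilarImages", []):
            if "url" in image:
                urls.append(image["url"])
    return urls


if __name__ == "__main__":
    # Hypothetical image URL; print the body you would POST.
    body = build_web_detection_request("https://example.com/batman.jpg")
    print(json.dumps(body, indent=2))
```

The "JSON" tab on the demo page shows the same kind of response, so you can compare its structure against what similar_image_urls expects.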
I have a website for book reviews. I offer a link to the Amazon entry of the books. I discovered after a bit of research that the direct URL for Google's "I'm Feeling Lucky" is:
http://www.google.com/search?hl=en&q=TITLE+AUTHOR+amazon&btnI=745
This works like magic because I don't have to manually store the Amazon link in my database, and it links directly to the Amazon page (it works 99.99% of the time).
I was wondering if there was an equivalent for images (whether Google or some alternative) to retrieve an image URL based on keywords only (for the purpose of getting the book cover image).
There's no such thing for Google Images, but you might be able to use another web service to do what you want. I noticed that when you search for a book, the first image result isn't always its cover. Sometimes it's a photo of the author, sometimes it's an image from a review of the book, so you can hardly rely on that.
It should not be hard to parse the Amazon page and get the image and link, but Google has a Google Books API that returns all the information about a book in JSON format. You can try it online in the API Explorer (the covers are in the results too). Click here to see an example (click "Execute" to run it).
Unfortunately, the public Google search engine doesn't support that. You should use the Custom Search API to implement such a feature in your application. Alternatively, use XGoogle (an unofficial Python wrapper for Google Search services; see the google_dl tool for an example).
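For the Google Books route, a minimal sketch of building the query and reading the cover link out of the JSON, assuming the volumes endpoint and the imageLinks field work as I remember them. The function names are mine, and the HTTP request itself is left to whatever client you use.

```python
from urllib.parse import urlencode

BOOKS_ENDPOINT = "https://www.googleapis.com/books/v1/volumes"


def books_query_url(title, author):
    """Build a volumes search URL restricted to a title and an author."""
    query = "intitle:{} inauthor:{}".format(title, author)
    return BOOKS_ENDPOINT + "?" + urlencode({"q": query, "maxResults": 1})


def cover_url(response):
    """Return the first cover thumbnail in a parsed response, or None."""
    for item in response.get("items", []):
        links = item.get("volumeInfo", {}).get("imageLinks", {})
        url = links.get("thumbnail") or links.get("smallThumbnail")
        if url:
            return url
    return None


if __name__ == "__main__":
    print(books_query_url("Dune", "Herbert"))
```

Not every volume has imageLinks, so it is worth handling the None case with a placeholder cover.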
Other suggestions:
YQL by Yahoo (see yql-tables repo at GitHub for examples).
Use alternative search engines.
E.g. in Wolfram Alpha you can type "show image of laptop" and it will give you the first popular picture; however, you need to use the Wolfram|Alpha APIs or some script (see this ChatBot for example) to pick up the direct link.
I'm using Google and Bing image search APIs to provide a way for users of my web app to search for images to include in the documents they create in the app. A (rare?) problem I encountered today: a result from either Bing or Google (I'm going to assume Bing) caused the Google Chrome Malware detector to go off.
Is there any good way to avoid this that I'm not aware of, aside from only using the Google Image API (which is being deprecated!) since I assume they filter out results from sites they think contain malware?
There doesn't seem to be any performant way on my end to check these results before displaying them to prevent this error from occurring, and I'm very worried that any less savvy computer users will think my site is at fault (not to mention being unable to make the warning go away).
I guess I'm also making the assumption here that images from random Internet sites are okay to include in the page as long as they are returned by these APIs...I do copy them over to our own S3 account a few minutes after they are added to the document in case they are changed/removed on the external site...
EDIT: The result is indeed being included from the Bing API, and it is from thefatlossauthority.com.
I would prefer a solution based in Ruby, but given a general solution I'm more than willing to implement it myself.
I am trying to get Music Artist images and Album Art information from Wikipedia. Has anyone tried this before?
I searched around and found a few links with relevant information:
http://commons.wikimedia.org/w/index.php?title=Special:Search&ns0=1&ns6=1&ns12=1&ns14=1&ns100=1&ns106=1&redirs=0&search=buitenzorg&limit=10&offset=0 - search images for buitenzorg
http://www.mediawiki.org/wiki/API:Allimages - All images API call to fetch image urls from wiki
But I could not find a way to download all the images from the Music category and run deltas for updates after the initial download. Any pointers to a solution would be helpful.
I searched everywhere and finally went with the implementation described below.
Here's the api documentation: http://www.mediawiki.org/wiki/API:Main_page
Call the Wikipedia API for the artist page.
Look at the infobox shown on the right, and get the image name.
Make another Wikimedia API call to get the image download URL.
Download the actual image.
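The second and third steps can be sketched roughly like this, assuming the standard MediaWiki imageinfo query (prop=imageinfo with iiprop=url). The helper names and the example file title are made up, and the HTTP requests and the download itself are left out.

```python
from urllib.parse import urlencode

# English Wikipedia API endpoint; other wikis use the same api.php path.
API = "https://en.wikipedia.org/w/api.php"


def image_info_url(file_title):
    """Build the query that asks for a file's download URL and upload time."""
    params = {
        "action": "query",
        "titles": file_title,  # e.g. "File:Example.jpg"
        "prop": "imageinfo",
        "iiprop": "url|timestamp",
        "format": "json",
    }
    return API + "?" + urlencode(params)


def image_download_url(response):
    """Return the first file URL found in a parsed imageinfo response."""
    pages = response.get("query", {}).get("pages", {})
    for page in pages.values():
        info = page.get("imageinfo")
        if info:
            return info[0].get("url")
    return None


if __name__ == "__main__":
    print(image_info_url("File:Example.jpg"))
```

The timestamp returned next to the URL is the time of the latest upload of that file, which is handy to store alongside the downloaded copy.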
For deltas, the only way is to hit the Wikipedia API again with the image name and check for updates.
The other workaround is to set up your own MediaWiki server, which will handle deltas.
See here http://www.mediawiki.org/wiki/Manual:Installation_guide