It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 10 years ago.
I want to scrape reviews about various products and things in the web, how can I do that. There is a company called searchreviews.com, they do it, I want to know how they do it.
They get a page's HTML then parse it, targeting whatever information they need.
It's really awful, because it depends on the DOM of the site you're scraping, which can change at any time, in both trivial and complex ways. I've worked with companies that have scraped (legitimately) various types of sites, and it's horrible.
mechanize or watir or rautomation are related gems that might help you here.
I've done this very often for various clients, and most of the time a site that gathers reviews is pretty well structured, so scraping isn't too hard. Look at Yelp.com for example. I built a routine in screen-scraper that searched zip codes in the client's area, used the filters to hone in on the desired business types, and makes a list of unique results (since the zip code searches could render duplicate results). From there I hie each unique URL. The reviews are pretty easy to parse with just RegEx, and some page iteration.
Related
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 9 years ago.
I am searching for an elegant library/spark for CodeIgniter that offers you the option to set pagination, filtering and sorting without to much hassle and independent from the database query.
I am already using a pagination library made by me a while ago, very easy to use, but I'm searching for something that extends to sorting and filtering as well.
Any resource is much appreciated.
UPDATE: I have actually found a great solution that works in Codeigniter, but not limited to. It supports ajax, server-side procesing, individual column filtering (which I needed by the way) and everything you need regarding pagination, sorting and filtering: DataTables
I also found an implementation with CI on Github, maybe this will be helpful to somebody: https://github.com/blake-nouribekian/codeigniter-datatables/blob/master/index.php
Try Carbogrid. This is so cool and easy to integrate
You can try these both are good
JQgrid or ExtJs
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 9 years ago.
So far in my use of objective c and cocoa, I have made some simple OSX apps that make use of some of the basic apple UI elements such as NSButton, NSTextFeild, NSView, etc. However, now I find that I'm wanting to add parts to my programs that don't fit so squarely within the given UI elements. ie. in a program I'm writing now, I want to create a checklist in which users can enter new elements, check off elements to delete them, etc. all with a custom UI. My problem right now is that I don't even know where to start looking for the information I need to learn how to create more advanced things like this, or what to search for to start finding information. Could anyone set me on the right track as to what information I should be looking for and where I can go to find it? Thanks.
You'll probably want to look at Cocos2d or even OpenGL. I'm guessing though that this may be overkill for your solution. You might also be able to override or extend some of the base UI elements.
If you're new to Objective-C and Mac OS programming, I would recommend taking it a bit slower with this. Often we developers dream up huge complex interfaces when really, just a basic, stock interface would work just fine for our users.
For your first release, keep it simple and just ship some code in short time. Then work with your users and see how they really want to work with your app and go from there.
Good luck!
db
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 10 years ago.
I want to digitise some places of my city in ArcGIS. I found ArcGIS 'World Imagery' layer which I can use it as a basemap for digitisation. But the problem is it works only while the internet is connected. Is there anyway to save the cache of the layer at a particular zoom level so that I can work on it offline even after I restart the computer.
If no, please tell me any other way- may be with google or bing imagery, I need to digitise important features of the city. So i require resolution at house level.
A quick and dirty way would be to take a screen shot, save it, add it as data, then georeference it with the "Georeferencing" toolbar. I'm not sure how to save a cache or how much memory that might take. If you are working at a fixed scale and can find the imagery you need (like google or bing), then take a screenshot or maybe use a program like Snagit (if you have it) to capture the extent you want at the best resolution.
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 10 years ago.
I've a book review site, where readers can write reviews about books, other users can post comments. I wanted to know following things automatically whenever new review publish or new comment published.
(1) whether book review is positive or not? How much % positive / negative?
(2) whether comment made by particular user is positive or not? How much % positive / negative?
(3) I want to read Tweets about particular book and wanted to check whether the tweet is positive or not?
bottom line, I want some tool suggestions (opensource), which I can use for my website. Website is written in PHP and I'm looking for some semantic analysis tool which I can customize to meet my need or which best fit my need.
if not, I want to know if its easy to build one with minimal requirements. I know PHP, Perl, Shell Script. I can learn Python. I know C++, Java may be right language to start from scratch; but don't have much experience.
There is an open source semantic analyses engine incubated in the Apache Software Foundation, currently, called Stanbol. It provides APIs to interface with it over HTTP as well as through a Java API if needed. It's pretty advanced, but generally speaking if your needs are simpler you can always try some SaS solution like uClassify.
In response to your first request, I'd suggest you create a form where the user has a voting option (such as a x/5 star rating, etc) then you would calculate the average from all of the reviews.
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 11 years ago.
I want to implement a "text" suggestion.
I have a huge number of data, how I can implement an efficient and scalable auto-complete?
Edit 1:
i have a mysql table with one client per row and a 'name' column, i want to create a suggest in order to search client name (like google suggest but instead of queries it is client name) - I have a huge numbers of rows, how I can design an efficient suggest?
When user will start typing inside an "input text", I want to display possible client names
OK, I think I understand what you're looking for and here are some possible solutions for you:
What is the best autocomplete/suggest algorithm,datastructure [C++/C] (the answers are generic enough despite the fact that it's a C/C++ question)
How to implement autocomplete on a massive dataset
Autocomplete using a trie
Algorithm for autocomplete?
Trie based addressbook and efficient search by name and contact number
How do you autocomplete names containing spaces?
Essentially, it seems like you're looking for auto-complete functionality (if I understood your question correctly). Along those lines, the above questions and their answers also provide a lot of references on how to do more complex suggestions (i.e. based on content, semantics, intent, etc.).
This will probably not address your question if you're looking for an algorithm that makes "related" suggestions, e.g.:
"water" may suggest kool-aid, gatorade, vitamin water.
"sea" may suggest ocean, lake, river