Web Crawling using import.io - xpath

I am trying to crawl the following website, https://goo.gl/THqDhD, using the import.io tool. I used the connector tool to parse the whole search result for a specific query (including pagination) and successfully selected all the rows in the search result, but I was unable to select each item's image box as a column.
import.io allows a manual XPath override for the selection, so I tried to select the images in the search results using the following XPath:
.//*[@id='container-inner']/div[3]/div[4]/div[*]/div[1]/div/a/
which should represent the column of the table, but I got the following error:
What you have selected is not within a result
The "result" here is the previously selected rows, but I inspected the item box and made sure that the selection is inside them. Any help, please?

Related

Not finding how to use a search result term in link attributes in oracle apex 22.2 search page

I'm using APEX 22.2, which comes with a new feature: the search page, with its attached search configurations. I am implementing this in a project and am having trouble using the link function the way I would like. The desired result is that, after searching for something that comes up in the search configuration, the link grabs the primary key (in this case 'project_ID') and passes it into the 'proyect_Id' field on my target page, so that I land on the correct page instead of the generic template page with no data. I understand how to do this with any other type of link, but with search configurations I can't find any item that holds what I just searched for or selected from the search. If anyone knows how to do this, or a workaround that helps in my case, it would be much appreciated!
I've tried browsing the items that I can set as the value, using various &PRIMARY_KEY and &PROJECT_ID type values, and setting the value into an item, but none of them have worked. I also can't find much information on this.
Using &PROJECT_ID. (with a dot at the end) as the Value in the Link dialog should work to pick up the selected project.

Web Scraping returning empty data table UiPath

I'm using Data Scraping to scrape product information (i.e. product name, URL, price, model) from a shopping website.
When I search for a product, I want to scrape the data of whichever item comes first, so I have set the maximum number of results to 1. The problem is that it sometimes returns an empty data table, and I cannot figure out why.
My guess is that if the current search result matches the elements I selected in the Data Scraping wizard, it returns the data table, and if it doesn't match, it returns an empty data table.
For example, while I was selecting elements in the Data Scraping wizard, the search results were Samsung monitors. When I ran the project and searched for Dell monitors, it returned a data table, but when I searched for Samsung series or Dell series, it returned an empty data table. What is wrong with this?
You need to say what output you actually need.
But if your output is empty, the reason is most likely one of the following:
Make sure the timeout is high enough; set it to 30000 if you are unsure.
Set a robust selector that still works even if the website changes for some reason.
For me it works properly with a sufficient timeout and a flexible selector containing a *.
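To illustrate why a rigid selector can come back empty while a wildcard one does not, here is a rough sketch in Python. It is not UiPath code: the result titles and both patterns are invented for illustration, and `fnmatchcase` only stands in for the `*` wildcard in a selector attribute.

```python
from fnmatch import fnmatchcase

# Hypothetical title attribute of the first search result for three queries.
first_results = {
    "samsung monitor": "Samsung 24-inch Monitor",
    "dell monitor": "Dell 27-inch Monitor",
    "dell series": "Dell UltraSharp Series",
}


def queries_matched(pattern):
    """Return the queries whose first result matches the wildcard pattern."""
    return [q for q, title in first_results.items() if fnmatchcase(title, pattern)]


rigid = "*Monitor"  # assumed pattern captured while browsing monitors
flexible = "*"      # assumed pattern that accepts any title

print(queries_matched(rigid))     # queries that would return a row
print(queries_matched(flexible))  # every query returns a row
```

With the rigid pattern, the "series" search has nothing to match, which would look like an empty data table.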

I want to write Relative Xpath for two Web Elements

I want to create relative XPaths for the following web elements.
Since it's a very old application, it's difficult to verify my XPaths in the browser console.
I want to reach the text under the "Owner" column of the table.
I want to reach the second option, "Professional Liability", in the table.
Try the following XPaths.
XPath for Owner:
"//td[@sortfield='owner']"
XPath for Professional Liability:
"//select[@name='AVAILABLE_COVERAGES']/option[contains(.,'Professional Liability')]"
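If the browser console is not usable, one way to sanity-check these expressions offline is against a saved copy of the markup. The sketch below uses Python's standard `xml.etree.ElementTree`; the HTML snippet is made up, since the real page is not shown. ElementTree supports only a subset of XPath and has no `contains()`, so the text test for the option is done in plain Python instead.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup; the real page structure is an assumption.
PAGE = """
<html><body>
  <table>
    <tr><td sortfield="name">Policy A</td><td sortfield="owner">J. Smith</td></tr>
  </table>
  <select name="AVAILABLE_COVERAGES">
    <option>General Liability</option>
    <option>Professional Liability</option>
  </select>
</body></html>
""".strip()

root = ET.fromstring(PAGE)

# Equivalent of //td[@sortfield='owner']
owner_texts = [cell.text for cell in root.findall(".//td[@sortfield='owner']")]

# Equivalent of //select[@name='AVAILABLE_COVERAGES']/option[contains(., ...)],
# with the contains() step replaced by a Python substring check.
options = root.findall(".//select[@name='AVAILABLE_COVERAGES']/option")
matching = [o.text for o in options if "Professional Liability" in (o.text or "")]

print(owner_texts)
print(matching)
```

A full XPath 1.0 engine would accept the answer's expressions unchanged; the manual filter here is only a workaround for ElementTree's limits.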

Empty filter results gives #VALUE! error

I've been working on a project where I need to consolidate data from two other sheets within the spreadsheet and filter the result for easy viewing. However, I noticed that when the filter returns no results, I get a #VALUE! error. The error persists even when I use IFERROR.
Link to the sample of the Google Spreadsheets.
There are two classes, and I wish to pick out those who passed in each class and populate the table in the collated sheet.
You should be able to do something like this:
=query({Class1!A2:C; Class2!A2:C}, "where Col3 = 'pass'")
(Change the sheet names and ranges to suit.)
(Also check the formula I entered in A1 of sheet 'JP'.)
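For anyone who finds the formula hard to read, this is roughly the logic it expresses, sketched in Python rather than Sheets: stack the rows of both class sheets, then keep the rows whose third column is 'pass'. The sample rows and the column layout (name, score, result) are assumptions, not data from the linked spreadsheet.

```python
# Hypothetical rows standing in for Class1!A2:C and Class2!A2:C.
class1 = [("Ann", 72, "pass"), ("Ben", 40, "fail")]
class2 = [("Cat", 55, "fail"), ("Dan", 90, "pass")]


def passed(*sheets):
    """Stack the rows of all sheets, then keep rows whose third column is 'pass'."""
    stacked = [row for sheet in sheets for row in sheet]  # like {Class1; Class2}
    return [row for row in stacked if row[2] == "pass"]   # like where Col3 = 'pass'


print(passed(class1, class2))
print(passed([("Eve", 30, "fail")]))  # no matches gives an empty list
```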

Extract href in table with importxml in Google spreadsheet

I am trying to pull the href for each row of each table from this website:
http://www.epa.gov/region4/superfund/sites/sites.html#KY
I can pull the table information off using =IMPORTHTML(A1,"table",1) for all 7 tables, but I need the href to the site with the detailed information.
Using =IMPORTXML(A1,"//div[@class='box']") I can pull the information needed from a site like:
http://www.epa.gov/region4/superfund/sites/fedfacs/alarmyaplal.html
but I need to extract the fedfacs/alarmyaplal.html portion for each row on the original page.
I've tried using //@href, but it is not returning any results. I think this is because the data is structured in a table, but I'm stuck on where to go from here.
I'm not sure about any of the Google Spreadsheet functionality, but here's an XPath to select all href attributes of the Kentucky sites (since your first link included the 'ky' anchor):
//body//a[@id='ky']/following-sibling::table[1]/tbody/tr/td[1]/strong/a/@href
This is very specific to the Kentucky table: following-sibling::table[1] means the first table node after, and at the same level as, a[@id='ky'].
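To show what that expression selects, here is a small sketch in Python using only the standard library. The markup is invented (the real page may be laid out differently), and because `xml.etree.ElementTree` does not support the following-sibling axis, that step is done by hand: find the anchor, then take the first table among its later siblings.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup; element nesting and hrefs are assumptions.
PAGE = """
<html><body><div>
  <a id="al"/>
  <table><tbody>
    <tr><td><strong><a href="al/site-one.html">AL site</a></strong></td></tr>
  </tbody></table>
  <a id="ky"/>
  <table><tbody>
    <tr><td><strong><a href="ky/site-one.html">KY one</a></strong></td><td>City</td></tr>
    <tr><td><strong><a href="ky/site-two.html">KY two</a></strong></td><td>City</td></tr>
  </tbody></table>
</div></body></html>
""".strip()


def hrefs_after_anchor(root, anchor_id):
    """Return hrefs from the first <table> sibling that follows <a id=anchor_id>."""
    for parent in root.iter():
        children = list(parent)
        for i, child in enumerate(children):
            if child.tag == "a" and child.get("id") == anchor_id:
                # Manual stand-in for following-sibling::table[1]
                table = next((s for s in children[i + 1:] if s.tag == "table"), None)
                if table is None:
                    return []
                links = table.findall("./tbody/tr/td[1]/strong/a")
                return [a.get("href") for a in links if a.get("href")]
    return []


root = ET.fromstring(PAGE)
print(hrefs_after_anchor(root, "ky"))
```

Changing the anchor id should pick up the table for another state in the same way, assuming each table is preceded by its own anchor.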
