parsehub is a data-extraction platform for static and dynamic data web-sites. The platform has free and paid tiers of service. Templates can be written that will select elements, extract data to CSV/JSON, and interact with elements in the page. Multi-page navigation is possible. There is an API to gain access to the platform's capabilities for user projects.
Questions tagged [parsehub]
27 questions
2
votes
2 answers
Regex: Keep last word after text
Tommy Hilfiger Men Teal Blue Analogue Watch TH1791293_BBD
I need to split this and keep the last part i.e
TH1791293_BBD
The issue is that the part before the target string i.e
Tommy Hilfiger Men Teal Blue Analogue Watch
Can be of varying…
Ishaan Shakunt
- 53
- 1
- 6
2
votes
2 answers
Parsehub API PHP
How can I dump the results into a MySql db?
Specially decode gzip and parse it to an HP array then dump into a db.
Arun Prasan
- 31
- 6
2
votes
0 answers
Pasehub: Extract data from the event url
Need to extract data from the url that is provided as a link while parsing a template.
I have created a template for a website and it is working fine. But the link that is provided with each set of extracted object, I need to go to that particular…
Syed Asad Abbas Zaidi
- 848
- 1
- 15
- 28
2
votes
2 answers
ParseHub Webhook with RoR
Parsehub provides the webhook feature. But currently I'm testing my Rails app locally. So how could I provide the webhook url for a project on Parsehub to point to my local server or any specific method in my controller.
Parsehub Doc…
Syed Asad Abbas Zaidi
- 848
- 1
- 15
- 28
2
votes
1 answer
Parsehub main_template and renaming
So using the parsehub tool to experiment with data-scraping and wondering if there is a rule to keeping the main_template name which is automatically given to all projects. Is it possible to change it and what is the significance of the name and the…
Shawn Mehan
- 4,274
- 9
- 28
- 48
1
vote
1 answer
Parsehub website elements only display on a certain date
I'm using Parsehub to scrape certain data from certain pages on a website into a google spreadsheet.
The issue I'm having is that a certain html element only displays on a certain date and I'm wondering if there is a way to set it up so Parsehub…
Andrew McAleenan
- 31
- 1
1
vote
1 answer
Parsehub: Pagination not work on http://eservices.dubaitrade.ae
Im trying to scrap data by pagination but pagination not work on next button below you can see i applied code but it not get data by pagination below is URL please…
Care Nice Xavier
- 11
- 2
0
votes
0 answers
parsehub youtube greyed out
When I tried to start a new project crawling some youtuber's minecraft world files on parsehub, youtube just gave me all grey divs, and in the youtuber's video's page nothing showed up. Can I potentially change headers to make youtube's response…
Mikety2520
- 41
- 4
0
votes
0 answers
Parsehub keeps giving my result rows new names
I am trying to extract Company names, Phone numbers and website urls from this website:https://www.dastelefonbuch.de/Suche/It.
Parsehub is names the entry title/row in my results according to the selection functions that ran before it, instead of…
0
votes
0 answers
Parsehub - not loading table in own browser (umbraco form issue?)
I'm trying to use Parsehub to scrape a website: https://drc.ngo/about-us/job-and-career/vacancies/
The problem is the table (where the info I need is) loads in normal browsers, it even loads in Parsehub's 'browse' mode. But it doesn't load when…
intdev
- 1
- 1
0
votes
0 answers
ParseHub, seeing all selections in the steps window but certain items are missing from run
I'm trying to scrape category pages from various supermarkets. When setting up the steps I can see that it's correctly selecting the 58 products on the page however when I run (either as a test or full run) it isn't returning all 58, in this…
plan303
- 25
- 3
0
votes
0 answers
Using keywords with Parsehub
I created a project in Parsehub that scrapes provider name, address and phone from URL fastbraces.com/providers using a keyword list consisting of several zip codes. Each time I run the parser, it only processes the first two zip codes from the…
RyanM
- 1
0
votes
0 answers
How to extract specific coloured entries from table parsehub
i am trying to extract only pink,yellow and purple entries from the website how can I do it
A R MOHAMMED
- 57
- 6
0
votes
1 answer
extracting specific text from a URL using REGEX
I am working on a project where i need to grab some text and the only option available to do this is through regex. If anyone here can help, then please let me know what would be the correct command/syntax to capture the file number from the below…
0
votes
0 answers
How to clean and split the output text in ParseHub
My output is a list of text elements that look like this:
CVE-2016-2002: DoDB JS/Py 3.1.3, DoDB 5.5.65, DoDB 10.4.7, DoDB 10.3.17, DoDB 10.2.26, DoDB 10.1.41
(The number of commas is different between elements)
My goal is the split this one column…
Doron
- 11
- 7