Questions tagged [parsehub]

parsehub is a data-extraction platform for static and dynamic data web-sites. The platform has free and paid tiers of service. Templates can be written that will select elements, extract data to CSV/JSON, and interact with elements in the page. Multi-page navigation is possible. There is an API to gain access to the platform's capabilities for user projects.

27 questions
2
votes
2 answers

Regex: Keep last word after text

Tommy Hilfiger Men Teal Blue Analogue Watch TH1791293_BBD I need to split this and keep the last part i.e TH1791293_BBD The issue is that the part before the target string i.e Tommy Hilfiger Men Teal Blue Analogue Watch Can be of varying…
2
votes
2 answers

Parsehub API PHP

How can I dump the results into a MySql db? Specially decode gzip and parse it to an HP array then dump into a db.
2
votes
0 answers

Pasehub: Extract data from the event url

Need to extract data from the url that is provided as a link while parsing a template. I have created a template for a website and it is working fine. But the link that is provided with each set of extracted object, I need to go to that particular…
Syed Asad Abbas Zaidi
  • 848
  • 1
  • 15
  • 28
2
votes
2 answers

ParseHub Webhook with RoR

Parsehub provides the webhook feature. But currently I'm testing my Rails app locally. So how could I provide the webhook url for a project on Parsehub to point to my local server or any specific method in my controller. Parsehub Doc…
Syed Asad Abbas Zaidi
  • 848
  • 1
  • 15
  • 28
2
votes
1 answer

Parsehub main_template and renaming

So using the parsehub tool to experiment with data-scraping and wondering if there is a rule to keeping the main_template name which is automatically given to all projects. Is it possible to change it and what is the significance of the name and the…
Shawn Mehan
  • 4,274
  • 9
  • 28
  • 48
1
vote
1 answer

Parsehub website elements only display on a certain date

I'm using Parsehub to scrape certain data from certain pages on a website into a google spreadsheet. The issue I'm having is that a certain html element only displays on a certain date and I'm wondering if there is a way to set it up so Parsehub…
1
vote
1 answer

Parsehub: Pagination not work on http://eservices.dubaitrade.ae

Im trying to scrap data by pagination but pagination not work on next button below you can see i applied code but it not get data by pagination below is URL please…
0
votes
0 answers

parsehub youtube greyed out

When I tried to start a new project crawling some youtuber's minecraft world files on parsehub, youtube just gave me all grey divs, and in the youtuber's video's page nothing showed up. Can I potentially change headers to make youtube's response…
Mikety2520
  • 41
  • 4
0
votes
0 answers

Parsehub keeps giving my result rows new names

I am trying to extract Company names, Phone numbers and website urls from this website:https://www.dastelefonbuch.de/Suche/It. Parsehub is names the entry title/row in my results according to the selection functions that ran before it, instead of…
0
votes
0 answers

Parsehub - not loading table in own browser (umbraco form issue?)

I'm trying to use Parsehub to scrape a website: https://drc.ngo/about-us/job-and-career/vacancies/ The problem is the table (where the info I need is) loads in normal browsers, it even loads in Parsehub's 'browse' mode. But it doesn't load when…
intdev
  • 1
  • 1
0
votes
0 answers

ParseHub, seeing all selections in the steps window but certain items are missing from run

I'm trying to scrape category pages from various supermarkets. When setting up the steps I can see that it's correctly selecting the 58 products on the page however when I run (either as a test or full run) it isn't returning all 58, in this…
plan303
  • 25
  • 3
0
votes
0 answers

Using keywords with Parsehub

I created a project in Parsehub that scrapes provider name, address and phone from URL fastbraces.com/providers using a keyword list consisting of several zip codes. Each time I run the parser, it only processes the first two zip codes from the…
RyanM
  • 1
0
votes
0 answers

How to extract specific coloured entries from table parsehub

i am trying to extract only pink,yellow and purple entries from the website how can I do it
0
votes
1 answer

extracting specific text from a URL using REGEX

I am working on a project where i need to grab some text and the only option available to do this is through regex. If anyone here can help, then please let me know what would be the correct command/syntax to capture the file number from the below…
0
votes
0 answers

How to clean and split the output text in ParseHub

My output is a list of text elements that look like this: CVE-2016-2002: DoDB JS/Py 3.1.3, DoDB 5.5.65, DoDB 10.4.7, DoDB 10.3.17, DoDB 10.2.26, DoDB 10.1.41 (The number of commas is different between elements) My goal is the split this one column…
Doron
  • 11
  • 7
1
2