Questions tagged [html-xml-utils]

10 questions
4
votes
1 answer

Parse local HTML python (lxml)

I'm trying to parse a local HTML with lxml, but I'm getting an error, but I don't know why (sorry in advance for the bad code, I'm new to this). from lxml import etree, html from StringIO import StringIO parser = etree.HTMLParser() doc =…
Lara M.
  • 831
  • 1
  • 8
  • 20
2
votes
1 answer

xml2csv package error '_IterParseIterator' object has no attribute 'next'

I am using xmlutils package to convert xml file to csv. My code is following : from xmlutils.xml2csv import xml2csv as x input_path='/media/ishan/Local Disk/doc.xml' output_path='media/ishan/Local…
Ishan
  • 917
  • 1
  • 12
  • 26
1
vote
2 answers

How can I use hxselect to generate array-ish result?

I'm using hxselect to process a HTML file in bash. In this file there are multiple divs defined with the '.row' class. In bash I want to extract these 'rows' into an array. (The divs are multilined so simply reading it line-by-line is not…
Daniel
  • 1,453
  • 13
  • 33
1
vote
1 answer

Middle Selectors Ignored in hxselect

I'm attempting to extract some text from a webpage using hxselect from html-xml-utils 7.4. According to the man, hxselect will accept a comma delimited list of CSS selectors. I have three selectors: /usr/local/bin/hxnormalize -x -i 0 -l 5000…
Bryan
  • 245
  • 1
  • 2
  • 8
1
vote
1 answer

Groovy: keeping of CR/LF of attributes while serializing XMLSlurper

I'm new with groovy (a few weeks of experience). Currently I'm trying to process some visual studio .vcproj files using groovy: replacing some paths, that will be found by a regexp patterns. This works fine for me. To write the changes to the file,…
0
votes
1 answer

Vbscript/UFT how to get the Tag elements name to validate Xpath of tag elements

Attached XML sample, In the attached XML I want to verify the tag Elements presence for ex: PayloadList/IFXResp/IFX/GeneralStatus/StatusCode it would be great if anyone can help me to get the above xpath code. Looking for code in vbscript/UFT to…
Anil
  • 3
  • 2
0
votes
1 answer

Parsing XML in Python

I have a large XML file and I need to format it to get some needed data from particular elements in it and print out only data needed into another file. In the XML file I have a number of text tags belonging to different conversations with id's and…
T. A
  • 1
  • 3
0
votes
1 answer

html-xml-utils Get a specific value

I am using Xubuntu 16.04.2 with html-xml-utils I tried this command hxnormalize -x site/sciz.html | hxselect 'table.vitals-table:first-child' but it doesn't work, it outputs nothing. There are about 3-5 of these tables in the html. Doing this…
knuxyl
  • 113
  • 2
  • 10
0
votes
2 answers

make[1]: exec(f77) failed (No such file or directory) on DragonFly BSD

I get this error when trying to compile html-xml-utils-6.9 on DragonFly BSD. Funny thing is, when doing grep -r f77 in the source directory, the result is empty. So no such option is in the constructed Makefile after ./configure. This is my…
Olle Härstedt
  • 3,348
  • 1
  • 18
  • 46
0
votes
1 answer

Extract HTML table content based on "thead"

Here is a basic HTML table : …
bar
rows
Suppose there are several such tables in the source file. Is there an option of hxextract, or a CSS3…