I'm scraping a webpage's HTML code and am currently trying to build a Regex to grab the information I need. The pattern repeats about 20 times in my example and is as follows: It should start with tivo (because it will either start with Ativo or Inativo) and should end in "Ver Detalhes". This pattern repeats for about 20 times as I said before.
The line of code I'm using on this is:
posts=re.findall('(ativo.*?ver det)',text,re.IGNORECASE)
But it doesn't work, as it simply gets 12 matches and I'm not understanding the reason why. I've tried using .* instead of .*? but then it only extracts 3 matches instead.
The file can be found at the following link: Source file
Is this something that is possible to extract?