Regex substring one mismatch in any location of string

Question

Can someone explain why the code below returns an empty list:

>>> import re
>>> m = re.findall("(SS){e<=1}", "PSSZ")
>>> m
[]

I am trying to count the occurrences of SS within PSSZ, allowing up to one mismatch per occurrence.

Avinash Raj · Accepted Answer · 2015-07-12T04:39:51.530

You need to remove the `e<=` characters from inside the range quantifier. With `re`, a range quantifier must have the form `{n}`, `{n,}`, `{,m}` or `{n,m}`.

It would be,

m = re.findall("(SS){1}", "PSSZ")

or

m = re.findall(r'SS','PSSZ')
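
As a quick check of why the original call returns an empty list: as far as I can tell, `re` does not recognise `{e<=1}` as a quantifier and falls back to treating the braces and their contents as literal characters, so the pattern searches for the text `SS{e<=1}`. A minimal sketch of that behaviour (the input string `xSS{e<=1}x` is made up for illustration):

```python
import re

pattern = r"(SS){e<=1}"

# "PSSZ" does not contain the literal text "SS{e<=1}", so nothing matches.
no_match = re.findall(pattern, "PSSZ")

# The same pattern does match when that literal text is present in the input;
# findall returns the contents of the single capture group.
literal_match = re.findall(pattern, "xSS{e<=1}x")

print(no_match)
print(literal_match)
```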

Update:

>>> re.findall(r'(?=(S.|.S))', 'PSSZ')
['PS', 'SS', 'SZ']
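
For longer targets such as `SSZ`, the same lookahead idea can be generated rather than written by hand. The sketch below is only one way to do it: `one_mismatch_pattern` and `find_with_one_mismatch` are hypothetical helper names, not part of `re`. It builds one alternative per position, with that position replaced by `.`, and wraps them in a capturing lookahead so overlapping hits are reported. Exact matches are included too, and `.` will not match a newline unless `re.DOTALL` is used.

```python
import re

def one_mismatch_pattern(target):
    """Build a lookahead pattern matching `target` with at most one substituted character."""
    alternatives = []
    for i in range(len(target)):
        # Escape every character so regex metacharacters in the target stay literal.
        parts = [re.escape(ch) for ch in target]
        # Allow any single character at position i.
        parts[i] = "."
        alternatives.append("".join(parts))
    # The lookahead consumes nothing, so matches may overlap.
    return "(?=(" + "|".join(alternatives) + "))"

def find_with_one_mismatch(target, text):
    """Return every substring of `text` that differs from `target` in at most one position."""
    return re.findall(one_mismatch_pattern(target), text)

two_char = find_with_one_mismatch("SS", "PSSZ")
three_char = find_with_one_mismatch("SSZ", "PSSZSAZ")
print(two_char)    # ['PS', 'SS', 'SZ']
print(three_char)  # ['SSZ', 'SAZ']
```

The number of occurrences is then just the `len()` of the returned list.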

edited Jul 12 '15 at 04:39

answered Jul 12 '15 at 04:27

This only gives `SS` (one instance), not `PS` or `SZ`, where there is one mismatch. – warship Jul 12 '15 at 04:32
@warship .. didn't you ask for the number of occurrence of `SS` ? – Iron Fist Jul 12 '15 at 04:34
It should also return `SS`. Hopefully, with something simple like the `{1}` notation, this would be much more elegant. – warship Jul 12 '15 at 04:38
Thank you, very interesting. I assume this is the most concise way to do it. How would you go about working with triple character strings such as `SSZ`, would the regex notation change? Perhaps you can post an update. – warship Jul 12 '15 at 04:41
