How to find multiple space in Pandas

Question

When one of columns contains target value, I write code following.

df['address'].str.contains('\t')

In my question, I would like to find multiple space which is more than 2 spaces. I think I should use Regular expression.

How can I write code? Please give me an advice.

What you mean by multiple space? – BENY Feb 10 '20 at 02:07 — BENY, Feb 10 '20 at 02:07

score 1 · Accepted Answer · answered Feb 10 '20 at 02:08

Would this be a good example

df = pd.DataFrame({'col': ['a', 'b  ', 'c', '   d', '  e    ']})
         col
0        a
1      b
2        c
3        d
4    e

df['col'].str.contains('  *', regex=True)
0    False
1     True
2    False
3     True
4     True

score 0 · Answer 2 · answered Feb 10 '20 at 02:08

0

One way use

s=pd.Series(['one ','more   than two','only  two'])
s.str.contains('  ')
0    False
1     True
2     True
dtype: bool

If we would like find the single space freq we could do count

s.str.count(' ')
0    1
1    4
2    2
dtype: int64

answered Feb 10 '20 at 02:08

BENY

score 0 · Answer 3 · answered Feb 10 '20 at 02:41

0

A regex based approach for any consecutive (potentially duplicate) words I posted is here: Regular Expression For Consecutive Duplicate Words

answered Feb 10 '20 at 02:41

synaptikon

3 Answers3