series1_values = ['risk no', 'No', 'No', 'No', 'No', 'Yes', 'No', 'Yes',
'Medium rare', 'Female', '18-29', '$25,000 - $49,999',
'High school degree', 'South Atlantic']
series1 = pd.Series(series1_values)
series2 = pd.Series(['risk no', 'No', 'Yes', 'Yes', 'No', 'Yes', 'No', 'Yes',
'Medium rare', 'Female', '60+', '$25,000 - $49,999',
'High school degree', 'South Atlantic'])
series1.isin(series2)
0 True
1 True
2 True
3 True
4 True
5 True
6 True
7 True
8 True
9 True
10 False
11 True
12 True
13 True
dtype: bool
This code says that the two series share 13 values in common (sum of the trues) but they actually only have 11 values in common. Where is it getting the extra two values from?
Index 2 and 3 should also equate to False if you see what I mean.