I have a huge HDF5 file, I want to load part of it in a pandas DataFrame to perform some operations, but I am interested in filtering some rows.
I can explain better with an example:
Original HDF5 file would look something like:
A B C D
1 0 34 11
2 0 32 15
3 1 35 22
4 1 34 15
5 1 31 9
1 0 34 15
2 1 29 11
3 0 34 15
4 1 12 14
5 0 34 15
1 0 32 13
2 1 34 15
etc etc etc etc
What I am trying to do is to load this, exactly as it is, to a pandas Dataframe but only where A==1 or 3 or 4
Until now I can just load the whole HDF5 using:
store = pd.HDFStore('Resutls2015_10_21.h5')
df = pd.DataFrame(store['results_table'])
I do not see how to include a where
condition here.