I have two DataFrames in Python that I would like to query for data.
DF1: 4M records x 3 columns. The query function seems more efficient than the loc function.
DF2: 2K records x 6 columns. The loc function seems much more efficient than the query function.
Both queries return a single record. The simulation was done by running the same operation in a loop 10K times.
I am running Python 2.7 and pandas 0.16.0.
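For reference, a comparison of this kind could be timed roughly as in the sketch below. The column names (`key`, `value`), the frame size, and the helper names `with_query` and `with_loc` are assumptions made for illustration only, not the real data, and the loop count is reduced so the sketch runs quickly.

```python
import timeit

import numpy as np
import pandas as pd

# Hypothetical stand-in for the small frame; the real columns are not shown.
n_rows = 2000
df = pd.DataFrame({"key": np.arange(n_rows), "value": np.random.rand(n_rows)})
target = 1234


def with_query(frame, wanted):
    # `@wanted` refers to the local Python variable of that name.
    return frame.query("key == @wanted")


def with_loc(frame, wanted):
    # Boolean mask built in plain pandas, then label-based selection.
    return frame.loc[frame["key"] == wanted]


n_runs = 100  # the original test used 10K iterations
t_query = timeit.timeit(lambda: with_query(df, target), number=n_runs)
t_loc = timeit.timeit(lambda: with_loc(df, target), number=n_runs)

print("query: {:.4f} s for {} runs".format(t_query, n_runs))
print("loc:   {:.4f} s for {} runs".format(t_loc, n_runs))
```

Changing `n_rows` to a few million should show how the balance between the two approaches shifts with frame size.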
Any recommendations to improve the query speed?
To improve performance, you can use numexpr:
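A minimal sketch of one way to do this, assuming a hypothetical frame `df` with integer columns `a`, `b` and `c` (the question does not give the real column names). The boolean mask is computed with `numexpr.evaluate` on the underlying NumPy array and then used to select rows. The helper `select_row` is illustrative, and the sketch falls back to a plain NumPy comparison if numexpr is not installed.

```python
import numpy as np
import pandas as pd

try:
    import numexpr as ne
    HAVE_NUMEXPR = True
except ImportError:  # numexpr is an optional dependency
    HAVE_NUMEXPR = False

# Hypothetical data: column names and sizes are illustrative only.
n = 100000
df = pd.DataFrame({
    "a": np.arange(n),
    "b": np.arange(n) % 7,
    "c": np.arange(n) % 3,
})


def select_row(frame, a_value):
    """Return the rows of `frame` where column 'a' equals `a_value`."""
    a = frame["a"].values  # plain NumPy array of the column
    if HAVE_NUMEXPR:
        # numexpr picks up `a` and `a_value` from the local scope.
        mask = ne.evaluate("a == a_value")
    else:
        mask = a == a_value
    return frame[mask]


result = select_row(df, 12345)
print(result)
```

Whether this beats `query` or `loc` depends on the frame size, so it is worth timing on the real data before adopting it.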