Pickling a DataFrame

2019-02-09 08:50发布

I am trying to pickle a DataFrame with

import pandas as pd
from pandas import DataFrame
data = pd.read_table('Purchases.tsv',index_col='coreuserid')
data.to_pickle('Purchases.pkl')

I have been running on "data" for a while and have had no issues so I know it is not a data corruption issue. I am thinking likely syntax but I have tried a number of variants. I hesitate to give the whole error message but it ends with:

\pickle.pyc in to_pickle(obj, path)
 13     """
 14     with open(path, 'wb') as f:
 15         pkl.dump(obj, f, protocol=pkl.HIGHEST_PROTOCOL)

 SystemError: error return without exception set 

The Purchases.pkl file is created but if I call

data = pd.read_pickle('Purchases.pkl')

I get EOFError. I am using Canopy 1.4 so pandas 0.13.1 which should be recent enough to have this functionality.

2条回答
聊天终结者
2楼-- · 2019-02-09 09:29

You can try create a class from your DataFrame and pickle it after.

This can help you: Pass pandas dataframe into class

查看更多
再贱就再见
3楼-- · 2019-02-09 09:36

Fast forward a few years, and now it works fine. Thanks pandas ;)

查看更多
登录 后发表回答