Iterating over Torchtext.data.BucketIterator object

Posted 2019-08-02 15:04

混吃等死

Question:

When I try to inspect a batch by printing the next item from the BucketIterator object, an AttributeError is thrown.

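from torchtext import data
from torchtext.data import BucketIterator

# TEXT and LABEL are field objects defined earlier (not shown in the question).
tv_datafields = [("Tweet", TEXT), ("Anger", LABEL), ("Fear", LABEL), ("Joy", LABEL), ("Sadness", LABEL)]
train, vld = data.TabularDataset.splits(path="./data/", train="train.csv", validation="test.csv", format="csv", fields=tv_datafields)

train_iter, val_iter = BucketIterator.splits(
    (train, vld),
    batch_sizes=(64, 64),
    device=-1,
    sort_key=lambda x: len(x.Tweet),
    sort_within_batch=False,
    repeat=False
)
# Note: train_dl is not defined in the code shown; the iterator created above is train_iter.
print(next(iter(train_dl)))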

Answer 1:

I am not sure which specific error you are getting, but in this case you can iterate over the batches with the following code:

for i in train_iter:
    # Each i is one batch; its attributes are named after the fields.
    print(i.Tweet)
    print(i.Anger)
    print(i.Fear)
    print(i.Joy)
    print(i.Sadness)

i.Tweet (and likewise the other fields) is a tensor of shape (input_data_length, batch_size).

So, to view a single example within the batch (say, example 0), you can use print(i.Tweet[:, 0]).

The same goes for val_iter (and test_iter, if needed).
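
To make the (input_data_length, batch_size) layout concrete, here is a minimal sketch that uses plain nested lists as a stand-in for the tensor, so it runs without torchtext installed. The token ids, the PAD value, and the helper names to_length_major and column are all made up for this illustration and are not part of torchtext. It shows that selecting along the second dimension picks out one example, while selecting along the first picks out one token position across all examples.

```python
# Stand-in for a torchtext batch field: nested lists laid out as
# (input_data_length, batch_size), i.e. one row per token position
# and one column per example. Real batches hold torch tensors instead.

PAD = 1  # hypothetical padding index, chosen for this illustration


def to_length_major(examples, pad=PAD):
    """Pad token-id lists to equal length; rows are positions, columns are examples."""
    max_len = max(len(ex) for ex in examples)
    padded = [ex + [pad] * (max_len - len(ex)) for ex in examples]
    # Transpose from (batch_size, length) to (length, batch_size).
    return [list(row) for row in zip(*padded)]


def column(batch, j):
    """Stand-in for batch[:, j] on a 2-D tensor: every position of example j."""
    return [row[j] for row in batch]


examples = [[5, 8, 2], [7, 3], [4, 9, 6, 2]]  # made-up token ids
tweet = to_length_major(examples)

print(len(tweet), len(tweet[0]))  # shape: 4 3
print(column(tweet, 0))           # example 0, padded: [5, 8, 2, 1]
print(tweet[0])                   # position 0 of every example: [5, 7, 4]
```

With a real tensor the same two selections would be written i.Tweet[:, 0] and i.Tweet[0].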

Tags: python iterator pytorch torchtext
