genfromtxt and numpy

2019-09-09 16:23发布

I have data in files such as "file.csv". I would like to read them with np.genfromtxt and do some statistics like average, variance etc. on some columns (X, Y, Z). However I want to make the statistics on for X > 1, Y > 3 Z > 2 etc. This is a simple example here.

This code produces almost correct results but it includes ALL Xs, Ys and Zs, I want to do the same but with the X,Y,Z conditions i specified above.

#file.csv
X,Y,Z
1,2,3
4,2,5
15,9,1
#

data = np.genfromtxt(file.csv, delimiter=',', dtype=float, unpack=True, skiprows = 0) 
X=data[0];Y=data[1];Z=data[2]
Mean = np.average(X)

--> Doing a great job getting the average. However, I want i to get average ONLY IF X > 1 (for example)... How do I make it do so?

标签： python numpy genfromtxt

2条回答

SAY GOODBYE

2楼-- · 2019-09-09 16:49

In order to average over only some fields, you break down your averaging as follows:

Find the indexes (ind) of those elements that meet a certain criteria
Find the mean of the array indexed only with the the values in ind

The following code does exactly this:

indexes = np.where(X>1)[0] # We index with '0' here to get to the 1st element of the returned tuple
Mean = np.mean(X[indexes])

0人赞添加讨论(0) 举报

戒情不戒烟

3楼-- · 2019-09-09 17:14

You could use so-called "fancy-indexing", X[X>1], to select the part of the array you want:

import numpy as np
X,Y,Z = np.genfromtxt('file.csv', delimiter=',', dtype=float, unpack=True, skiprows = 0)
print(X)
# [ nan   1.   4.  15.]
print(X[X>1])
# [  4.  15.]
print(np.average(X[X>1]))
# 9.5

To combine two masks (boolean arrays) with bit-wise logical-and, use the & operator:

print(np.average(X[(X>1)&(X<10)]))
# 4.0

0人赞添加讨论(0) 举报

genfromtxt and numpy

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间