Change column in df from column from another

2019-07-15 02:36发布

I have data:

                 date                      id 
0     2016-06-17 06:25:05    yans.bouts@yandex.ru    
1     2016-06-17 06:25:07    yans.bouts@yandex.ru       
2     2016-06-17 06:25:10    titovtanya@yandex.ru         
3     2016-06-17 06:25:11    titovtanya@yandex.ru

Other data

Email,UTC shift
yans.bouts@yandex.ru,5
inkin_sam@mail.ru,3
titovtanya@yandex.ru,3
dasha.dasha.kovaleva@mail.ru,2

I need to add UTC shift to first file date to hours. Desire output:

                     date                      id 
0     2016-06-17 11:25:05    yans.bouts@yandex.ru    
1     2016-06-17 11:25:07    yans.bouts@yandex.ru       
2     2016-06-17 09:25:10    titovtanya@yandex.ru         
3     2016-06-17 09:25:11    titovtanya@yandex.ru

I convert date to datetime, but I don't know, how convert UTC shift to hours.

2条回答
ゆ 、 Hurt°
2楼-- · 2019-07-15 03:01

You need first convert to_datetime column date if dtype is not datetime, merge by columns id and Email. Then convert to_timedelta column UTC shift, add to date and last drop unnecessary columns:

import pandas as pd


df1 = pd.DataFrame({'date': {0: '2016-06-17 06:25:05', 1: '2016-06-17 06:25:07', 2: '2016-06-17 06:25:10', 3: '2016-06-17 06:25:11'}, 
                      'id': {0: 'yans.bouts@yandex.ru', 1: 'yans.bouts@yandex.ru', 2: 'titovtanya@yandex.ru', 3: 'titovtanya@yandex.ru'}})

df2 = pd.DataFrame({'Email': {0: 'yans.bouts@yandex.ru', 1: 'inkin_sam@mail.ru', 2: 'titovtanya@yandex.ru', 3: 'dasha.dasha.kovaleva@mail.ru'}, 
                    'UTC shift': {0: 5, 1: 3, 2: 3, 3: 2}})

print (df1)
                  date                    id
0  2016-06-17 06:25:05  yans.bouts@yandex.ru
1  2016-06-17 06:25:07  yans.bouts@yandex.ru
2  2016-06-17 06:25:10  titovtanya@yandex.ru
3  2016-06-17 06:25:11  titovtanya@yandex.ru

print (df2)
                          Email  UTC shift
0          yans.bouts@yandex.ru          5
1             inkin_sam@mail.ru          3
2          titovtanya@yandex.ru          3
3  dasha.dasha.kovaleva@mail.ru          2
df1['date'] = pd.to_datetime(df1.date)

df = pd.merge(df1, df2, left_on='id', right_on='Email')
df['date'] += pd.to_timedelta(df['UTC shift'], unit='H')
df.drop(['Email','UTC shift'], axis=1, inplace=True)
print (df)
                 date                    id
0 2016-06-17 11:25:05  yans.bouts@yandex.ru
1 2016-06-17 11:25:07  yans.bouts@yandex.ru
2 2016-06-17 09:25:10  titovtanya@yandex.ru
3 2016-06-17 09:25:11  titovtanya@yandex.ru
查看更多
Fickle 薄情
3楼-- · 2019-07-15 03:14

try this:

df['date'] += pd.Timedelta(df['UTC shift'], unit='H')
查看更多
登录 后发表回答