I have a string list:
content
01/09/15, 10:07 - message1
01/09/15, 10:32 - message2
01/09/15, 10:44 - message3
I want a data frame, like:
date message
01/09/15, 10:07 message1
01/09/15, 10:32 message2
01/09/15, 10:44 message3
Considering the fact that all my strings in the list starts in that format, I can just split by -
, but I rather look for a smarter way to do so.
history = pd.DataFrame([line.split(" - ", 1) for line in content], columns=['date', 'message'])
(I'll convert the date to date time afterwards)
Any help would be appreciated.
You can use
str.extract
- where named groups can become column namesDetails
Use
str.split
by\s+-\s+
-\s+
is one or more whitespaces:If need remove
content
column addDataFrame.pop
: