I'm using a NSDateFormatter to parse a RFC 822 date on the iPhone. However, there is no way to specify optional elements in the date format. There are a couple of optional parts in the RFC 822 specification which is breaking the date parser. If nothing works out, I'd probably have to write a custom parser to obey the specs.
For example, the day name is optional in the spec. So both these dates are valid:
Tue, 01 Dec 2009 08:48:25 +0000
is parsed with the format EEE, dd MMM yyyy HH:mm:ss z
01 Dec 2009 08:48:25 +0000
is parsed with the format dd MMM yyyy HH:mm:ss z
This is what I am currently using:
+ (NSDateFormatter *)rfc822Formatter {
static NSDateFormatter *formatter = nil;
if (formatter == nil) {
formatter = [[NSDateFormatter alloc] init];
NSLocale *enUS = [[NSLocale alloc] initWithLocaleIdentifier:@"en_US"];
[formatter setLocale:enUS];
[enUS release];
[formatter setDateFormat:@"EEE, dd MMM yyyy HH:mm:ss z"];
}
return formatter;
}
+ (NSDate *)dateFromRFC822:(NSString *)date {
NSDateFormatter *formatter = [NSDate rfc822Formatter];
return [formatter dateFromString:date];
}
And parsing the date as follows:
self.entry.published = [NSDate dateFromRFC822:self.currentString];
One way is to try both formats, and take whatever returns non null value. However, there are two optional parts in the spec (day name and seconds) and there would be 4 possible combinations. Still not too bad, but it's a bit hacky.
I've used the following method to parse RFC822 dates. I believe it originally was from MWFeedParser:
In case this is helpful to anyone else.. here is a NSDate+RFC822String.swift extension based on Simucal's answer.
It also caches the last used date format that was successful, since setting the dateFormatter.dateFormat is expensive.
I believe RFC 822 specifies two optional components in the date time: day of week and the seconds past the hour.
As a hack, it is possible to the symbols for the short days of the week:
If you then change the date format to this:
EEEdd MMM yyyy HH:mm:ss z
. You'll be able to parse times with about without the day of the week. This seems to allow a space after the comma too.To be safe you should not just blindly set the symbols like this. You should get using
setShortWeekdaySymbols
and iterate over them adding the comma at the end. The reason being they are potentially different for each locale and the first day might not be Sunday.Interestingly the format
EEE, dd MMM yyyy HH:mm:ss z
will parse times without the day of week, but the comma must be there, for example, 01 Dec 2009 08:48:25 +0000
. Therefore, you could do something like Steve said but then strip off the day and pass though to the formatter. Not having the comma in the format does not seem to allow the week to be optional. Strange.Unfortunately, this still doesn't help with the optional :ss in the format. But it might allow you to have two formats rather than four.
Count the number of salient characters before deciding which formatter to use. For example, the two you give have different numbers of commas and spaces. If no known format matches the counts, then you known not even to try parsing it as a date.