Some RSS feeds don’t follow RFC 822 #30

Open
dahlia opened this issue Dec 23, 2013 · 2 comments

@dahlia

dahlia commented Dec 23, 2013

The MSDN Magazine feed seems to be one of them.

http://msdn.microsoft.com/ko-kr/magazine/rss/default.aspx?loc=ko&fn=MSDN%20Magazine%20Korea

      <pubDate>Tue, 15 Feb 2000 00:00:00 GMT</pubDate>
@minhoryang

+1: http://rss.ohmynews.com/rss/ohmynews.xml has the same problem. Its dates look like this:

2014-01-08T15:25:12+09:00
2014-01-08T15:23:49+09:00
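
One way a parser could tolerate both date styles seen in this thread is to try RFC 822 first and fall back to an RFC 3339 style timestamp. The following is only a rough sketch using the Python standard library. `parse_feed_date` is a hypothetical helper name, not part of libearth, and it assumes Python 3.7 or later so that `%z` accepts an offset written with a colon.

```python
from datetime import datetime
from email.utils import parsedate_to_datetime


def parse_feed_date(text):
    """Hypothetical helper: try RFC 822 first, then an RFC 3339 style value."""
    text = text.strip()
    try:
        # Handles values such as "Tue, 15 Feb 2000 00:00:00 GMT".
        return parsedate_to_datetime(text)
    except (TypeError, ValueError):
        # Older Python versions raise TypeError for unparseable input,
        # newer ones raise ValueError; either way, try the next format.
        pass
    # Handles values such as "2014-01-08T15:25:12+09:00".
    # Raises ValueError if the text matches neither format.
    return datetime.strptime(text, "%Y-%m-%dT%H:%M:%S%z")
```

Calling `parse_feed_date("2014-01-08T15:25:12+09:00")` should give a timezone-aware `datetime` with a +09:00 offset. Fractional seconds and other variants are deliberately left out of this sketch.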

@dahlia

dahlia commented May 20, 2014

This Atom feed fails as well: http://ncf.undefined.me/webtoon/186811.xml?limit=15

Traceback (most recent call last):
  File "/Users/dahlia/Projects/libearth/libearth/crawler.py", line 59, in get_feed
    feed, crawler_hints = parser(feed_xml, feed_url)
  File "/Users/dahlia/Projects/libearth/libearth/parser/atom.py", line 51, in parse_atom
    feed_data = atom_get_feed_data(root, feed_url)
  File "/Users/dahlia/Projects/libearth/libearth/parser/atom.py", line 93, in atom_get_feed_data
    feed_data.updated_at = atom_get_updated_tag(data)
  File "/Users/dahlia/Projects/libearth/libearth/parser/atom.py", line 181, in atom_get_updated_tag
    return Rfc3339().decode(data.text)
  File "/Users/dahlia/Projects/libearth/libearth/codecs.py", line 129, in decode
    ' is not valid RFC3339 date time string')
libearth.schema.DecodeError: '2014-05-14T00:00:00' is not valid RFC3339 date time string
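
The string in this error, `2014-05-14T00:00:00`, has no UTC offset, and RFC 3339 requires one. A lenient decoder could accept such values by assuming a default time zone. The sketch below is not libearth's `Rfc3339` codec. `decode_lenient_rfc3339` and its `default_tz` parameter are hypothetical names, and treating the value as UTC is an assumption that may be wrong for feeds that mean local time.

```python
from datetime import datetime, timezone


def decode_lenient_rfc3339(text, default_tz=timezone.utc):
    """Hypothetical lenient decoder that tolerates a missing UTC offset."""
    text = text.strip()
    if text.endswith(("Z", "z")):
        # Normalize the "Z" suffix to an explicit zero offset.
        text = text[:-1] + "+00:00"
    try:
        # Strict case: an offset is present, e.g. "2014-01-08T15:25:12+09:00".
        return datetime.strptime(text, "%Y-%m-%dT%H:%M:%S%z")
    except ValueError:
        # Lenient case: no offset, e.g. "2014-05-14T00:00:00".
        # ASSUMPTION: interpret the value in default_tz (UTC by default).
        naive = datetime.strptime(text, "%Y-%m-%dT%H:%M:%S")
        return naive.replace(tzinfo=default_tz)
```

With this sketch, `decode_lenient_rfc3339('2014-05-14T00:00:00')` returns midnight on 14 May 2014 in UTC instead of raising. Input that matches neither form still raises `ValueError`.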

dahlia added a commit that referenced this issue May 25, 2014
- It becomes possible to parse RFC 3339 #30
- Proper parsing of <author>, <managingEditor>, and <webMaster>