This is a simple Python script that parses the ril_export.html
file from Pocket web app and writes the data to a csv file.
The csv file has eight columns. Each pocketed webpage is on a seperate row.
status - Either Unread
or Read Archive
title - The title of the webpage pocketed
href - full URL of the webpage pocketed
domain - the root or sub domain of the webpage pocketed
date_added - The calendar date that the webpage was pocketed
time_added - The time of day that the webpage was pocketed
day_of_week_added - The day of the week that the webpage was pocketed
tags - A comma seperated list of tags assigned to the webpage
- Python 2.x
- csv
- codecs
- datetime
- BeautifulSoup
- Save the
ril_export.html
file exported from Pocket in the same directory aspocket_parse.py
- Run the script from the terminal
python pocket_parse.py