Skip to content
This repository has been archived by the owner on Apr 29, 2018. It is now read-only.

Latest commit

 

History

History
101 lines (79 loc) · 5.25 KB

README.md

File metadata and controls

101 lines (79 loc) · 5.25 KB

LIFOrganiser

Provides an API to scrape the course, chapter and lesson data from LearnItFirst.com and then organise, move and rename pre-downloaded course chapters accordingly.

Description

LearnItFirst.com hosts hundreds of hours of video tutorials that make up full certification courses about everything SQL. Subscribers must download the course contents chapter by chapter. The downloaded chapters contain the lesson files (AVI videos, PDF presentation slides and supplementary content & exercises).

However, they are named fairly non-descriptively compared with the disciplined naming convention on the website's course listings. This python package sets out to solve that problem as well as automatically unzipping all the downloaded chapters within a course and organising the video & supplementary content. The module produces contains extensive logging and the code is fully documented and it can be feely distributed and modified under the Apache License v2.

Installation

  1. Download ZIP & Unzip or git clone https://github.com/adamcunnington/LIFOrganiser.git
  2. CD into LIFOrganiser
  3. python setup.py install

You may need administrator privileges for step 3. What you do here depends on your operating system. For example in Ubuntu you would say sudo python setup.py install To bypass the need for administrator privileges, you may be able to make use of the --user flag. In this way you can install the package only for the current user.

For more information, see:

Example Usage

import logging

import liforganiser

_src = "/home/ac/Downloads"
_dst = "/home/ac/Dropbox/Documents/Education/Tutorials/LearnItFirst"
_avi_dst = "/media/ADAM-PC/Tutorials/LearnItFirst"

ssas = liforganiser.Course.get(165, logging.INFO)
ssas.organise(_src, _dst, chapter_pattern=r"SSAS2008_Chapter(\d+)(?:.zip|)",
              lesson_pattern=r"Ch\d+_(\d+)(?:_|)([^\.]*)(?:\.\w+|)",
              avi_dst=_avi_dst, completed_prefix="DONE")
              
ssis = liforganiser.Course.get(158)
ssis.organise(_src, _dst, chapter_pattern=r"SSIS2008_Chapter(\d+)(?:.zip|)",
              lesson_pattern=r"\d+_(\d+)(?:-|)([^\.]*)(?:\.\w+|)",
              avi_dst=_avi_dst)
              
tsql = liforganiser.Course.get(161)
tsql.organise(_src, _dst, chapter_pattern=r"TSQL2008_Chapter(\d+)(?:.zip|)",
              lesson_pattern=r"Ch\d+_(\d+)(?:_|)([^\.]*)(?:\.\w+|)",
              completed_prefix="DONE")
              
sqlq = liforganiser.Course.get(160, logging.INFO)
sqlq.organise(_src, _dst, chapter_pattern=r"SQLQueries2008_Chapter(\d+)(?:.zip|)",
              lesson_pattern=r"\d+_(\d+)(?:-|)([^\.]*)(?:\.\w+|)",
              completed_prefix="DONE")

Check the source for complete documentation - each publically exposed name is documented.

CLI Usage

usage: __init__.py [-h] [-v | -q] [-a AVIDST] [-p PDFDST] [-c COMPLETEDPREFIX]
                   [-i [IGNOREDEXTS [IGNOREDEXTS ...]]]
                   course_id src dst chapter_pattern lesson_pattern

Scrape the course, chapter and lesson data from LearnItFirst.com and then
organise, move and rename pre-downloaded course chapters accordingly.

positional arguments:
  course_id             the unique ID integer (between 100 and 999) of the
                        course
  src                   the full path to the source directory containing the
                        course chapter directories or zip files
  dst                   the full path to the destination directory which all
                        valid, lesson-related files will be moved to by
                        default
  chapter_pattern       a regex pattern representing a valid chapter name,
                        taking into account that both directories and zip
                        files should likely be accepted
  lesson_pattern        a regex pattern representing a valid lesson file that
                        should be matched to scraped course data and renamed
                        accordingly

optional arguments:
  -h, --help            show this help message and exit
  -v, --verbosity       increase output verbosity
  -q, --quiet           decrease output verbosity
  -a AVIDST, --avidst AVIDST
                        the full path to the destination directory in which
                        AVI files only should be moved to
  -p PDFDST, --pdfdst PDFDST
                        the full path to the destination directory in which
                        PDF files only should be moved to
  -c COMPLETEDPREFIX, --completedprefix COMPLETEDPREFIX
                        a prefix to be prepended to the old directories / zip
                        files in which the contents have been succesfully
                        moved, so that re-calling organise will not match and
                        attempt to organise them again
  -i [IGNOREDEXTS [IGNOREDEXTS ...]], --ignoredexts [IGNOREDEXTS [IGNOREDEXTS ...]]
                        file extensions (without the seperator) that should be
                        ignored when renaming and moving files