This code connects to a local disk based queue and is used to submit directories to be checked against the elasticsearch index.
Differences are then submitted to the rabbit queues to be picked up by the indexing code.
The diagram below shows a rough sketch of queues involved with the different processes and scripts.
- Install the package
pip install git+https://github.com/cedadev/fbi-directory-check
This code base requires a config file to setup the local queue. This is found in rabbit_indexer/conf
Config options
Option | Description |
---|---|
queue-location | Directory path to queue databases |
Takes items off a local queue and checks the elastisearch indices (directories and files) against the filesystem. If files/directories need adding, messages are sent to the rabbit queue for processing. This is to be run as a process. AKA crawler.
Display the current number of directories in the user submitted and bot queues. These are processed to check the difference between the index and the archive and then actions are submitted to the rabbit index.
Usage:
fbi_q_check
Rescan the given directory. This will overwrite the content in the indices for this directory
Usage:
fbi_rescan_dir <dir> [-r] [--no-files] [--no-dirs] [--conf <conf>]
Options:
Option | Description |
---|---|
-r | Will search all directories recursively |
--no-files | Will exclude files from the results and only change directories |
--no-dirs | Will exclude directories from the results and only change files |
--conf | Path to configuration file |
Submit directories to be checked for consistency between the archive and the indices.
Usage:
fbi_directory_check (--dir <dir> | --file <file>) [-r] [--conf <conf>]
Options:
Option | Description |
---|---|
-r | Will search all directories recursively |
--dir | Accepts a directory path |
--file | Accepts a file input |
--conf | Path to configuration file |