mirror of
https://github.com/tiyn/stud.ip-crawler.git
synced 2025-07-14 16:37:47 +02:00
The log has options for several levels that can be set from the command line. The file is hardcoded as log.txt and can be toggled
45 lines
1.5 KiB
Markdown
45 lines
1.5 KiB
Markdown
# Stud.IP Crawler
|
|
|
|
This is a program that downloads all files available for a given Stud.IP user.
|
|
It only downloads and searches through the courses in the current semester.
|
|
If you run the program again it only downloads files that have changed since the last time running it.
|
|
|
|
## Features/To-Dos
|
|
|
|
- [x] Downloads files of given users active semester via commandline
|
|
- [x] Keeping file structure of Stud.IP
|
|
- [x] Specify username
|
|
- [x] Specify password
|
|
- [x] Specify Stud.IP-URL
|
|
- [x] Specify output directory
|
|
- [x] Specify chunk size to download big files
|
|
- [x] Specify all important database variables
|
|
- [x] Only download files after given date
|
|
- [x] Save and read download date
|
|
- [x] Possible reset of download date
|
|
- [x] Incremental file download
|
|
- [x] Store id and chdate of downloaded files
|
|
- [x] Logging
|
|
- [x] Console log
|
|
- [x] Log file
|
|
- [x] Specify log level
|
|
|
|
## Installation
|
|
|
|
- create an instance of
|
|
- `git clone https://github.com/tiyn/studip-crawler`
|
|
- `cd studip-crawler/src/`
|
|
- `pip3install -r requirements` - install dependencies
|
|
|
|
## Usage
|
|
|
|
Just run the file via `python3 run.py [options]`.
|
|
Alternatively to `python3 run.py` you can give yourself permissions using `chmod +x run.py [options]` and
|
|
run it with `./run.py [options]`.
|
|
There are several options required to work.
|
|
Run `python3 run.py -h` for a help menu and see which ones are important for you.
|
|
|
|
## Tested StudIP instances
|
|
|
|
- Carl von Ossietzky Universität Oldenburg
|