mirror of
https://github.com/tiyn/stud.ip-crawler.git
synced 2025-04-03 16:37:48 +02:00
The log has options for several levels that can be set from the command line. The file is hardcoded as log.txt and can be toggled
Stud.IP Crawler
This is a program that downloads all files available for a given Stud.IP user. It only downloads and searches through the courses in the current semester. If you run the program again it only downloads files that have changed since the last time running it.
Features/To-Dos
- Downloads files of given users active semester via commandline
- Keeping file structure of Stud.IP
- Specify username
- Specify password
- Specify Stud.IP-URL
- Specify output directory
- Specify chunk size to download big files
- Specify all important database variables
- Only download files after given date
- Save and read download date
- Possible reset of download date
- Incremental file download
- Store id and chdate of downloaded files
- Logging
- Console log
- Log file
- Specify log level
Installation
- create an instance of
git clone https://github.com/tiyn/studip-crawler
cd studip-crawler/src/
pip3install -r requirements
- install dependencies
Usage
Just run the file via python3 run.py [options]
.
Alternatively to python3 run.py
you can give yourself permissions using chmod +x run.py [options]
and
run it with ./run.py [options]
.
There are several options required to work.
Run python3 run.py -h
for a help menu and see which ones are important for you.
Tested StudIP instances
- Carl von Ossietzky Universität Oldenburg
Languages
Python
96.3%
Dockerfile
2.8%
Shell
0.9%