https://github.com/airen3339/doc_downloader
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (5.4%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: airen3339
- Language: Python
- Default Branch: master
- Size: 26.6 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
readme.md
多种文档下载器
本工具适用于下载豆丁、道客巴巴、淘豆网、原创力、新浪爱问、金锄头网站的可以预览的文档。只要可以预览,就可以下载。下载下来是图片格式,然后会通过reportlab库,将图片转换成PDF。
其中,由于新浪爱问网站用的都是svg格式的文件,将其转换成图片格式需要调用第三方库。Windows下可用svg2png库,Linux下可使用rsvg库。当然,在windows上面也可以安装rsvg库,需要下载CRAN,利用CRAN安装rsvg,实现svg的转换。
~~本项目还提供了一个简易的在线下载网页,[点击进入]~~
rsvg库安装方法
Binary packages for OS-X or Windows can be installed directly from CRAN:
r
install.packages("rsvg")
Installation from source on Linux or OSX requires librsvg2. On Debian or Ubuntu install librsvg2-dev:
sudo apt-get install -y librsvg2-dev
On Fedora, CentOS or RHEL we need librsvg2-devel:
sudo yum install librsvg2-devel
`
On OS-X use rsvg from Homebrew:
brew install librsvg
svg2png安装方法(仅限Windows操作系统)
1. 安装nodejs
2. 命令提示符内输入:npm install -g svg2png
3. 命令提示符内输入:Set-ExecutionPolicy -ExecutionPolicy
本项目使用方法
终端内输入:
pip install -r requirements.txt
python docDownloader.py
若使用报错,应先检查chromedriver版本与chrome版本是否兼容。若不兼容,则只需将项目中的chromedriver.exe替换为兼容的版本即可。附chromedriver下载地址
Owner
- Login: airen3339
- Kind: user
- Repositories: 187
- Profile: https://github.com/airen3339
GitHub Events
Total
Last Year
Dependencies
- Pillow ==8.3.2
- beautifulsoup4 ==4.9.3
- fire ==0.3.1
- reportlab ==3.5.53
- requests ==2.23.0
- selenium ==3.141.0
- tqdm ==4.46.0