https://github.com/airen3339/doc_downloader

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.4%) to scientific vocabulary
Last synced: 10 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: airen3339
  • Language: Python
  • Default Branch: master
  • Size: 26.6 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 3 years ago · Last pushed about 3 years ago
Metadata Files
Readme

readme.md

Multi-site document downloader

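This tool downloads previewable documents from the Docin (豆丁), Doc88 (道客巴巴), Taodocs (淘豆网), Book118 (原创力), Sina iAsk (新浪爱问), and Jinchutou (金锄头) websites. Any document that can be previewed can be downloaded. Pages are downloaded as images, which are then converted to a PDF with the reportlab library.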
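
The image-to-PDF step can be sketched as follows. This is a minimal illustration and not the project's actual code: the function names `sort_pages` and `images_to_pdf` are hypothetical, and it assumes that page images carry their page number in the file name and that each PDF page should match its image size.

```python
import re
from pathlib import Path


def sort_pages(paths):
    """Order page images by the last number in each file name (page_2 before page_10)."""
    def page_number(path):
        digits = re.findall(r"\d+", Path(path).stem)
        return int(digits[-1]) if digits else 0
    return sorted(paths, key=page_number)


def images_to_pdf(image_paths, pdf_path):
    """Write one PDF page per image, sized to the image's pixel dimensions."""
    # Imported here so the sorting helper works without reportlab installed.
    from reportlab.lib.utils import ImageReader
    from reportlab.pdfgen import canvas

    pdf = canvas.Canvas(str(pdf_path))
    for path in sort_pages(image_paths):
        width, height = ImageReader(str(path)).getSize()
        pdf.setPageSize((width, height))  # 1 pixel is treated as 1 point
        pdf.drawImage(str(path), 0, 0, width=width, height=height)
        pdf.showPage()  # finish this page and start the next
    pdf.save()
```

A call such as `images_to_pdf(Path("pages").glob("*.png"), "document.pdf")` would then produce a single PDF; the `pages` directory is only an example.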

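Sina iAsk serves its documents as SVG files, so converting them to images requires a third-party library. On Windows you can use svg2png; on Linux you can use rsvg. rsvg can also be installed on Windows: install R, then install the rsvg package from CRAN to perform the SVG conversion.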
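
One way to pick the converter per platform is sketched below. How the project actually invokes the converters is not stated here, so this is an assumption: `svg_to_png_command` and `convert_svg` are hypothetical names, the `--output` option follows the svg2png command-line help and should be checked against the installed version, and the rsvg package is called through `Rscript` using its `rsvg_png` function.

```python
import platform
import subprocess

# R expression that reads the input and output paths from the command line.
R_EXPRESSION = "args <- commandArgs(trailingOnly = TRUE); rsvg::rsvg_png(args[1], args[2])"


def svg_to_png_command(svg_path, png_path, system=None):
    """Return the converter command for the given OS as an argument list."""
    system = system or platform.system()
    if system == "Windows":
        # svg2png CLI, installed with: npm install -g svg2png
        return ["svg2png", svg_path, f"--output={png_path}"]
    # Elsewhere, call the rsvg R package through Rscript.
    return ["Rscript", "-e", R_EXPRESSION, svg_path, png_path]


def convert_svg(svg_path, png_path):
    """Run the converter and raise CalledProcessError if it fails."""
    command = svg_to_png_command(svg_path, png_path)
    # npm installs svg2png as a .cmd shim on Windows, which needs the shell to resolve.
    subprocess.run(command, check=True, shell=(platform.system() == "Windows"))
```

Passing `system` explicitly makes the choice easy to test without running on each platform.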

~~This project also provides a simple online download page: [click to enter]~~

Installing rsvg

Binary packages for OS-X or Windows can be installed directly from CRAN:

install.packages("rsvg")

Installation from source on Linux or OSX requires librsvg2. On Debian or Ubuntu install librsvg2-dev:

sudo apt-get install -y librsvg2-dev

On Fedora, CentOS or RHEL we need librsvg2-devel:

sudo yum install librsvg2-devel

On OS-X use rsvg from Homebrew:

brew install librsvg

Installing svg2png (Windows only)

1. Install Node.js.
2. In a command prompt, run: npm install -g svg2png
3. In a command prompt, run: Set-ExecutionPolicy -ExecutionPolicy

Usage

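In a terminal, run:

pip install -r requirements.txt
python docDownloader.py

If an error occurs, first check whether the chromedriver version is compatible with your Chrome version. If it is not, replace the chromedriver.exe in the project with a compatible version. See the chromedriver download page for available versions.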
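
The compatibility check can be sketched as a comparison of major version numbers. This is an illustration under stated assumptions, not part of the project: the function names are hypothetical, the Chrome version text is assumed to be copied by hand (for example from `chrome://version`), and the rule only holds for ChromeDriver releases that share Chrome's version numbering. Older 2.x drivers used a separate mapping.

```python
import re
import subprocess


def major_version(version_text):
    """Extract the major version from text such as 'ChromeDriver 114.0.5735.90 (...)'."""
    match = re.search(r"(\d+)\.\d+(?:\.\d+)*", version_text)
    if match is None:
        raise ValueError(f"no version number in {version_text!r}")
    return int(match.group(1))


def is_compatible(chromedriver_text, chrome_text):
    """Compare major versions; assumes a driver that follows Chrome's numbering."""
    return major_version(chromedriver_text) == major_version(chrome_text)


def chromedriver_version(executable="chromedriver.exe"):
    """Return the output of `chromedriver --version`."""
    result = subprocess.run(
        [executable, "--version"], capture_output=True, text=True, check=True
    )
    return result.stdout.strip()
```

For example, `is_compatible(chromedriver_version(), "Google Chrome 114.0.5735.198")` reports whether the bundled driver matches that Chrome build.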

Owner

  • Login: airen3339
  • Kind: user

Dependencies

requirements.txt pypi
  • Pillow ==8.3.2
  • beautifulsoup4 ==4.9.3
  • fire ==0.3.1
  • reportlab ==3.5.53
  • requests ==2.23.0
  • selenium ==3.141.0
  • tqdm ==4.46.0