https://github.com/charlespikachu/paperdl
Paperdl: Search and download paper from specific websites.
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ○ codemeta.json file
- ○ .zenodo.json file
- ○ DOI references
- ✓ Academic publication links (links to: arxiv.org, scholar.google, ieee.org)
- ○ Committers with academic emails
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (11.2%) to scientific vocabulary
Keywords
arxiv
arxiv-papers
baidu
baiduwenku
google
googlescholar
Last synced: 5 months ago
Repository
Paperdl: Search and download paper from specific websites.
Basic Info
- Host: GitHub
- Owner: CharlesPikachu
- License: apache-2.0
- Language: Python
- Default Branch: main
- Homepage: https://paperdl.readthedocs.io/
- Size: 885 KB
Statistics
- Stars: 39
- Watchers: 1
- Forks: 7
- Open Issues: 0
- Releases: 0
Created almost 4 years ago · Last pushed over 3 years ago
https://github.com/CharlesPikachu/paperdl/blob/main/
Documents: https://paperdl.readthedocs.io/

# Paperdl

```
Search and download paper from specific websites.
You can star this repository to keep track of the project if it's helpful for you, thank you for your support.
```

# Support List

| Source | Support Search? | Support Download? |
| :----: | :----: | :----: |
| [scihub](https://sci-hub.st/) | | |
| [baiduwenku](https://wenku.baidu.com/) | | |
| [arxiv](https://arxiv.org/) | | |
| [googlescholar](https://scholar.google.com/) | | |

# Install

#### Pip install

```
pip install paperdl
```

#### Source code install

```sh
# (1) Offline: clone the repository, then install from the local copy
git clone https://github.com/CharlesPikachu/paperdl.git
cd paperdl
python setup.py install

# (2) Online: install directly from GitHub
pip install git+https://github.com/CharlesPikachu/paperdl.git@master
```

# Quick Start

#### Calling API

To search for and download papers from arXiv and Google Scholar, you can write code as follows:

```python
from paperdl import paperdl

# Log file, output directory, number of results per source, and proxies
config = {'logfilepath': 'paperdl.log', 'savedir': 'papers', 'search_size_per_source': 5, 'proxies': {}}
target_srcs = ['arxiv', 'googlescholar']
client = paperdl.Paperdl(config=config)
client.run(target_srcs)
```

If you cannot access Google, set `area` to `CN` in the config:

```python
config = {'logfilepath': 'paperdl.log', 'savedir': 'papers', 'search_size_per_source': 5, 'proxies': {}, 'area': 'CN'}
```

You can also skip the search step and download a paper directly through Sci-Hub:

```python
from paperdl import paperdl

config = {'logfilepath': 'paperdl.log', 'savedir': 'papers', 'search_size_per_source': 5, 'proxies': {}}
client = paperdl.SciHub(config=config, logger_handle=paperdl.Logger('paper.log'))
# Describe the paper to fetch: output file name, extension, directory, and the input link
paperinfo = {
    'savename': '9193963',
    'ext': 'pdf',
    'savedir': 'outputs',
    'input': 'https://ieeexplore.ieee.org/document/9193963/',
    'source': 'scihub',
}
client.download([paperinfo])
```

#### Calling EXE

```sh
Usage: paperdl [OPTIONS]

Options:
  --version               Show the version and exit.
  -m, --mode TEXT         the used mode, support "search" and "download"
  -i, --inp TEXT          the paper to download, the supported format is the
                          same as sci-hub
  -s, --source TEXT       the used source, support "arxiv", "scihub" and
                          "googlescholar", you can use "," to split multi
                          sources
  -d, --savedir TEXT      the directory for saving papers
  -l, --logfilepath TEXT  the logging filepath
  -z, --size INTEGER      search size per source
  -p, --proxies TEXT      the proxies to be adopted
  -a, --area TEXT         your area, support "CN" and "EN"
  -c, --cookie TEXT       the cookie copied from the target website, only used
                          in "baiduwenku"
  --help                  Show this message and exit.
```

# Projects in Charles_pikachu

- [Games](https://github.com/CharlesPikachu/Games): Create interesting games in pure Python.
- [DecryptLogin](https://github.com/CharlesPikachu/DecryptLogin): APIs for logging in to some websites using requests.
- [Musicdl](https://github.com/CharlesPikachu/musicdl): A lightweight music downloader written in pure Python.
- [Videodl](https://github.com/CharlesPikachu/videodl): A lightweight video downloader written in pure Python.
- [Pytools](https://github.com/CharlesPikachu/pytools): Some useful tools written in pure Python.
- [PikachuWeChat](https://github.com/CharlesPikachu/pikachuwechat): Play WeChat with itchat-uos.
- [Pydrawing](https://github.com/CharlesPikachu/pydrawing): Beautify your image or video.
- [ImageCompressor](https://github.com/CharlesPikachu/imagecompressor): Image compressors written in pure Python.
- [FreeProxy](https://github.com/CharlesPikachu/freeproxy): Collecting free proxies from the internet.
- [Paperdl](https://github.com/CharlesPikachu/paperdl): Search and download paper from specific websites.
- [Sciogovterminal](https://github.com/CharlesPikachu/sciogovterminal): Browse "The State Council Information Office of the People's Republic of China" in the terminal.
- [CodeFree](https://github.com/CharlesPikachu/codefree): Make no code a reality.
- [DeepLearningToys](https://github.com/CharlesPikachu/deeplearningtoys): Some deep learning toys implemented in PyTorch.
- [DataAnalysis](https://github.com/CharlesPikachu/dataanalysis): Some data analysis projects in charles_pikachu.
- [Imagedl](https://github.com/CharlesPikachu/imagedl): Search and download images from specific websites.
- [Pytoydl](https://github.com/CharlesPikachu/pytoydl): A toy deep learning framework built upon NumPy.
- [NovelDL](https://github.com/CharlesPikachu/noveldl): Search and download novels from some specific websites.

# More

#### WeChat Official Accounts

*Charles_pikachu*
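The Quick Start examples in the README repeat the same config dictionary with small variations. As a minimal sketch, a helper like the one below could assemble it in one place. The names `build_config`, `search_and_download`, and `SUPPORTED_AREAS` are hypothetical and not part of paperdl; the sketch also assumes that the `area` config key accepts the same values ("CN" and "EN") that the command-line help lists for `--area`, which the README does not state explicitly.

```python
# Hypothetical helper, not part of paperdl: builds the config dict used in the
# Quick Start examples. Key names are copied from the README.
SUPPORTED_AREAS = ("CN", "EN")  # assumption: same values as the CLI --area option


def build_config(savedir="papers", logfilepath="paperdl.log",
                 search_size_per_source=5, proxies=None, area=None):
    """Return a config dict with the keys shown in the README examples."""
    if search_size_per_source < 1:
        raise ValueError("search_size_per_source must be a positive integer")
    config = {
        "logfilepath": logfilepath,
        "savedir": savedir,
        "search_size_per_source": search_size_per_source,
        "proxies": dict(proxies or {}),  # copy so the caller's dict is not shared
    }
    if area is not None:
        if area not in SUPPORTED_AREAS:
            raise ValueError(f"area must be one of {SUPPORTED_AREAS}, got {area!r}")
        config["area"] = area
    return config


def search_and_download(target_srcs, **config_kwargs):
    """Run a search with the README's Paperdl client and the built config."""
    # Imported lazily so build_config can be used without paperdl installed.
    from paperdl import paperdl
    client = paperdl.Paperdl(config=build_config(**config_kwargs))
    client.run(target_srcs)
```

With this sketch, `search_and_download(['arxiv', 'googlescholar'], area='CN')` would correspond to the second Quick Start configuration, provided paperdl is installed.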
Owner
- Login: CharlesPikachu
- Kind: user
- Location: HongKong
- Company: HKU
- Website: https://charlespikachu.github.io/
- Repositories: 27
- Profile: https://github.com/CharlesPikachu
Committers
Last synced: almost 3 years ago
All Time
- Total Commits: 30
- Total Committers: 2
- Avg Commits per committer: 15.0
- Development Distribution Score (DDS): 0.033
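The page does not say how the Development Distribution Score is computed. A commonly used definition is one minus the share of commits made by the most active committer, and the sketch below assumes that definition. The function name is hypothetical; the commit counts (29 and 1) are taken from the Top Committers table on this page.

```python
def development_distribution_score(commits_per_committer):
    """DDS = 1 - (commits by the top committer / total commits).

    Assumed definition; values near 0 mean one committer dominates.
    """
    total = sum(commits_per_committer)
    if total == 0:
        raise ValueError("at least one commit is required")
    return 1 - max(commits_per_committer) / total


commits = [29, 1]  # per-committer counts from the Top Committers table
dds = development_distribution_score(commits)
avg_commits = sum(commits) / len(commits)

print(f"DDS: {dds:.3f}")                               # DDS: 0.033
print(f"Avg commits per committer: {avg_commits:.1f}")  # Avg commits per committer: 15.0
```

Under this assumption both printed values match the figures listed above: 30 total commits, an average of 15.0 per committer, and a score of 0.033.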
Top Committers
| Name | Email | Commits |
|---|---|---|
| CharlesPikachu | 1****1@q****m | 29 |
| CharlesPikachu | 3****u@u****m | 1 |
Committer Domains (Top 20 + Academic)
qq.com: 1
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 0
- Total pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- ItsaFugazi (1)
Packages
- Total packages: 1
- Total downloads:
  - pypi: 37 last month
- Total dependent packages: 0
- Total dependent repositories: 1
- Total versions: 7
- Total maintainers: 1
pypi.org: paperdl
Paperdl: Search and download paper from specific websites
- Homepage: https://github.com/CharlesPikachu/paperdl
- Documentation: https://paperdl.readthedocs.io/
- License: Apache License 2.0
- Latest release: 0.1.6 (published almost 4 years ago)
Rankings
Dependent packages count: 10.1%
Stargazers count: 10.7%
Forks count: 16.8%
Average: 21.5%
Dependent repos count: 21.6%
Downloads: 48.4%
Maintainers (1)
Last synced: 6 months ago
Dependencies
docs/requirements.txt
pypi
- recommonmark *
- sphinx ==4.5.0
- sphinx_markdown_tables ==0.0.12
- sphinx_rtd_theme *
requirements.txt
pypi
- alive-progress *
- beautifulsoup4 *
- click *
- lxml *
- prettytable *
- requests *
setup.py
pypi
- lab.strip *
- readlines *