https://github.com/charlespikachu/paperdl

Paperdl: Search and download paper from specific websites.

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org, scholar.google, ieee.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.2%) to scientific vocabulary

Keywords

arxiv arxiv-papers baidu baiduwenku google googlescholar
Last synced: 5 months ago

Repository

Basic Info
Statistics
  • Stars: 39
  • Watchers: 1
  • Forks: 7
  • Open Issues: 0
  • Releases: 0
Topics
arxiv arxiv-papers baidu baiduwenku google googlescholar
Created almost 4 years ago · Last pushed over 3 years ago

https://github.com/CharlesPikachu/paperdl/blob/main/


[![docs](https://img.shields.io/badge/docs-latest-blue)](https://paperdl.readthedocs.io/)
[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/paperdl)](https://pypi.org/project/paperdl/)
[![PyPI](https://img.shields.io/pypi/v/paperdl)](https://pypi.org/project/paperdl)
[![license](https://img.shields.io/github/license/CharlesPikachu/paperdl.svg)](https://github.com/CharlesPikachu/paperdl/blob/master/LICENSE)
[![PyPI - Downloads](https://pepy.tech/badge/paperdl)](https://pypi.org/project/paperdl/)
[![PyPI - Downloads](https://img.shields.io/pypi/dm/paperdl?style=flat-square)](https://pypi.org/project/paperdl/)
[![issue resolution](https://isitmaintained.com/badge/resolution/CharlesPikachu/paperdl.svg)](https://github.com/CharlesPikachu/paperdl/issues)
[![open issues](https://isitmaintained.com/badge/open/CharlesPikachu/paperdl.svg)](https://github.com/CharlesPikachu/paperdl/issues)

Documents: https://paperdl.readthedocs.io/

# Paperdl

```
Search and download paper from specific websites.
You can star this repository to keep track of the project if it's helpful for you, thank you for your support.
```

# Support List

| Source | Support Search? | Support Download? |
| :----: | :----: | :----: |
| [scihub](https://sci-hub.st/) | | |
| [baiduwenku](https://wenku.baidu.com/) | | |
| [arxiv](https://arxiv.org/) | | |
| [googlescholar](https://scholar.google.com/) | | |

# Install

#### Pip install

```
run "pip install paperdl"
```

#### Source code install

```sh
(1) Offline
Step1: git clone https://github.com/CharlesPikachu/paperdl.git
Step2: cd paperdl -> run "python setup.py install"
(2) Online
run "pip install git+https://github.com/CharlesPikachu/paperdl.git@master"
```

# Quick Start

#### Calling API

If you want to search for and download papers from arXiv and Google Scholar, you can write code as follows:

```python
from paperdl import paperdl

# Logging, output directory, results per source, and optional proxies
config = {'logfilepath': 'paperdl.log', 'savedir': 'papers', 'search_size_per_source': 5, 'proxies': {}}
# Sources to query
target_srcs = ['arxiv', 'googlescholar']
client = paperdl.Paperdl(config=config)
client.run(target_srcs)
```

In addition, if you cannot access Google, you can set the config as follows:

```python
config = {'logfilepath': 'paperdl.log', 'savedir': 'papers', 'search_size_per_source': 5, 'proxies': {}, 'area': 'CN'}
```

You can also download papers directly through Sci-Hub, without searching, as follows:

```python
from paperdl import paperdl

config = {'logfilepath': 'paperdl.log', 'savedir': 'papers', 'search_size_per_source': 5, 'proxies': {}}
client = paperdl.SciHub(config=config, logger_handle=paperdl.Logger('paper.log'))
# Describe the paper to fetch: output name, extension, directory, and the input link
paperinfo = {
    'savename': '9193963',
    'ext': 'pdf',
    'savedir': 'outputs',
    'input': 'https://ieeexplore.ieee.org/document/9193963/',
    'source': 'scihub',
}
client.download([paperinfo])
```

#### Calling EXE

```sh
Usage: paperdl [OPTIONS]

Options:
  --version               Show the version and exit.
  -m, --mode TEXT         the used mode, support "search" and "download"
  -i, --inp TEXT          the paper to download, the supported format is the same as sci-hub
  -s, --source TEXT       the used source, support "arxiv", "scihub" and "googlescholar", you can use "," to split multi sources
  -d, --savedir TEXT      the directory for saving papers
  -l, --logfilepath TEXT  the logging filepath
  -z, --size INTEGER      search size per source
  -p, --proxies TEXT      the proxies to be adopted
  -a, --area TEXT         your area, support "CN" and "EN"
  -c, --cookie TEXT       the cookie copied from the target website, only used in "baiduwenku"
  --help                  Show this message and exit.
```

# Screenshot

![img](./docs/screenshot.gif)

# Projects in Charles_pikachu

- [Games](https://github.com/CharlesPikachu/Games): Create interesting games in pure Python.
- [DecryptLogin](https://github.com/CharlesPikachu/DecryptLogin): APIs for logging in to some websites using requests.
- [Musicdl](https://github.com/CharlesPikachu/musicdl): A lightweight music downloader written in pure Python.
- [Videodl](https://github.com/CharlesPikachu/videodl): A lightweight video downloader written in pure Python.
- [Pytools](https://github.com/CharlesPikachu/pytools): Some useful tools written in pure Python.
- [PikachuWeChat](https://github.com/CharlesPikachu/pikachuwechat): Play WeChat with itchat-uos.
- [Pydrawing](https://github.com/CharlesPikachu/pydrawing): Beautify your image or video.
- [ImageCompressor](https://github.com/CharlesPikachu/imagecompressor): Image compressors written in pure Python.
- [FreeProxy](https://github.com/CharlesPikachu/freeproxy): Collecting free proxies from the internet.
- [Paperdl](https://github.com/CharlesPikachu/paperdl): Search and download paper from specific websites.
- [Sciogovterminal](https://github.com/CharlesPikachu/sciogovterminal): Browse "The State Council Information Office of the People's Republic of China" in the terminal.
- [CodeFree](https://github.com/CharlesPikachu/codefree): Make no code a reality.
- [DeepLearningToys](https://github.com/CharlesPikachu/deeplearningtoys): Some deep learning toys implemented in PyTorch.
- [DataAnalysis](https://github.com/CharlesPikachu/dataanalysis): Some data analysis projects in charles_pikachu.
- [Imagedl](https://github.com/CharlesPikachu/imagedl): Search and download images from specific websites.
- [Pytoydl](https://github.com/CharlesPikachu/pytoydl): A toy deep learning framework built upon NumPy.
- [NovelDL](https://github.com/CharlesPikachu/noveldl): Search and download novels from some specific websites.

# More

#### WeChat Official Accounts

*Charles_pikachu*

![img](./docs/pikachu.jpg)
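
The `config` and `paperinfo` dictionaries in the Quick Start examples repeat the same keys each time, so it may help to build them with small helpers. The sketch below is only an illustration: `build_config`, `build_paperinfo`, and `parse_sources` are hypothetical names that are not part of paperdl, and taking the last URL path segment as `savename` is an assumption modeled on the IEEE example. It does not import paperdl; it only produces plain dictionaries and lists in the shape the README shows.

```python
from urllib.parse import urlparse


def build_config(savedir="papers", search_size=5, area=None, proxies=None):
    """Return a config dict with the keys used in the Quick Start examples."""
    config = {
        "logfilepath": "paperdl.log",
        "savedir": savedir,
        "search_size_per_source": search_size,
        "proxies": proxies or {},
    }
    if area is not None:
        # The README sets 'area': 'CN' when Google is not reachable.
        config["area"] = area
    return config


def build_paperinfo(url, savedir="outputs"):
    """Return a paperinfo dict shaped like the Sci-Hub example (hypothetical helper)."""
    # Assumption: name the file after the last non-empty path segment of the URL.
    segments = [s for s in urlparse(url).path.split("/") if s]
    savename = segments[-1] if segments else "paper"
    return {
        "savename": savename,
        "ext": "pdf",
        "savedir": savedir,
        "input": url,
        "source": "scihub",
    }


def parse_sources(text):
    """Split a comma-separated source string, as the -s option describes."""
    return [s.strip() for s in text.split(",") if s.strip()]


if __name__ == "__main__":
    print(build_config(area="CN"))
    print(build_paperinfo("https://ieeexplore.ieee.org/document/9193963/"))
    print(parse_sources("arxiv,googlescholar"))
```

The resulting dictionaries could then be passed to `paperdl.Paperdl(config=...)` or `client.download([...])` exactly as in the examples above.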

Owner

  • Login: CharlesPikachu
  • Kind: user
  • Location: HongKong
  • Company: HKU

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 30
  • Total Committers: 2
  • Avg Commits per committer: 15.0
  • Development Distribution Score (DDS): 0.033
Top Committers
Name Email Commits
CharlesPikachu 1****1@q****m 29
CharlesPikachu 3****u@u****m 1
Committer Domains (Top 20 + Academic)
qq.com: 1
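
The committer figures above can be reproduced from the per-committer commit counts. This is a minimal sketch that assumes the Development Distribution Score is one minus the top committer's share of all commits. That definition is not stated on this page, and `committer_stats` is a hypothetical helper name.

```python
def committer_stats(commit_counts):
    """Return (total commits, average per committer, DDS) for a list of counts."""
    total = sum(commit_counts)
    average = total / len(commit_counts)
    # Assumed definition: DDS = 1 - (commits by top committer / total commits)
    dds = 1 - max(commit_counts) / total
    return total, average, dds


if __name__ == "__main__":
    # Counts from the Top Committers table: 29 and 1
    total, average, dds = committer_stats([29, 1])
    print(total, average, round(dds, 3))  # prints: 30 15.0 0.033
```

Under that assumption the result matches the listed values: 30 total commits, 15.0 per committer, and a DDS of 0.033, which reflects that nearly all commits come from a single identity.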

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 0
  • Total pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • ItsaFugazi (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 37 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 7
  • Total maintainers: 1
pypi.org: paperdl

Paperdl: Search and download paper from specific websites

  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 37 Last month
Rankings
Dependent packages count: 10.1%
Stargazers count: 10.7%
Forks count: 16.8%
Average: 21.5%
Dependent repos count: 21.6%
Downloads: 48.4%
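
The "Average" entry appears to be the unweighted mean of the other five ranking percentages. That is an assumption inferred from the numbers, not something the page states. A quick check:

```python
# Ranking percentages as listed above (excluding the "Average" row itself)
rankings = {
    "dependent_packages": 10.1,
    "stargazers": 10.7,
    "forks": 16.8,
    "dependent_repos": 21.6,
    "downloads": 48.4,
}

# Unweighted mean of the five values
average = sum(rankings.values()) / len(rankings)

if __name__ == "__main__":
    print(f"{average:.1f}%")  # prints: 21.5%
```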
Maintainers (1)
Last synced: 6 months ago

Dependencies

docs/requirements.txt pypi
  • recommonmark *
  • sphinx ==4.5.0
  • sphinx_markdown_tables ==0.0.12
  • sphinx_rtd_theme *
requirements.txt pypi
  • alive-progress *
  • beautifulsoup4 *
  • click *
  • lxml *
  • prettytable *
  • requests *
setup.py pypi
  • lab.strip *
  • readlines *