you-get

:arrow_double_down: Dumb downloader that scrapes the web

https://github.com/soimort/you-get

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 265 committers (0.4%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.6%) to scientific vocabulary

Keywords from Contributors

distributed shellcode tensor fuzzing jax python2 asyncio astronomy forhumans neuroimaging
Last synced: 10 months ago · JSON representation

Repository

:arrow_double_down: Dumb downloader that scrapes the web

Basic Info
  • Host: GitHub
  • Owner: soimort
  • License: other
  • Language: Python
  • Default Branch: develop
  • Homepage: https://you-get.org/
  • Size: 3.37 MB
Statistics
  • Stars: 56,393
  • Watchers: 1,366
  • Forks: 9,805
  • Open Issues: 383
  • Releases: 123
Created almost 14 years ago · Last pushed about 1 year ago
Metadata Files
Readme Changelog Contributing License Security

README.md

You-Get

Build Status PyPI version Gitter

NOTICE (30 May 2022): Support for Python 3.5, 3.6 and 3.7 will eventually be dropped. (see details here)

NOTICE (8 Mar 2019): Read this if you are looking for the conventional "Issues" tab.


You-Get is a tiny command-line utility to download media contents (videos, audios, images) from the Web, in case there is no other handy way to do it.

Here's how you use you-get to download a video from YouTube:

```console $ you-get 'https://www.youtube.com/watch?v=jNQXAC9IVRw' site: YouTube title: Me at the zoo stream: - itag: 43 container: webm quality: medium size: 0.5 MiB (564215 bytes) # download-with: you-get --itag=43 [URL]

Downloading Me at the zoo.webm ... 100% ( 0.5/ 0.5MB) ├██████████████████████████████████┤[1/1] 6 MB/s

Saving Me at the zoo.en.srt ... Done. ```

And here's why you might want to use it:

  • You enjoyed something on the Internet, and just want to download them for your own pleasure.
  • You watch your favorite videos online from your computer, but you are prohibited from saving them. You feel that you have no control over your own computer. (And it's not how an open Web is supposed to work.)
  • You want to get rid of any closed-source technology or proprietary JavaScript code, and disallow things like Flash running on your computer.
  • You are an adherent of hacker culture and free software.

What you-get can do for you:

  • Download videos / audios from popular websites such as YouTube, Youku, Niconico, and a bunch more. (See the full list of supported sites)
  • Stream an online video in your media player. No web browser, no more ads.
  • Download images (of interest) by scraping a web page.
  • Download arbitrary non-HTML contents, i.e., binary files.

Interested? Install it now and get started by examples.

Are you a Python programmer? Then check out the source and fork it!

Installation

Prerequisites

The following dependencies are recommended:

Option 1: Install via pip

The official release of you-get is distributed on PyPI, and can be installed easily from a PyPI mirror via the pip package manager: (Note that you must use the Python 3 version of pip)

$ pip install you-get

Option 2: Install via Antigen (for Zsh users)

Add the following line to your .zshrc:

antigen bundle soimort/you-get

Option 3: Download from GitHub

You may either download the stable (identical with the latest release on PyPI) or the develop (more hotfixes, unstable features) branch of you-get. Unzip it, and put the directory containing the you-get script into your PATH.

Alternatively, run

$ cd path/to/you-get $ [sudo] python -m pip install .

Or

$ cd path/to/you-get $ python -m pip install . --user

to install you-get to a permanent path. (And don't omit the dot . representing the current directory)

You can also use the pipenv to install the you-get in the Python virtual environment.

$ pipenv install -e . $ pipenv run you-get --version you-get: version 0.4.1555, a tiny downloader that scrapes the web.

Option 4: Git clone

This is the recommended way for all developers, even if you don't often code in Python.

$ git clone git://github.com/soimort/you-get.git

Then put the cloned directory into your PATH, or run python -m pip install path/to/you-get to install you-get to a permanent path.

Option 5: Homebrew (Mac only)

You can install you-get easily via:

$ brew install you-get

Option 6: pkg (FreeBSD only)

You can install you-get easily via:

```

pkg install you-get

```

Option 7: Flox (Mac, Linux, and Windows WSL)

You can install you-get easily via:

$ flox install you-get

Shell completion

Completion definitions for Bash, Fish and Zsh can be found in contrib/completion. Please consult your shell's manual for how to take advantage of them.

Upgrading

Based on which option you chose to install you-get, you may upgrade it via:

$ pip install --upgrade you-get

or download the latest release via:

$ you-get https://github.com/soimort/you-get/archive/master.zip

In order to get the latest develop branch without messing up the PIP, you can try:

$ pip install --upgrade --force-reinstall git+https://github.com/soimort/you-get@develop

Getting Started

Download a video

When you get a video of interest, you might want to use the --info/-i option to see all available quality and formats:

``` $ you-get -i 'https://www.youtube.com/watch?v=jNQXAC9IVRw' site: YouTube title: Me at the zoo streams: # Available quality and codecs [ DASH ] ____________________________________ - itag: 242 container: webm quality: 320x240 size: 0.6 MiB (618358 bytes) # download-with: you-get --itag=242 [URL]

- itag:          395
  container:     mp4
  quality:       320x240
  size:          0.5 MiB (550743 bytes)
# download-with: you-get --itag=395 [URL]

- itag:          133
  container:     mp4
  quality:       320x240
  size:          0.5 MiB (498558 bytes)
# download-with: you-get --itag=133 [URL]

- itag:          278
  container:     webm
  quality:       192x144
  size:          0.4 MiB (392857 bytes)
# download-with: you-get --itag=278 [URL]

- itag:          160
  container:     mp4
  quality:       192x144
  size:          0.4 MiB (370882 bytes)
# download-with: you-get --itag=160 [URL]

- itag:          394
  container:     mp4
  quality:       192x144
  size:          0.4 MiB (367261 bytes)
# download-with: you-get --itag=394 [URL]

[ DEFAULT ] _________________________________
- itag:          43
  container:     webm
  quality:       medium
  size:          0.5 MiB (568748 bytes)
# download-with: you-get --itag=43 [URL]

- itag:          18
  container:     mp4
  quality:       small
# download-with: you-get --itag=18 [URL]

- itag:          36
  container:     3gp
  quality:       small
# download-with: you-get --itag=36 [URL]

- itag:          17
  container:     3gp
  quality:       small
# download-with: you-get --itag=17 [URL]

```

By default, the one on the top is the one you will get. If that looks cool to you, download it:

``` $ you-get 'https://www.youtube.com/watch?v=jNQXAC9IVRw' site: YouTube title: Me at the zoo stream: - itag: 242 container: webm quality: 320x240 size: 0.6 MiB (618358 bytes) # download-with: you-get --itag=242 [URL]

Downloading Me at the zoo.webm ... 100% ( 0.6/ 0.6MB) ├██████████████████████████████████████████████████████████████████████████████┤[2/2] 2 MB/s Merging video parts... Merged into Me at the zoo.webm

Saving Me at the zoo.en.srt ... Done. ```

(If a YouTube video has any closed captions, they will be downloaded together with the video file, in SubRip subtitle format.)

Or, if you prefer another format (mp4), just use whatever the option you-get shows to you:

$ you-get --itag=18 'https://www.youtube.com/watch?v=jNQXAC9IVRw'

Note:

  • At this point, format selection has not been generally implemented for most of our supported sites; in that case, the default format to download is the one with the highest quality.
  • ffmpeg is a required dependency, for downloading and joining videos streamed in multiple parts (e.g. on some sites like Youku), and for YouTube videos of 1080p or high resolution.
  • If you don't want you-get to join video parts after downloading them, use the --no-merge/-n option.

Download anything else

If you already have the URL of the exact resource you want, you can download it directly with:

``` $ you-get https://stallman.org/rms.jpg Site: stallman.org Title: rms Type: JPEG Image (image/jpeg) Size: 0.06 MiB (66482 Bytes)

Downloading rms.jpg ... 100% ( 0.1/ 0.1MB) ├████████████████████████████████████████┤[1/1] 127 kB/s ```

Otherwise, you-get will scrape the web page and try to figure out if there's anything interesting to you:

``` $ you-get https://kopasas.tumblr.com/post/69361932517 Site: Tumblr.com Title: [tumblr] tumblrmxhg13jx4n1sftq6do1640 Type: Portable Network Graphics (image/png) Size: 0.11 MiB (118484 Bytes)

Downloading [tumblr] tumblrmxhg13jx4n1sftq6do1640.png ... 100% ( 0.1/ 0.1MB) ├████████████████████████████████████████┤[1/1] 22 MB/s ```

Note:

  • This feature is an experimental one and far from perfect. It works best on scraping large-sized images from popular websites like Tumblr and Blogger, but there is really no universal pattern that can apply to any site on the Internet.

Search on Google Videos and download

You can pass literally anything to you-get. If it isn't a valid URL, you-get will do a Google search and download the most relevant video for you. (It might not be exactly the thing you wish to see, but still very likely.)

$ you-get "Richard Stallman eats"

Pause and resume a download

You may use Ctrl+C to interrupt a download.

A temporary .download file is kept in the output directory. Next time you run you-get with the same arguments, the download progress will resume from the last session. In case the file is completely downloaded (the temporary .download extension is gone), you-get will just skip the download.

To enforce re-downloading, use the --force/-f option. (Warning: doing so will overwrite any existing file or temporary file with the same name!)

Set the path and name of downloaded file

Use the --output-dir/-o option to set the path, and --output-filename/-O to set the name of the downloaded file:

$ you-get -o ~/Videos -O zoo.webm 'https://www.youtube.com/watch?v=jNQXAC9IVRw'

Tips:

  • These options are helpful if you encounter problems with the default video titles, which may contain special characters that do not play well with your current shell / operating system / filesystem.
  • These options are also helpful if you write a script to batch download files and put them into designated folders with designated names.

Proxy settings

You may specify an HTTP proxy for you-get to use, via the --http-proxy/-x option:

$ you-get -x 127.0.0.1:8087 'https://www.youtube.com/watch?v=jNQXAC9IVRw'

However, the system proxy setting (i.e. the environment variable http_proxy) is applied by default. To disable any proxy, use the --no-proxy option.

Tips:

  • If you need to use proxies a lot (in case your network is blocking certain sites), you might want to use you-get with proxychains and set alias you-get="proxychains -q you-get" (in Bash).
  • For some websites (e.g. Youku), if you need access to some videos that are only available in mainland China, there is an option of using a specific proxy to extract video information from the site: --extractor-proxy/-y.

Watch a video

Use the --player/-p option to feed the video into your media player of choice, e.g. mpv or vlc, instead of downloading it:

$ you-get -p vlc 'https://www.youtube.com/watch?v=jNQXAC9IVRw'

Or, if you prefer to watch the video in a browser, just without ads or comment section:

$ you-get -p chromium 'https://www.youtube.com/watch?v=jNQXAC9IVRw'

Tips:

  • It is possible to use the -p option to start another download manager, e.g., you-get -p uget-gtk 'https://www.youtube.com/watch?v=jNQXAC9IVRw', though they may not play together very well.

Load cookies

Not all videos are publicly available to anyone. If you need to log in your account to access something (e.g., a private video), it would be unavoidable to feed the browser cookies to you-get via the --cookies/-c option.

Note:

  • As of now, we are supporting two formats of browser cookies: Mozilla cookies.sqlite and Netscape cookies.txt.

Reuse extracted data

Use --url/-u to get a list of downloadable resource URLs extracted from the page. Use --json to get an abstract of extracted data in the JSON format.

Warning:

  • For the time being, this feature has NOT been stabilized and the JSON schema may have breaking changes in the future.

Supported Sites

| Site | URL | Videos? | Images? | Audios? | | :--: | :-- | :-----: | :-----: | :-----: | | YouTube | https://www.youtube.com/ |✓| | | | X (Twitter) | https://x.com/ |✓|✓| | | VK | https://vk.com/ |✓|✓| | | Vimeo | https://vimeo.com/ |✓| | | | Veoh | https://www.veoh.com/ |✓| | | | Tumblr | https://www.tumblr.com/ |✓|✓|✓| | TED | https://www.ted.com/ |✓| | | | SoundCloud | https://soundcloud.com/ | | |✓| | SHOWROOM | https://www.showroom-live.com/ |✓| | | | Pinterest | https://www.pinterest.com/ | |✓| | | MTV81 | https://www.mtv81.com/ |✓| | | | Mixcloud | https://www.mixcloud.com/ | | |✓| | Metacafe | https://www.metacafe.com/ |✓| | | | Magisto | https://www.magisto.com/ |✓| | | | Khan Academy | https://www.khanacademy.org/ |✓| | | | Internet Archive | https://archive.org/ |✓| | | | Instagram | https://instagram.com/ |✓|✓| | | InfoQ | https://www.infoq.com/presentations/ |✓| | | | Imgur | https://imgur.com/ | |✓| | | Heavy Music Archive | https://www.heavy-music.ru/ | | |✓| | Freesound | https://www.freesound.org/ | | |✓| | Flickr | https://www.flickr.com/ |✓|✓| | | FC2 Video | https://video.fc2.com/ |✓| | | | Facebook | https://www.facebook.com/ |✓| | | | eHow | https://www.ehow.com/ |✓| | | | Dailymotion | https://www.dailymotion.com/ |✓| | | | Coub | https://coub.com/ |✓| | | | CBS | https://www.cbs.com/ |✓| | | | Bandcamp | https://bandcamp.com/ | | |✓| | AliveThai | https://alive.in.th/ |✓| | | | interest.me | https://ch.interest.me/tvn |✓| | | | 755
ナナゴーゴー
| https://7gogo.jp/ |✓|✓| | | niconico
ニコニコ動画
| https://www.nicovideo.jp/ |✓| | | | 163
网易视频
网易云音乐
| https://v.163.com/
https://music.163.com/ |✓| |✓| | 56网 | https://www.56.com/ |✓| | | | AcFun | https://www.acfun.cn/ |✓| | | | Baidu
百度贴吧
| https://tieba.baidu.com/ |✓|✓| | | 爆米花网 | https://www.baomihua.com/ |✓| | | | bilibili
哔哩哔哩
| https://www.bilibili.com/ |✓|✓|✓| | 豆瓣 | https://www.douban.com/ |✓| |✓| | 斗鱼 | https://www.douyutv.com/ |✓| | | | 凤凰视频 | https://v.ifeng.com/ |✓| | | | 风行网 | https://www.fun.tv/ |✓| | | | iQIYI
爱奇艺 | https://www.iqiyi.com/ |✓| | | | 激动网 | https://www.joy.cn/ |✓| | | | 酷6网 | https://www.ku6.com/ |✓| | | | 酷狗音乐 | https://www.kugou.com/ | | |✓| | 酷我音乐 | https://www.kuwo.cn/ | | |✓| | 乐视网 | https://www.le.com/ |✓| | | | 荔枝FM | https://www.lizhi.fm/ | | |✓| | 懒人听书 | https://www.lrts.me/ | | |✓| | 秒拍 | https://www.miaopai.com/ |✓| | | | MioMio弹幕网 | https://www.miomio.tv/ |✓| | | | MissEvan
猫耳FM | https://www.missevan.com/ | | |✓| | 痞客邦 | https://www.pixnet.net/ |✓| | | | PPTV聚力 | https://www.pptv.com/ |✓| | | | 齐鲁网 | https://v.iqilu.com/ |✓| | | | QQ
腾讯视频 | https://v.qq.com/ |✓| | | | 企鹅直播 | https://live.qq.com/ |✓| | | | Sina
新浪视频
微博秒拍视频 | https://video.sina.com.cn/
https://video.weibo.com/ |✓| | | | Sohu
搜狐视频 | https://tv.sohu.com/ |✓| | | | Tudou
土豆
| https://www.tudou.com/ |✓| | | | 阳光卫视 | https://www.isuntv.com/ |✓| | | | Youku
优酷
| https://www.youku.com/ |✓| | | | 战旗TV | https://www.zhanqi.tv/lives |✓| | | | 央视网 | https://www.cntv.cn/ |✓| | | | Naver
네이버 | https://tvcast.naver.com/ |✓| | | | 芒果TV | https://www.mgtv.com/ |✓| | | | 火猫TV | https://www.huomao.com/ |✓| | | | 阳光宽频网 | https://www.365yg.com/ |✓| | | | 西瓜视频 | https://www.ixigua.com/ |✓| | | | 新片场 | https://www.xinpianchang.com/ |✓| | | | 快手 | https://www.kuaishou.com/ |✓|✓| | | 抖音 | https://www.douyin.com/ |✓| | | | TikTok | https://www.tiktok.com/ |✓| | | | 中国体育(TV) | https://v.zhibo.tv/
https://video.zhibo.tv/ |✓| | | | 知乎 | https://www.zhihu.com/ |✓| | |

For all other sites not on the list, the universal extractor will take care of finding and downloading interesting resources from the page.

Known bugs

If something is broken and you-get can't get you things you want, don't panic. (Yes, this happens all the time!)

Check if it's already a known problem on https://github.com/soimort/you-get/wiki/Known-Bugs. If not, follow the guidelines on how to report an issue.

Getting Involved

You can reach us on the Gitter channel #soimort/you-get (here's how you set up your IRC client for Gitter). If you have a quick question regarding you-get, ask it there.

If you are seeking to report an issue or contribute, please make sure to read the guidelines first.

Legal Issues

This software is distributed under the MIT license.

In particular, please be aware that

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Translated to human words:

In case your use of the software forms the basis of copyright infringement, or you use the software for any other illegal purposes, the authors cannot take any responsibility for you.

We only ship the code here, and how you are going to use it is left to your own discretion.

Authors

Made by @soimort, who is in turn powered by :coffee:, :beer: and :ramen:.

You can find the list of all contributors here.

Owner

  • Name: Mort Yao
  • Login: soimort
  • Kind: user
  • Location: Home, Earth

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 2,170
  • Total Committers: 265
  • Avg Commits per committer: 8.189
  • Development Distribution Score (DDS): 0.613
Past Year
  • Commits: 15
  • Committers: 3
  • Avg Commits per committer: 5.0
  • Development Distribution Score (DDS): 0.133
Top Committers
Name Email Commits
Mort Yao s****i@m****a 840
Mort Yao m****o@g****m 489
MaxwellGoblin l****e@o****m 118
jackyzy823 j****3@g****m 71
cnbeining c****g@g****m 39
lilydjwg l****g@g****m 30
David Zhuang d****g@m****a 23
Zhiming Wang z****x@g****m 15
cage 1****4@q****m 12
gongqijian g****n@g****m 12
WaferJay 4****3@q****m 12
Justsoos j****o@g****m 12
Ming Dai r****g@g****m 11
Zhang Ning z****5@g****m 11
Valdemar Erk v****k@g****m 10
hellsof h****f@h****m 8
liushuyu l****1@g****m 7
liushuyu l****1@1****m 6
lh 5****5@g****m 6
iawia002 z****d@j****m 6
perror 1****2@1****m 6
Christian Clauss c****s@m****m 6
steven7851 s****1@m****m 6
pl 0****l@g****m 6
cerenkov c****v@q****m 6
Lee, Donggu g****g@g****m 6
GuanFoxyier 1****7@q****m 5
Chuntao Hong c****g@g****m 5
JayXon j****n@g****m 5
Star Brilliant m****3@h****m 5
and 235 more...

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 0
  • Total pull requests: 167
  • Average time to close issues: N/A
  • Average time to close pull requests: 8 months
  • Total issue authors: 0
  • Total pull request authors: 136
  • Average comments per issue: 0
  • Average comments per pull request: 3.72
  • Merged pull requests: 45
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 29
  • Average time to close issues: N/A
  • Average time to close pull requests: 11 days
  • Issue authors: 0
  • Pull request authors: 19
  • Average comments per issue: 0
  • Average comments per pull request: 1.83
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • soimort (7)
  • archz2 (5)
  • ljhcage (5)
  • URenko (3)
  • crnkv (3)
  • bryanhonof (2)
  • doublevanos (2)
  • cclauss (2)
  • zidingz (2)
  • snowyu (2)
  • PercyDan54 (2)
  • VarVizner (2)
  • ouy160 (2)
  • johzzy (2)
  • khan13-UP (2)
Top Labels
Issue Labels
Pull Request Labels
confirmed (6) Bilibili (6) reviewing (3) broken (2) invalid (2) enhancement (2) python bug (1) feature request (1) waiting for response (1) patches welcome (1) good idea! (1)

Packages

  • Total packages: 4
  • Total downloads:
    • pypi 13,933 last-month
  • Total docker downloads: 2,509
  • Total dependent packages: 6
    (may contain duplicates)
  • Total dependent repositories: 144
    (may contain duplicates)
  • Total versions: 341
  • Total maintainers: 1
pypi.org: you-get

Dumb downloader that scrapes the web

  • Versions: 174
  • Dependent Packages: 6
  • Dependent Repositories: 125
  • Downloads: 13,706 Last month
  • Docker Downloads: 2,509
Rankings
Dependent repos count: 1.3%
Average: 1.9%
Docker downloads count: 2.0%
Dependent packages count: 2.2%
Downloads: 2.2%
Maintainers (1)
Last synced: 10 months ago
proxy.golang.org: github.com/soimort/you-get
  • Versions: 151
  • Dependent Packages: 0
  • Dependent Repositories: 1
Rankings
Stargazers count: 0.0%
Forks count: 0.0%
Average: 3.6%
Dependent repos count: 4.7%
Dependent packages count: 9.6%
Last synced: 10 months ago
pypi.org: you

My next dumb thing.

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 18
  • Downloads: 227 Last month
Rankings
Stargazers count: 0.0%
Forks count: 0.1%
Dependent repos count: 3.4%
Average: 4.9%
Dependent packages count: 10.1%
Downloads: 10.7%
Maintainers (1)
Last synced: 10 months ago
conda-forge.org: you-get

Command line downloader for YouTube and many other online video/ audio sources.

  • Versions: 14
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Stargazers count: 0.2%
Forks count: 0.9%
Average: 21.6%
Dependent repos count: 34.0%
Dependent packages count: 51.2%
Last synced: 11 months ago

Dependencies

.github/workflows/python-package.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite