https://github.com/asadprodhan/how-to-manually-download-the-metadata-of-all-the-available-genomes-of-an-organism-in-ncbi
How to manually download the metadata of all the available genomes of an organism in NCBI?
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: ncbi.nlm.nih.gov -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.3%) to scientific vocabulary
Repository
How to manually download the metadata of all the available genomes of an organism in NCBI?
Basic Info
- Host: GitHub
- Owner: asadprodhan
- License: gpl-3.0
- Default Branch: main
- Size: 1.24 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
How to manually download the metadata of all the available genomes of an organism in NCBI?
AUTHOR: Dr Asad Prodhan https://asadprodhan.github.io/
Step 1: Search your organism in the NCBI database
Go to the NCBI website: https://www.ncbi.nlm.nih.gov/
Search your organism as shown for Fusarium in the red rectangular box in Fig. 1
The search results will be displayed under different categories (Fig. 1)
Click on the Genomes database as indicated by the red hexagon in Fig. 1
Figure 1: Search your organism in the NCBI database.
Step 2: Select the metadata of your interest
You will see something like Fig. 2
Tick the assembly box as indicated by the red circle in the Fig. 2
Click on the Select columns tab as indicated by the red rectangle in the Fig. 2
Figure 2: Select the metadata of your interest.
- Select the metadata of your interest and apply (Fig. 3)
Figure 3: Available metadata.
Step 3: Download the metadata
Click on the Download tab as indicated by the red circle in Fig. 4
Select Download Table from the drop-down menu
Select Tab-separated values (TSV) format
Click Download
Figure 4: Download the metadata.
Step 3: Convert the TSV file into excel sheet
A TSV file containing the metadata will be downloaded.
You can open the TSV file in excel like Fig. 5.
Figure 5: Metadata sheet.
Now, you have an excel sheet containing the metadata of all the available genome sequences of Fusarium.
You can sort out these information and short list the assembly numbers that you want download.
See this tutorial on how to download genome sequences automatically:
https://github.com/asadprodhan/How-to-download-all-the-available-genome-sequences-of-an-organism-automatically
Owner
- Name: Asad Prodhan
- Login: asadprodhan
- Kind: user
- Location: Perth, Australia
- Company: Department of Primary Industries and Regional Development
- Website: www.linkedin.com/in/asadprodhan
- Twitter: Asad_Prodhan
- Repositories: 2
- Profile: https://github.com/asadprodhan
Laboratory Scientist at DPIRD. My work involves Oxford Nanopore Sequencing and Bioinformatics for pest and pathogen diagnosis.