Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.1%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Boubik
- License: mit
- Language: Python
- Default Branch: main
- Size: 68.4 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Czech National Library Extractor
Description
The Czech National Library Extractor is a Python-based project tailored for extracting data from the Czech National Library's catalog. It enables users to generate a CSV file containing book information derived from a search query link provided by the National Library. This tool is particularly beneficial for researchers, librarians, and anyone interested in efficiently accessing and managing book data from the Czech National Library.
Table of Contents
Installation
- Clone the repository:
bash
git clone https://github.com/Boubik/CNLE.git
- Navigate to the project directory:
cd CNLE
bash
cd CNLE
- If you are using Python 3.5 or older, install virtualenv:
```bash pip install virtualenv
```
- Make virtual Python:
python -m venv venv
bash
python -m venv venv # For Python 3.6 and newer
# or
virtualenv venv # For Python 3.5 and older
- Use virtual Python:
source venv/bin/activatefor Unix like systems orvenv\Scripts\activate.batfor Windows
- For Unix-like systems:
```bash
source venv/bin/activate
```
- For Windows:
```powershell
venv\Scripts\activate.bat
```
Install the required dependencies:
pip install -r requirements.txtbash pip install -r requirements.txtSetup all needed information in the configuration file:
config.pyRun the project:
bash python app.pyConnects to localhost on the port 5000: http://localhost:5000
Features
- Extracts data from the Czech National Library.
- Saves data to a CSV file.
- Deduplication system (currently in beta).
- Hosted on cnle.boubik.cz, making it accessible as a web service.
- Sends extracted data directly to your email.
License
This project is licensed under the MIT License.
Owner
- Name: Jan Chlouba
- Login: Boubik
- Kind: user
- Repositories: 2
- Profile: https://github.com/Boubik
Citation (CITATION.cff)
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: Czech National Library Extractor
message: >-
If you use this software, please cite it using the
metadata from this file.
type: software
authors:
- given-names: Jan
family-names: Chloubik
email: chlouba@boubik.cz
repository-code: 'https://github.com/Boubik/CNLE'
url: 'https://cnle.boubik.cz'
abstract: >-
The Czech National Library Extractor is a Python-based
project tailored for extracting data from the Czech
National Library's catalog. It enables users to generate a
CSV file containing book information derived from a search
query link provided by the National Library. This tool is
particularly beneficial for researchers, librarians, and
anyone interested in efficiently accessing and managing
book data from the Czech National Library.
license: MIT
GitHub Events
Total
Last Year
Dependencies
- bs4 *
- selenium *