Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.1%) to scientific vocabulary

Repository

Basic Info
  • Host: GitHub
  • Owner: Boubik
  • License: MIT
  • Language: Python
  • Default Branch: main
  • Size: 68.4 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created almost 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation

README.md

Czech National Library Extractor

Description

The Czech National Library Extractor is a Python-based project tailored for extracting data from the Czech National Library's catalog. It enables users to generate a CSV file containing book information derived from a search query link provided by the National Library. This tool is particularly beneficial for researchers, librarians, and anyone interested in efficiently accessing and managing book data from the Czech National Library.
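The CSV-generation step described above can be pictured with a minimal sketch. It is not the project's actual code: the column names (`title`, `author`, `year`, `isbn`) and the helper `books_to_csv` are hypothetical, chosen only to show how a list of extracted book records might be serialized with Python's standard `csv` module.

```python
import csv
import io

# Hypothetical column names; the real tool's CSV layout may differ.
FIELDNAMES = ["title", "author", "year", "isbn"]


def books_to_csv(books, fieldnames=FIELDNAMES):
    """Serialize a list of book dicts to CSV text with a header row."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, extrasaction="ignore")
    writer.writeheader()
    for book in books:
        # Missing keys become empty cells instead of raising an error.
        writer.writerow({name: book.get(name, "") for name in fieldnames})
    return buffer.getvalue()


if __name__ == "__main__":
    # Illustrative sample rows, not output of the real extractor.
    sample = [
        {"title": "Babička", "author": "Němcová, Božena", "year": "1855"},
        {"title": "R.U.R.", "author": "Čapek, Karel", "year": "1920"},
    ]
    print(books_to_csv(sample))
```

Because author names in library catalogs often contain commas, the sketch relies on `csv.DictWriter` for quoting rather than joining fields by hand.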

Installation

  1. Clone the repository:

     ```bash
     git clone https://github.com/Boubik/CNLE.git
     ```

  2. Navigate to the project directory:

     ```bash
     cd CNLE
     ```

  3. If you are using Python 3.5 or older, install virtualenv:

     ```bash
     pip install virtualenv
     ```

  4. Create a virtual environment:

     ```bash
     python -m venv venv   # For Python 3.6 and newer
     # or
     virtualenv venv       # For Python 3.5 and older
     ```

  5. Activate the virtual environment:

     - For Unix-like systems:

       ```bash
       source venv/bin/activate
       ```

     - For Windows (Command Prompt):

       ```bat
       venv\Scripts\activate.bat
       ```

  6. Install the required dependencies:

     ```bash
     pip install -r requirements.txt
     ```

  7. Fill in the required settings in the configuration file, config.py.

  8. Run the project:

     ```bash
     python app.py
     ```

  9. Open http://localhost:5000 in your browser; the app listens on port 5000 of localhost.

Features

  • Extracts data from the Czech National Library.
  • Saves data to a CSV file.
  • Deduplication system (currently in beta).
  • Hosted on cnle.boubik.cz, making it accessible as a web service.
  • Sends extracted data directly to your email.
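The README does not describe how the beta deduplication works. As a rough illustration only, one common approach is to compare records on a normalized key. The sketch below assumes hypothetical `title` and `author` fields and treats two records as duplicates when those fields match after lowercasing, stripping diacritics, and collapsing whitespace; the project's real rules may be different.

```python
import unicodedata


def normalize(text):
    """Lowercase, strip diacritics, and collapse whitespace for comparison."""
    decomposed = unicodedata.normalize("NFKD", text)
    without_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(without_marks.casefold().split())


def deduplicate(books, key_fields=("title", "author")):
    """Keep the first record for each normalized key, preserving input order."""
    seen = set()
    unique = []
    for book in books:
        # key_fields are hypothetical; pick whatever identifies a book in your data.
        key = tuple(normalize(book.get(field, "")) for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(book)
    return unique
```

Stripping diacritics makes the match tolerant of inconsistent catalog entries, but it can also merge titles that differ only by accents, so a stricter key (for example, one that includes an ISBN) may be preferable when that data is available.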

License

This project is licensed under the MIT License.

Owner

  • Name: Jan Chlouba
  • Login: Boubik
  • Kind: user

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: Czech National Library Extractor
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: Jan
    family-names: Chlouba
    email: chlouba@boubik.cz
repository-code: 'https://github.com/Boubik/CNLE'
url: 'https://cnle.boubik.cz'
abstract: >-
  The Czech National Library Extractor is a Python-based
  project tailored for extracting data from the Czech
  National Library's catalog. It enables users to generate a
  CSV file containing book information derived from a search
  query link provided by the National Library. This tool is
  particularly beneficial for researchers, librarians, and
  anyone interested in efficiently accessing and managing
  book data from the Czech National Library.
license: MIT

Dependencies

requirements.txt pypi
  • bs4 *
  • selenium *
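
After installing the requirements, a quick way to confirm that both dependencies are importable in the active environment is a small check like the following. This is a convenience sketch, not part of the project; the helper name `missing_packages` is made up for the example.

```python
import importlib.util


def missing_packages(module_names):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # bs4 and selenium are the import names of the packages in requirements.txt.
    missing = missing_packages(["bs4", "selenium"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All dependencies are importable.")
```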