https://github.com/ggnowayback/cathodedataextractor
A document-level information extraction pipeline for layered cathode materials for sodium-ion batteries.
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 2 DOI reference(s) in README -
✓Academic publication links
Links to: rsc.org -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.1%) to scientific vocabulary
Keywords
Repository
A document-level information extraction pipeline for layered cathode materials for sodium-ion batteries.
Basic Info
Statistics
- Stars: 8
- Watchers: 1
- Forks: 2
- Open Issues: 1
- Releases: 2
Topics
Metadata Files
README.md
CathodeDataExtractor
Cathodedataextractor is a lightweight document-level information extraction pipeline that can automatically extract
comprehensive properties related to synthesis parameters, cycling and rate performance of cathode materials from the
literature of layered cathode materials for sodium-ion batteries.
Installation
pip install cathodedataextractor
Features
- It is built on open-source libraries: pymatgen, text2chem, and ChemDataExtractor v2 with some modifications.
- BatterySciBERT-uncased Multi-Label text classification model for filtering documents.
- Automated comprehensive data extraction pipeline for cathode materials.
- Paragraph Multi-Class classification algorithms for documents (HTML/XML) from the RSC and Elsevier.
- A normalised entity handling process is provided.
- An effective chemical abbreviation detection module.
- Heuristic multi-level relation extraction algorithm for electrochemical properties.
In addition, the pipeline is also suitable for string sequence text extraction.
Quick start
Extract from documents
```python from glob import iglob from cathodedataextractor.informationextractionpipe import Pipeline
pipline = Pipeline() for document in iglob('*ml'): extraction_results = pipline.extract(document) ```
Extract from string
```python from cathodedataextractor.informationextractionpipe import Pipeline
extractionresults = Pipeline.fromstring( 'Apart from the conventional cationic redox of transition metals, ' 'both Na-deficit and Na-excess materials have showcased the ability ' 'to exploit oxygen redox activity as O2–/O2n– for a charge ' 'compensation mechanism. To realize cathodes with enhanced energy ' 'density, a technique like the incorporation of alkali metal ions ' 'into transition metal layers has been adopted. Recent work by Boisse ' '(13) et al. displayed the impact of honeycomb cation ordering of ' 'a highly stabilized intermediate phase for a Na2RuO3 cathode material ' 'in instigating the anionic redox activity and providing a capacity ' 'of 180 mAh g–1 at 0.2C with a capacity retention of 89% for over ' '50 cycles. More devoted efforts to realize the utmost potential ' 'of anionic redox ought to be carried out in the future.') ```
Issues?
You can either report an issue on GitHub or contact me directly. Try gouyx@mail2.sysu.edu.cn.
Citing
If the source code turns out to be helpful to your research, please cite the following work:
Gou, Y., Zhang, Y., Zhu, J. et al. A document-level information extraction pipeline for layered cathode materials for sodium-ion batteries. Sci Data 11, 372 (2024).
Owner
- Login: GGNoWayBack
- Kind: user
- Repositories: 1
- Profile: https://github.com/GGNoWayBack
GitHub Events
Total
- Issues event: 1
- Watch event: 4
- Issue comment event: 1
- Fork event: 1
Last Year
- Issues event: 1
- Watch event: 4
- Issue comment event: 1
- Fork event: 1
Committers
Last synced: about 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| GGNoWayBack | 2****9@q****m | 21 |
| GGNoWayBack | 9****k | 5 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 1
- Total pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: 2 minutes
- Total issue authors: 1
- Total pull request authors: 1
- Average comments per issue: 1.0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 1.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- GGNoWayBack (1)
Pull Request Authors
- GGNoWayBack (2)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 9 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 4
- Total maintainers: 1
pypi.org: cathodedataextractor
A document-level information extraction pipeline for layered cathode materials for sodium-ion batteries.
- Homepage: https://github.com/GGNoWayBack/cathodedataextractor
- Documentation: https://cathodedataextractor.readthedocs.io/
- License: MIT
-
Latest release: 0.0.4
published almost 2 years ago