https://github.com/adbar/adbar
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.4%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
Basic Info
- Host: GitHub
- Owner: adbar
- Default Branch: master
- Size: 4.88 KB
Statistics
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
- Releases: 0
Created almost 6 years ago
· Last pushed over 1 year ago
Metadata Files
Readme
README.md
Hi there! 👋
I'm a data engineer and scientist specializing in natural language processing. On Github I'm the author and maintainer of projects like Trafilatura, a popular open-source package to gather and extract text data used by researchers and the AI industry.
Most Popular Blog Posts
- Extracting the main text content from web pages using Python
- A simple multilingual lemmatizer for Python
- A module to extract date information from web pages
- Web scraping with R: Text and metadata extraction
Open-Source Tech Stack
| Skills | Programming languages |
| ------------------ | --------------------- |
| |
|
Owner
- Name: Adrien Barbaresi
- Login: adbar
- Kind: user
- Location: Berlin
- Company: Berlin-Brg. Academy of Sciences (BBAW)
- Website: adrien.barbaresi.eu
- Twitter: adbarbaresi
- Repositories: 37
- Profile: https://github.com/adbar
Research scientist – natural language processing, web scraping and text analytics. Mostly with Python.
GitHub Events
Total
- Delete event: 1
- Push event: 6
- Pull request event: 1
Last Year
- Delete event: 1
- Push event: 6
- Pull request event: 1
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 0
- Total pull requests: 2
- Average time to close issues: N/A
- Average time to close pull requests: 18 minutes
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 2
- Average time to close issues: N/A
- Average time to close pull requests: 18 minutes
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- adbar (2)