Recent Releases of somef
somef - SOMEF 0.9.11: Fixing small issues regarding metadata parsers
This small release fixes: - An error when looking for multiple package files in a repository. - An error when exporting the name of a library of a package file
- Jupyter Notebook
Published by dgarijo 10 months ago
somef - SOMEF 0.9.10: Improving CodeMeta export
This version of Somef improves the extraction of requirements in CodeMeta, and improves consistency between the Somef JSON format and CodeMeta.
What's Changed
- Updated CFF/codemeta.json by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/746
- Fix #748 by @Anas-Elhounsri in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/749
- Resolve issues about authors and propertie spdx in license. Fixes #741, #744, #745. by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/751
- Fix #753 for requirements.txt and pyroject.toml by @Anas-Elhounsri in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/758
- We just get the more relevant programming languages in codemeta file. Include test. Fixes #729 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/759
- Software requirements with name and version or array of string. Fixes #754,#755 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/762
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.9...0.9.10
- Jupyter Notebook
Published by dgarijo 10 months ago
somef - SOMEF 0.9.9: Improving CodeMeta export and package file metadata
This release includes the following updates:
- We now recognize pom.xml, setup.py, pyproject.toml and package.json files, based on the CodeMeta crosswalks.
- Regular expressions for licenses.
- Improving release extraction in Gitlab
- Fixed documentation inconsistency issues
- Continuous integration and developmentStatus now are exported.
More information: https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/12?closed=1
What's Changed
- Typo fix in statements about application domains by @Daniel-Mietchen in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/726
- Resolve issues gitlab release, licenses. Development status. Fixes #721, #713, #722 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/727
- author type Organization by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/732
- Update master with changes for 0.9.9 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/735
- fix #736 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/737
- Improved file extraction workflows and continuous integration. Enhanced documentation with new properties by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/738
- Fix #471, initial version by @Anas-Elhounsri in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/739
- Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/743
New Contributors
- @Daniel-Mietchen made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/726
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.8...0.9.9
- Jupyter Notebook
Published by dgarijo 11 months ago
somef - SOMEF 0.9.8: Improving CodeMeta export
This release of somef improves significantly the errors on the CodeMeta export, updating the support to v3.0
What's Changed
- Resolve issue when wiki without content. Fixes #697 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/701
- Several changes about referencePublication, citation, scholarlyArticle. Fixes #640, #685 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/703
- Title without content and spdx id for licenses. Fixes #700, #679 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/706
- Normalization of citations (CFF, Bib). Fixes #629 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/708
- Improve codemeta export. Fixes #678 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/710
- spdx_id must be identifier in license. by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/715
- fix in codeMeta ns by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/717
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.7...0.9.8
- Jupyter Notebook
Published by dgarijo about 1 year ago
somef - SOMEF 0.9.7_2: Update Docker image and poetry actions
Update of the release to configure the poetry action correctly (again)
- Jupyter Notebook
Published by dgarijo about 1 year ago
somef - SOMEF 0.9.7_1: Update Docker image and poetry actions
Update of the release to configure the poetry action correctly
- Jupyter Notebook
Published by dgarijo about 1 year ago
somef - SOMEF 0.9.7: Update Docker image and poetry actions
This release updates the readme and incorporates edits to the Dockerfile and poetry github action for pypi.
What's Changed
- new publish with poetry. Script to update version in publish by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/693
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.6...0.9.7
- Jupyter Notebook
Published by dgarijo about 1 year ago
somef - SOMEF 0.9.6: Fix gitlab issues and TTL export
This release fixes issues with Gitlab repos and TTL export library problems. It also converts the project into a poetry project to help reproducibility and library maintenance.
What's Changed
- SOMEF-639: Replace hardcoded config paths by @StankovskiA in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/641
- Reverting nltk version to avoid problems by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/648
- updated yarrrml.yml by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/657
- yml mapping default by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/659
- Update constants.py by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/662
- Mapping json in memory instead of disk by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/665
- Update turtle_export sending dictionary directly. New rml.ttl by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/666
- Setup poetry envirnment by @kuefmz in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/652
- Updated readme.md with instructions to install poetry by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/669
- updated apply_mapping and correct wrong version by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/675
- Pushing the fixes of instructions about "poetry shell" to dev branch by @Anas-Elhounsri in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/682
- Resolve problem in codemeta export and buildinstrucctions. Fixes #677 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/684
- New process and test to chek gitlab selfhosted. Fixes #687 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/689
New Contributors
- @StankovskiA made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/641
- @juanjemdIos made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/657
- @Anas-Elhounsri made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/682
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.5...0.9.6
- Jupyter Notebook
Published by dgarijo about 1 year ago
somef - SOMEF 0.9.5: Moving towards Python 3.10
This version of SOMEF moves towards adopting Python 3.10 and fixes quite a few issues (logo detection, identifiers, etc.), retrains the models to work with the latest versions of scikit learn and adds an initial categorization implementation.
Thanks to @tpronk and @kuefmz for their contributions!
What's Changed
- Categorization implementation by @rafaeel9 in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/583
- Software types by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/589
- Fixes issue #591 by @tpronk in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/592
- Fixing all tests by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/603
- Towards 0.9.5 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/604
- Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/605
- Fix #553 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/606
- Fix #526 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/607
- Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/608
- Towards next version by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/609
- Preparing for Python 3.10 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/612
- Preparing dependencies for next release by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/614
- Add retrained models for ROLF with scikit-learn 1.3.2 by @kuefmz in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/613
- aligning branches by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/615
- Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/616
- Preparing for 0.9.5 (classifiers re-trained) by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/617
- Fix error: LinearSVC object has no attribute predict_proba by @kuefmz in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/619
- update by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/620
- Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/625
New Contributors
- @tpronk made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/592
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.4...0.9.5
- Jupyter Notebook
Published by dgarijo almost 2 years ago
somef - SOMEF 0.9.4: Adding a new small category
This release adds a new small category for recognizing arxiv paper links found in a readme file. These papers are useful to find potential links to the papers where the software is proposed as a publication
- Jupyter Notebook
Published by dgarijo over 2 years ago
somef - SOMEF 0.9.3: Fix configuration options
This release adds a hotfix for configuring somef when a configuration file did exist before (it wasn't overwritten)
- Jupyter Notebook
Published by dgarijo about 3 years ago
somef - SOMEF 0.9.2: Fix for Pypi
Release to update package in pypi
- Jupyter Notebook
Published by dgarijo about 3 years ago
somef - SOMEF 0.9.1: Fixing init dependency error
This release adds a hotfix due to the ROLF integration in 0.9.0 that prevented running somef
- Jupyter Notebook
Published by dgarijo about 3 years ago
somef - SOMEF 0.9.0: Multiple bug fixes and export improvement
This release addresses major bugs in the extraction and introduces the following improvements:
- New features for recognizing packages and badges.
- Now somef will try to fix links of logos and images if by default they do not resolve.
- For Codemeta/TTL somef will try to reconcile the information gathered to maximize its utility. Also only providing links where links are supposed to be provided.
- somef will now look for install files among others.
- We have a new output specification, available here
- ROLF integration by @kuefmz for classifying software categories.
- Now we include an RML mapping to comply against the new spec.
New Contributors
- @kuefmz made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/480
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.8.2...0.9.0
- Jupyter Notebook
Published by dgarijo about 3 years ago
somef - SOMEF 0.8.2: Minor fixes for logo detection and feature for keeping temporary files
Somef 0.8.2 fixing a version issue for pypi release. See https://github.com/KnowledgeCaptureAndDiscovery/somef/releases/tag/0.8.2 for notes.
- Jupyter Notebook
Published by dgarijo over 3 years ago
somef - SOMEF 0.8.2: Minor fixes for logo detection and feature for keeping temporary files
This release of SOMEF fixes some issues detecting logos (confusing them with badges) and adds a small feature so somef does not delete the repository after its analysis. That way, these files can be kept for further analysis if needed.
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.8.1...0.8.2
- Jupyter Notebook
Published by dgarijo over 3 years ago
somef - SOMEF 0.8.1: Minor fixes in regular expressions
This version of SOMEF addresses minor issues in regular expressions that made the application crash when parsing certain image links. New tests have been added to address the corresponding issues.
- Jupyter Notebook
Published by dgarijo almost 4 years ago
somef - SOMEF 0.8.0: Major bug fixes and new functionalities
This version of SOMEF fixes some issues with the new markdown parser, adds documentation to the project and includes the following new features: - Package distributions from python badges - Ontologies present in the repository - Citation files - Extended metadata for the types of files recognized - Docker compose files - colab notebooks
Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.7.2...0.8.0
- Jupyter Notebook
Published by dgarijo almost 4 years ago
somef - SOMEF 0.7.2: Bug fixes for Windows
This release addresses a small bug where the package would not run in Windows due to an encoding error.
- Jupyter Notebook
Published by dgarijo almost 4 years ago
somef - SOMEF 0.7.1: Bug fixes for new parser
This version of SOMEF fixes: - Fixes new parser problems - Improves the logo and image capture - Improves how arxiv links are captured for papers - Improves problems with empty repositories for bulk processing Details: https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/8?closed=1
- Jupyter Notebook
Published by dgarijo almost 4 years ago
somef - SOMEF 0.7.0: Support for Gitlab and new features
This version of SOMEF introduces the following features: - Support for Gitlab repositories - Recognize repo status badges - Recognize logos and images used in the readme. - Recognize support channels where help for the target software component may be provided. - Recognize executable scripts associated with the target software component - Bugs and small issues. See https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/5?closed=1 for more information
- Jupyter Notebook
Published by dgarijo about 4 years ago
somef - SOMEF 0.6.0: New features and bug fixes
This release of SOMEF includes new features: - Better recognition for license files. - Readthedocs URLs - Arxiv links - Code of conduct files - Improved documentation (for new fields and older fields)
In addition, codemeta, ttl and JSON exports have been updated, classification pipelines have been updated to Python 3.9 (version of Scikitlearn 1.0.0) and multiple bugs and redundancies have been reduced. Check https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/4?closed=1 for more information about the issues addressed.
- Jupyter Notebook
Published by dgarijo about 4 years ago
somef - SOMEF 0.5.1: Codemeta export bug fixes
This release addresses small bug fixes for making the codemeta export more robust. For example, when a repository does not have a code release, now the program will not fail.
- Jupyter Notebook
Published by dgarijo almost 5 years ago
somef - SOMEF 0.5.0
This release addresses the following issues:
- Automated evaluation reports for header analysis, so we can compare improvements without having to re-annotate corpus.
- Fixed errors in annotation of fields being GitHub API (they were a custom file exploration)
- Now we export a new category: Acknowledgements
- Fixed errors on empty exported metadata fields.
- Disambiguated category Issues
- See https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/3?closed=1 for more details.
- Jupyter Notebook
Published by dgarijo almost 5 years ago
somef - SOMEF 0.4.0
This release improves on the following aspects of SOMEF:
- Notebooks and Dockerfiles are recognized.
- Docs folders are extracted.
- License is discovered through file inspection.
- New Codemeta specific export
- Fixed the technique in the JSON output to accommodate the new types of techniques supported
- MyBinder links extraction
- Zenodo DOIs extraction
- Tests
- Attempt at extracting a long title of a repo
- Jupyter Notebook
Published by dgarijo almost 5 years ago
somef - SOMEF 0.3.0: Code reorganization and bug fixes
This version of SOMEF includes full documentation and provides the following features: - Cleanup and reorganization of repository contents. - Synchronized package with GitHub releases - Prepared package release. - Bug fixes and code refactoring - Export in JSON-LD and Turtle
- Jupyter Notebook
Published by dgarijo over 5 years ago
somef - SoMEF 0.2.0: Updates in CLI
This version of the system is more robust and includes: * Answer in JSON format * Configuration allowing to make authentication optional * Initial integration of the header analysis.
- Jupyter Notebook
Published by dgarijo almost 6 years ago
somef - SoMEF 0.1.0: First release of the CLI
This version of SM2KG incorporates the following features: * Ranking of classifiers (ordered by accuracy) with the best pipelines to be used by SM2KG. * First version of the CLI for using the classifiers, creating a JSON result file * First analysis on detecting commonly used words for the different sections.
- Jupyter Notebook
Published by dgarijo almost 6 years ago
somef - SoMEF 0.0.1: First release of the framework
SoMEF is a software metadata extraction framework designed to automatically distinguish the description, installation instructions, invocation and citation of scientific software metadata from a README file
- Jupyter Notebook
Published by dgarijo over 6 years ago