Recent Releases of somef

somef - SOMEF 0.9.11: Fixing small issues regarding metadata parsers

This small release fixes: - An error when looking for multiple package files in a repository. - An error when exporting the name of a library of a package file

- Jupyter Notebook
Published by dgarijo 10 months ago

somef - SOMEF 0.9.10: Improving CodeMeta export

This version of Somef improves the extraction of requirements in CodeMeta, and improves consistency between the Somef JSON format and CodeMeta.

What's Changed

  • Updated CFF/codemeta.json by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/746
  • Fix #748 by @Anas-Elhounsri in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/749
  • Resolve issues about authors and propertie spdx in license. Fixes #741, #744, #745. by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/751
  • Fix #753 for requirements.txt and pyroject.toml by @Anas-Elhounsri in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/758
  • We just get the more relevant programming languages in codemeta file. Include test. Fixes #729 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/759
  • Software requirements with name and version or array of string. Fixes #754,#755 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/762

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.9...0.9.10

- Jupyter Notebook
Published by dgarijo 10 months ago

somef - SOMEF 0.9.9: Improving CodeMeta export and package file metadata

This release includes the following updates: - We now recognize pom.xml, setup.py, pyproject.toml and package.json files, based on the CodeMeta crosswalks. - Regular expressions for licenses. - Improving release extraction in Gitlab - Fixed documentation inconsistency issues - Continuous integration and developmentStatus now are exported. More information: https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/12?closed=1

What's Changed

  • Typo fix in statements about application domains by @Daniel-Mietchen in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/726
  • Resolve issues gitlab release, licenses. Development status. Fixes #721, #713, #722 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/727
  • author type Organization by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/732
  • Update master with changes for 0.9.9 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/735
  • fix #736 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/737
  • Improved file extraction workflows and continuous integration. Enhanced documentation with new properties by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/738
  • Fix #471, initial version by @Anas-Elhounsri in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/739
  • Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/743

New Contributors

  • @Daniel-Mietchen made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/726

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.8...0.9.9

- Jupyter Notebook
Published by dgarijo 11 months ago

somef - SOMEF 0.9.8: Improving CodeMeta export

This release of somef improves significantly the errors on the CodeMeta export, updating the support to v3.0

What's Changed

  • Resolve issue when wiki without content. Fixes #697 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/701
  • Several changes about referencePublication, citation, scholarlyArticle. Fixes #640, #685 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/703
  • Title without content and spdx id for licenses. Fixes #700, #679 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/706
  • Normalization of citations (CFF, Bib). Fixes #629 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/708
  • Improve codemeta export. Fixes #678 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/710
  • spdx_id must be identifier in license. by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/715
  • fix in codeMeta ns by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/717

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.7...0.9.8

- Jupyter Notebook
Published by dgarijo about 1 year ago

somef - SOMEF 0.9.7_2: Update Docker image and poetry actions

Update of the release to configure the poetry action correctly (again)

- Jupyter Notebook
Published by dgarijo about 1 year ago

somef - SOMEF 0.9.7_1: Update Docker image and poetry actions

Update of the release to configure the poetry action correctly

- Jupyter Notebook
Published by dgarijo about 1 year ago

somef - SOMEF 0.9.7: Update Docker image and poetry actions

This release updates the readme and incorporates edits to the Dockerfile and poetry github action for pypi.

What's Changed

  • new publish with poetry. Script to update version in publish by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/693

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.6...0.9.7

- Jupyter Notebook
Published by dgarijo about 1 year ago

somef - SOMEF 0.9.6: Fix gitlab issues and TTL export

This release fixes issues with Gitlab repos and TTL export library problems. It also converts the project into a poetry project to help reproducibility and library maintenance.

What's Changed

  • SOMEF-639: Replace hardcoded config paths by @StankovskiA in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/641
  • Reverting nltk version to avoid problems by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/648
  • updated yarrrml.yml by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/657
  • yml mapping default by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/659
  • Update constants.py by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/662
  • Mapping json in memory instead of disk by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/665
  • Update turtle_export sending dictionary directly. New rml.ttl by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/666
  • Setup poetry envirnment by @kuefmz in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/652
  • Updated readme.md with instructions to install poetry by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/669
  • updated apply_mapping and correct wrong version by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/675
  • Pushing the fixes of instructions about "poetry shell" to dev branch by @Anas-Elhounsri in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/682
  • Resolve problem in codemeta export and buildinstrucctions. Fixes #677 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/684
  • New process and test to chek gitlab selfhosted. Fixes #687 by @juanjemdIos in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/689

New Contributors

  • @StankovskiA made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/641
  • @juanjemdIos made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/657
  • @Anas-Elhounsri made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/682

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.5...0.9.6

- Jupyter Notebook
Published by dgarijo about 1 year ago

somef - SOMEF 0.9.5: Moving towards Python 3.10

This version of SOMEF moves towards adopting Python 3.10 and fixes quite a few issues (logo detection, identifiers, etc.), retrains the models to work with the latest versions of scikit learn and adds an initial categorization implementation.

Thanks to @tpronk and @kuefmz for their contributions!

What's Changed

  • Categorization implementation by @rafaeel9 in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/583
  • Software types by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/589
  • Fixes issue #591 by @tpronk in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/592
  • Fixing all tests by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/603
  • Towards 0.9.5 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/604
  • Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/605
  • Fix #553 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/606
  • Fix #526 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/607
  • Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/608
  • Towards next version by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/609
  • Preparing for Python 3.10 by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/612
  • Preparing dependencies for next release by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/614
  • Add retrained models for ROLF with scikit-learn 1.3.2 by @kuefmz in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/613
  • aligning branches by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/615
  • Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/616
  • Preparing for 0.9.5 (classifiers re-trained) by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/617
  • Fix error: LinearSVC object has no attribute predict_proba by @kuefmz in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/619
  • update by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/620
  • Dev by @dgarijo in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/625

New Contributors

  • @tpronk made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/592

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.9.4...0.9.5

- Jupyter Notebook
Published by dgarijo almost 2 years ago

somef - SOMEF 0.9.4: Adding a new small category

This release adds a new small category for recognizing arxiv paper links found in a readme file. These papers are useful to find potential links to the papers where the software is proposed as a publication

- Jupyter Notebook
Published by dgarijo over 2 years ago

somef - SOMEF 0.9.3: Fix configuration options

This release adds a hotfix for configuring somef when a configuration file did exist before (it wasn't overwritten)

- Jupyter Notebook
Published by dgarijo about 3 years ago

somef - SOMEF 0.9.2: Fix for Pypi

Release to update package in pypi

- Jupyter Notebook
Published by dgarijo about 3 years ago

somef - SOMEF 0.9.1: Fixing init dependency error

This release adds a hotfix due to the ROLF integration in 0.9.0 that prevented running somef

- Jupyter Notebook
Published by dgarijo about 3 years ago

somef - SOMEF 0.9.0: Multiple bug fixes and export improvement

This release addresses major bugs in the extraction and introduces the following improvements:

  • New features for recognizing packages and badges.
  • Now somef will try to fix links of logos and images if by default they do not resolve.
  • For Codemeta/TTL somef will try to reconcile the information gathered to maximize its utility. Also only providing links where links are supposed to be provided.
  • somef will now look for install files among others.
  • We have a new output specification, available here
  • ROLF integration by @kuefmz for classifying software categories.
  • Now we include an RML mapping to comply against the new spec.

New Contributors

  • @kuefmz made their first contribution in https://github.com/KnowledgeCaptureAndDiscovery/somef/pull/480

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.8.2...0.9.0

- Jupyter Notebook
Published by dgarijo about 3 years ago

somef - SOMEF 0.8.2: Minor fixes for logo detection and feature for keeping temporary files

Somef 0.8.2 fixing a version issue for pypi release. See https://github.com/KnowledgeCaptureAndDiscovery/somef/releases/tag/0.8.2 for notes.

- Jupyter Notebook
Published by dgarijo over 3 years ago

somef - SOMEF 0.8.2: Minor fixes for logo detection and feature for keeping temporary files

This release of SOMEF fixes some issues detecting logos (confusing them with badges) and adds a small feature so somef does not delete the repository after its analysis. That way, these files can be kept for further analysis if needed.

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.8.1...0.8.2

- Jupyter Notebook
Published by dgarijo over 3 years ago

somef - SOMEF 0.8.1: Minor fixes in regular expressions

This version of SOMEF addresses minor issues in regular expressions that made the application crash when parsing certain image links. New tests have been added to address the corresponding issues.

- Jupyter Notebook
Published by dgarijo almost 4 years ago

somef - SOMEF 0.8.0: Major bug fixes and new functionalities

This version of SOMEF fixes some issues with the new markdown parser, adds documentation to the project and includes the following new features: - Package distributions from python badges - Ontologies present in the repository - Citation files - Extended metadata for the types of files recognized - Docker compose files - colab notebooks

Full Changelog: https://github.com/KnowledgeCaptureAndDiscovery/somef/compare/0.7.2...0.8.0

- Jupyter Notebook
Published by dgarijo almost 4 years ago

somef - SOMEF 0.7.2: Bug fixes for Windows

This release addresses a small bug where the package would not run in Windows due to an encoding error.

- Jupyter Notebook
Published by dgarijo almost 4 years ago

somef - SOMEF 0.7.1: Bug fixes for new parser

This version of SOMEF fixes: - Fixes new parser problems - Improves the logo and image capture - Improves how arxiv links are captured for papers - Improves problems with empty repositories for bulk processing Details: https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/8?closed=1

- Jupyter Notebook
Published by dgarijo almost 4 years ago

somef - SOMEF 0.7.0: Support for Gitlab and new features

This version of SOMEF introduces the following features: - Support for Gitlab repositories - Recognize repo status badges - Recognize logos and images used in the readme. - Recognize support channels where help for the target software component may be provided. - Recognize executable scripts associated with the target software component - Bugs and small issues. See https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/5?closed=1 for more information

- Jupyter Notebook
Published by dgarijo about 4 years ago

somef - SOMEF 0.6.0: New features and bug fixes

This release of SOMEF includes new features: - Better recognition for license files. - Readthedocs URLs - Arxiv links - Code of conduct files - Improved documentation (for new fields and older fields)

In addition, codemeta, ttl and JSON exports have been updated, classification pipelines have been updated to Python 3.9 (version of Scikitlearn 1.0.0) and multiple bugs and redundancies have been reduced. Check https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/4?closed=1 for more information about the issues addressed.

- Jupyter Notebook
Published by dgarijo about 4 years ago

somef - SOMEF 0.5.1: Codemeta export bug fixes

This release addresses small bug fixes for making the codemeta export more robust. For example, when a repository does not have a code release, now the program will not fail.

- Jupyter Notebook
Published by dgarijo almost 5 years ago

somef - SOMEF 0.5.0

This release addresses the following issues: - Automated evaluation reports for header analysis, so we can compare improvements without having to re-annotate corpus. - Fixed errors in annotation of fields being GitHub API (they were a custom file exploration) - Now we export a new category: Acknowledgements - Fixed errors on empty exported metadata fields. - Disambiguated category Issues - See https://github.com/KnowledgeCaptureAndDiscovery/somef/milestone/3?closed=1 for more details.

- Jupyter Notebook
Published by dgarijo almost 5 years ago

somef - SOMEF 0.4.0

This release improves on the following aspects of SOMEF: - Notebooks and Dockerfiles are recognized. - Docs folders are extracted. - License is discovered through file inspection. - New Codemeta specific export - Fixed the technique in the JSON output to accommodate the new types of techniques supported - MyBinder links extraction - Zenodo DOIs extraction - Tests - Attempt at extracting a long title of a repo

- Jupyter Notebook
Published by dgarijo almost 5 years ago

somef - SOMEF 0.3.0: Code reorganization and bug fixes

This version of SOMEF includes full documentation and provides the following features: - Cleanup and reorganization of repository contents. - Synchronized package with GitHub releases - Prepared package release. - Bug fixes and code refactoring - Export in JSON-LD and Turtle

- Jupyter Notebook
Published by dgarijo over 5 years ago

somef - SoMEF 0.2.0: Updates in CLI

This version of the system is more robust and includes: * Answer in JSON format * Configuration allowing to make authentication optional * Initial integration of the header analysis.

- Jupyter Notebook
Published by dgarijo almost 6 years ago

somef - SoMEF 0.1.0: First release of the CLI

This version of SM2KG incorporates the following features: * Ranking of classifiers (ordered by accuracy) with the best pipelines to be used by SM2KG. * First version of the CLI for using the classifiers, creating a JSON result file * First analysis on detecting commonly used words for the different sections.

- Jupyter Notebook
Published by dgarijo almost 6 years ago

somef - SoMEF 0.0.1: First release of the framework

SoMEF is a software metadata extraction framework designed to automatically distinguish the description, installation instructions, invocation and citation of scientific software metadata from a README file

- Jupyter Notebook
Published by dgarijo over 6 years ago