Recent Releases of gnfinder
gnfinder - v1.0.4
CHANGELOG
[v1.0.4] - 2022-11-03 Thu
- Fix [#132]: return words surronding a name without preprocessing.
[v1.0.3] - 2022-10-13 Thu
- Fix [#131]: recognize no-break and wide spaces as spaces during tokenization.
[v1.0.2] - 2022-10-13 Thu
- Fix: update gndoc to
v0.3.2to increasae threshold to access remote URLs
- Go
Published by dimus over 3 years ago
gnfinder - v0.19.5
CHANGELOG
[v0.19.5] - 2022-05-10 Tue
- Add: update gnverifier to v1.0.0-RC1
- Add: MCSbase to web UI.
[v0.19.4] - 2022-05-03 Tue
- Add: species group, cardinality score for verification.
[v0.19.3] - 2022-04-10 Sun
- Add: update output to use MainTaxon, templates.
[v0.19.2] - 2022-04-09 Sat
- Add: update gnlib to v0.13.0.
[v0.19.1] - 2022-04-09 Sat
- Fix: output field misspelling in JSON.
[v0.19.0] - 2022-04-09 Sat
- Add: add IRMNG to web UI
- Add: update to gnlib v0.12.0, use its stats module.
[v0.18.3] - 2022-03-22 Tue
- Add: update Go to v1.18, modules.
- Fix [#119]:
Taxon,Morphologicalshould not be recognized as uninomials.
- Go
Published by dimus about 4 years ago
gnfinder - v0.18.0
CHANGELOG
[v0.18.0] - 2022-02-28 Mon
Add [#117]: bring verificaton in sync with gnames v0.8.0
Add [#116]: add --all-matches flag to show all verification results.
Add: update input and output objects and REST API
Introducing some backward incompatibility.
See https://apidoc.globalnames.org/gnfinder-beta
- Go
Published by dimus about 4 years ago
gnfinder - v0.17.0
CHANGELOG
Add [#111]: update bayes calculations.
Add [#110]: update verification process using most recent code. Stats for kingdoms distribution and the main clade that contains most of the names in the text. Verification JSON is not fully backward compatible.
Add [#109]: add classification path to CSV and TSV outputs.
- Go
Published by dimus over 4 years ago
gnfinder - v0.16.0
CHANGELOG
[v0.16.0]
Add [#94]: Add web-based user interface.
[v0.15.5]
Add [#105]: Support for URL name-finding in REST API.
[v0.15.4]
Add [#104]: merge petectLanguage to language. It allows to simplify logic for language settings. It also changes API signature for parameters. Now parameter "language" recognizes
* "": empty string that goes to default "eng" setting for language
* "detect": finds language by an algorithm
* "eng": sets language to English
* "deu": sets language to German
All other settings default to "eng" (English)
- Go
Published by dimus over 4 years ago
gnfinder - v0.15.0
CHANGELOG
Add [#99]: add TSV format and make ouput format an option for REST API. Add: update modules Add: update Go to 1.17 Add [#98]: an option to return names positions in bytes from the text start instead of UTF-8 characters.
Fix [#100]: fix csv/tsv fields number for verification
- Go
Published by dimus over 4 years ago
gnfinder - v0.14.0
CHANGELOG
- Add [#91]: convert/extract plain texts locally
- Add [#89]: configuration file and environment variables.
- Add [#87]: support PDFs, MS Word, Excel, RTF, HTML, UTF16 etc via Apache Tika.
- Add [#86]: an option to return unique found names.
- Go
Published by dimus almost 5 years ago
gnfinder - v0.12.0
CHANGELOG
This release is not backward compatible with command line flags and name verification. It introduces CSV format as a default output and JSON compact and pretty formats as optional. Verification format is now the same as at https://verifier.globalnames.org, and is using this site as verification engine.
- Add: [#81]: represent new lines in verbatim output as "\n".
- Add: [#80]: use CSV, JSON, JSON pretty for output.
- Add: [#79]: adjust prior odds using the density of found names in a text.
- Add: [#78]: fix Odds value for names with 'grey' genus and species.
- Add: [#77]: add RESTful interface.
- Add: [#76]: remove subcommands from CLI.
- Add: [#75]: update tests, remove ginkgo depencency for tests.
- Add: [#73]: benchmark and optimize tokenizer.
- Add: [#71]: use
embedintroduced in Go v1.16. - Add: [#70]: migrate code to use gner tokenizer.
- Add: [#69]: Output Odds as a log10.
- Add: [#68]: Refactor the code with interfaces to be consistent withother projects.
- Add: [#64]: Remove common words from species.
- Add: [#63]: Remove geo-names as uninomials.
- Add: [#62]: Remove human names as uninomials.
- Go
Published by dimus about 5 years ago
gnfinder - v0.11.1 Dictionaries update
CHANGELOG
- Add: Update dictionaries.
- Fix [#51]: Remove 'Piper' from black list, add new words to dictionaries.
- Go
Published by dimus almost 6 years ago
gnfinder - Cleaner protobuf schema and fixes in CLI JSON
Output formats are not backward compatible
CHANGELOG
- Add [#49]: Cleanup protobuf and JSON outputs. Introducing backward incompatible changes in the output. Standardising CLI JSON to camelcase, introducing cardinality instead of string for a name type, adding canonical simple and full canonical foms for matched and current names. Removing current name unless it is a synonym.
- Go
Published by dimus about 6 years ago
gnfinder - gRPC supports nom. annotations, surrounding words
CHANGELOG
- Add [#46]: gRPC serves nomenclatural annotation and words surrounding name-strings.
- Go
Published by dimus about 6 years ago
gnfinder - Catch nomenclatural annotations, preserve words around name
CHANGELOG
- Add [#44]: save nomenclatural annotation for new species, combinations, subscpecies etc.
- Add [#45]: return desired number of words before and after a name-candidate.
- Go
Published by dimus about 6 years ago
gnfinder - Options refactoring, version output
CHANGELOG
- Add [#36]: Refactor GNfinder options.
- Add [#35]: Add version info to gRPC server.
- Add [#34]: Better language detection.
- Add [#33]: Make it possible to force Bayes not only "on" but also "off".
- Add [#32]: Add benchmarks to
gnfinder_test.go.
- Go
Published by dimus over 6 years ago
gnfinder - Fix a bug in tokenizer
CHANGELOG
- Fix [#30]: Tokenizer breaks if a text ends on a dash followed by space.
- Go
Published by dimus over 6 years ago
gnfinder - Enhanced verification output, fixes in dictionaries
CHANGELOG
- Add [#29]: Enhance verification results. Now preferred data sources have the same fields as the best result. Classification has IDs and ranks.
- Add: Update dictionaries with wide-used Latin common names marked as grey.
- Add [#28]: Generic names from ICN (botanical) code might have authors in parentheses that look the same as subgenus part of ICZN names. As a result parsing such names creates fake uninomials. We removed such fake uninomials from uninomial white dictionary.
- Go
Published by dimus about 7 years ago
gnfinder - Fixes and refactoring
Changelog
- Add [#27]: Refactor code to make it more maintainable
- Add [#26]: Command line app tests
- Fix [#25]: Make CLI app work again (cobra-based cli does not allow
root command with input without flags so
gndinfer text.txtwas broken).
- Go
Published by dimus about 7 years ago
gnfinder - Verification with updated gnindex API
- Add [#21]: support updated gnindex API
- Go
Published by dimus over 7 years ago
gnfinder - Update gRPC server, it's output is now similar to CLI output
- Add [#22]: Go module support for more stable builds
- Add [#19]: bring gRPC output close to cli output. Breaks backward compatibility of gRPC.
- Add [#20]: update API interaction with gnindex.
- Add [#17]: return offsets for the start and the end of name-strings.
- Fix [#18]: gRPC works with diacritics in text input.
- Go
Published by dimus over 7 years ago
gnfinder - gRPC server and Docker support
- Add: [#16]: docker support. Command
make dockercreates docker image. - Add: [#15]: enable gRPC to set data-source IDs for verification
- Add: [#14]: setting for name verification data-sources as well as command line flag. Currently tests for gRPC are located in [Ruby gem gndinder] project.
- Add: [#12]: gRPC-based HTTP API to access gnfinder from other languages.
- Add: StemEditDistance for fuzzy matching by stem.
- Go
Published by dimus over 7 years ago
gnfinder - Improvements in verification
- Add [#11]: Quality Summary and Preferred data sources in verification
- Add [#9]: Additional information how to install in README.md
- Add [#8]: Retry verification if any error happens in the process.
- Add [#7]: Add EditDistance field to verification output.
Add [#6]: Add 'NoMatch' value to verification 'MatchType'.
Fix [#5]: Hide verification "data" if it is empty.
Remove [#6]: Remove Verified field, as it repeats 'NoMatch' information.
- Go
Published by dimus almost 8 years ago