Recent Releases of libfolia

libfolia - v2.21.1

  • Fix for segfault on edge cases without /etc/services (https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1094197), thanks to Chris Hofstaedtler and Johannes Schauer Marin Rodrigues

- C++
Published by proycon over 1 year ago

libfolia - v2.21

- C++
Published by kosloot over 1 year ago

libfolia - v2.20

[Ko van der Sloot] * require C++17 now * small refactorings * cleanup in GitHub CI file

- C++
Published by kosloot over 1 year ago

libfolia - v2.19

  • again bumped the .so version, as we break the ABI
  • Refactored the Class hierarchy for clearer code
    • introducing an AbstractFeature class as base for all derived Features
  • improved exception and error handling, including line numbers in messages when possible
  • added code to detect mismatch between annotators and processors (were not detected until now)
  • cleaner en better C++ code, const correctness and such...

- C++
Published by kosloot about 2 years ago

libfolia - v2.18

  • buming the .so version due to ABI breaks
  • fix for --canonical option in folialint
  • fix for https://github.com/LanguageMachines/libfolia/issues/56
  • improved checking for empty nodes
  • several code improvements. const correctness etc.
  • fix for https://github.com/LanguageMachines/libfolia/issues/55
  • better check for illegal Correction's: https://github.com/proycon/folia/issues/77
  • added Doxygen config
  • better handling of XML comment nodes

- C++
Published by kosloot about 2 years ago

libfolia - v2.17

  • assume ticcutils >= 0.34 to force NFC normalization
  • refactored str() and unicode() text extraction functions.
  • a lot of work on code quality

- C++
Published by kosloot over 2 years ago

libfolia - v2.16

  • fix for https://github.com/LanguageMachines/libfolia/issues/54
  • Clearer error messages (adding filename, if present)
  • some code cleaned/clarified
  • added code to parse, store and output XML's Processing Instruction nodes
  • added Etymology annotation
  • annotation

- C++
Published by kosloot over 2 years ago

libfolia - v2.15

  • fixed a terrible typo/bug in subclasses.cxx: using el-referable(), where el->referable() was meant
  • plugged a small potential memory-leak
  • fixed some offset problems in text handling
  • fixed https://github.com/LanguageMachines/libfolia/issues/52
  • foliadiff script now returns better message on failure
  • switching to C++14
  • code polishing
  • updated GitHub action

- C++
Published by kosloot about 3 years ago

libfolia - v2.14

[Ko van der Sloot] * implemented an ADD_FORMATTING TextPolicy to extract otherwise hidden text from <t-hspace> and <t-hyph> Markup elements (see https://github.com/proycon/foliapy/issues/25) * fix for https://github.com/LanguageMachines/libfolia/issues/51 * General improvements: - include a filename when throwing during Document processing - added setutest() member for XmlText class - general code quality

- C++
Published by kosloot over 3 years ago

libfolia - v2.13

[Ko van der Sloot] * removed dependency on libtar * quick fix for ignoring text inside <t-hbr>: https://github.com/proycon/foliapy/issues/25

[Maarten van Gompel] * updated minimum required libxml2 version

- C++
Published by proycon over 3 years ago

libfolia - v2.12

  • fix for https://github.com/LanguageMachines/libfolia/issues/49
  • ABI breached, so bunped the .so file version
  • cleaner C++ code, more C++11 now, removing CppCheck warnings
  • using more recent TiccUtils (for enum_flags.h)
  • several small improvements
  • improved GitHub action

- C++
Published by kosloot over 3 years ago

libfolia - v2.11

[Ko vd Sloot] * Significant refactoring, code cleaning, code reduction, and extra comments * fixed memory leaks in the test (and also tests destroy() function now) * Added some safeguards against multiple setnames for text_annotation. This is a limitation discussed in https://github.com/proycon/folia/issues/104 * added code to handle text extraction for "empty" rows. * implemented a fix for empty cell's. https://github.com/proycon/foliatools/issues/41 * added a fix for text offsets in embedded elements in a structure that may NOT carry text itself. Like cell inside a table.

[Maarten van Gompel] * codemeta.json: updated metadata according to (proposed) CLARIAH requirements

- C++
Published by proycon almost 4 years ago

libfolia - v2.10

  • several code improvements, suggested by CPPcheck and scan-build
  • start using TextPolicy::debug
  • impemented implicitspace logic for whitespace issue proycon/folia#101

- C++
Published by kosloot over 4 years ago

libfolia - v2.9

  • Reworked the FoliaElement class hierarchy. Much clearer now
  • re-arranged file structure. Separating some files into smaller files
  • text extraction:
    • numereous changes and additions to handle spaces better.
    • refactored the code, using a new TextPolicy class for clarity
    • added code for handling 'tag' attributes using callbacks
  • improved handling of Correction
  • numerous code refactorings for clearity and speed
  • adapted and improved documentation

- C++
Published by kosloot almost 5 years ago

libfolia - v2.8.1

Bug fix release. [Ko vd Sloot] * re-added the ltrim() function for backward compatibility

- C++
Published by kosloot about 5 years ago

libfolia - v2.8

  • implements FoLiA v2.5, with a new 'model' for white-spaces in texts.
  • bumped the .so version to 17

[Maarten van Gompel] * added TextMarkup tag * added TextMarkup tag * added tag attribute * fix for proycon/folia#88, proycon/folia#92, proycon/folia#93, proycon/folia#94 * added text normalization functions to support the new text model, maintaining backward compatibility.

[Ko vd Sloot] * parse and preserve the xml:space attribute. * added a 'space' normalizer. ALL exotic spaces (like em-space and en-space) are replaced by the standard ascii space * fixed https://github.com/LanguageMachines/libfolia/issues/48 * code cleanup/refactoring * ditch TravisCI and implemented a GitHub action

- C++
Published by kosloot about 5 years ago

libfolia - v2.7

  • implemented a more relaxed MetaData scheme, allowing mixing 'foreign' and 'native' MetaData
  • bumped the .so version to 15
  • features may be present in and nodes now

- C++
Published by kosloot over 5 years ago

libfolia - v2.6.1

[Maarten van Gompel]

  • Updated for FoLiA v2.4.1: strip leading/trailing whitespace in text content (proycon/folia#88)

[Ko vd Sloot]

  • Fixed problem with text-consistency errors for within

- C++
Published by proycon over 5 years ago

libfolia - v2.6

[Maarten van Gompel]

  • Updated for FoLiA v2.4
  • Revised external implementation
  • Implemented Modality annotation

[Ko vd Sloot] * cleanup and extra sanity tests * Implemented an 'explicit' mode for Document (FoLiA v2.3) and in folialint

- C++
Published by proycon over 5 years ago

libfolia - v2.5.1

[Maarten van Gompel] * Bugfix: Fixed handling of control characters, strip control characters by default

[Ko vd Sloot] * fix in date handling (lookup table for month -> integer conversion ) * minor refactoring * some documentation

- C++
Published by proycon over 5 years ago

libfolia - v2.5

[Maarten van Gompel] * Adapted to FoLiA v2.3 * Support parsing of the new explicit form

[Ko vd Sloot] * folialint: updated usage() and man page * minor refactoring

- C++
Published by proycon almost 6 years ago

libfolia - v2.4

  • comment in Doxygen format added
  • bumped the library version to 14
  • fix for https://github.com/proycon/folia/issues/82
  • fix for https://github.com/proycon/folia/issues/42
  • fixed problem with using new tag names on pre 1.6 documents
  • better checks in folia_engine on text inconsistencies and such (https://github.com/LanguageMachines/libfolia/issues/43)
  • confidence output is more consistent now
  • removed the folia_builder (was not used)
  • code refactorings and cleanup, removing unused functions

- C++
Published by kosloot about 6 years ago

libfolia - v2.3.2

Bug fix release * fix for https://github.com/LanguageMachines/foliautils/issues/37 * fix for https://github.com/LanguageMachines/foliautils/issues/38 * fixes in Correction handling. * fixed a Multi-Threading problem with the static reverse_old map

- C++
Published by kosloot over 6 years ago

libfolia - v2.3.1

  • Bug fix release for gcc 9.1 It stumbles upon some inline functions
  • replaced call to unsafe 'tmpnam()' by 'TiCC::tempname()'

- C++
Published by kosloot over 6 years ago

libfolia - v2.3

new features: * autodeclare mode introduced (as in FoLiApy) * folialint by default doesn't autodeclare. use -a or --autodeclare to use it * Better detecting of declaration errors in general * the select function now also enables the possibility to search recursively upto the first matching sibling. * bumped library version to 13

other changes: * some exceptions are changed. * less exceptions are thrown. An empty result is returned instead. * folialint now accept bote -d and --debug * real fix for issue70 * small bug fixes and refactorings * accept empty Comment and Description nodes

- C++
Published by kosloot almost 7 years ago

libfolia - v2.2.1

Bug fix release: * There were some problems handling NO setname vs. EMPTY setname, during incremental parsing in folia::Engine. This was sorted out now: https://github.com/proycon/folia/issues/74 This related to some ucto and frog issues too: https://github.com/LanguageMachines/ucto/issues/70 https://github.com/LanguageMachines/frog/issues/72

- C++
Published by kosloot almost 7 years ago

libfolia - v2.2

Bug fix release. * Folia::Engine choked on some complex FoLiA. Solved by refactoring and in fact simplifying some code. (Frog issue #77 revealed this) * added flush() on document output to streams. (frog issue #72) * improved output in debugging mode

- C++
Published by kosloot almost 7 years ago

libfolia - v2.1

Bug fixes and enhancements: * provenance: - added 'generate_id' attribute with 'auto()' and 'next()' values - some code improvements * bugs: - When using the FoLiA-engine, we have to save the ORIGINAL annotationdefaults, and use these when parsing.

- C++
Published by kosloot almost 7 years ago

libfolia - v2.0

This is a major release. * Supports the new FoLiA 2.0 features: - provenance support - more stricter checking on annotation declarations - added the new TextMarkupReference class - supports Hidden Words. - All structure elements can have the 'space' attribute - support for groupannotations - many more.

  • API and ABI breaches:

    • library version bumped to version 10
    • many functions are renamed
    • the text() functions have an ENUM parameter now to select for STRICT, RETAIN or HIDDEN
  • bug fixes

    • support for xlink: improved
    • there as a rare mixup between nodes and nodes in the folia::Engine
    • all nodes get a 't' attribute now on serializing.
    • reading Extrenal FoLiA could get in an endless loop
  • code refactoring and cleanup

- C++
Published by kosloot about 7 years ago

libfolia - v1.16

Stabilizing release for folia1.5. The next release will support the new FoLiA 2.0

Changes: * renamed folia::Processor to folia::Engine * extended and improved Engine code a lot * avoid spurious newline on Document output * Will read and ignore some FoLiA 2.0 additions * numerous small additions and fixes * make sure that the XmlParser uses the HUGE model everywhere

- C++
Published by kosloot about 7 years ago

libfolia - v1.15

  • added (still experimental) code for a FoLiA Builder, Processor and TextProcessor class. Use with care. The API may change unannounced!
  • a foliadiff script (using folialint) is installed now
  • several refactorings, to make the code more clear.
  • the 'ref' attribute was not serialized for TextContent
  • several smaller small bug fixes
  • the .so version is bumped to 9 because of a lot of API/ABI changes

- C++
Published by kosloot over 7 years ago

libfolia - v1.13

[Ko van der Sloot] * disabled WordReference test. It was incomplete, and hard to do * use icu:: namespace

[Maarten van Gompel] * added codemeta.json * fix spelling errors in error messages

- C++
Published by kosloot about 8 years ago

libfolia - v1.12

  • configuration cleanup. MacOSX is better supported now.
  • folialint now supports --fixtext (handle with care!)
  • library version bumped to 8.0, due to changes in the API
  • regenerated FoLiA properties (to FoLiA version 1.5.1)
  • several small bug fixes

- C++
Published by kosloot over 8 years ago

libfolia - v1.11

Bug fix release: * handling of <comment> tags within <t> nodes * better handling of <wref> tags. Forbid forward references

- C++
Published by kosloot over 8 years ago

libfolia - v.1.10.1

Minor fix * bumped the .so version to 7.0

- C++
Published by kosloot over 8 years ago

libfolia - v1.10

Major Release, implementing FoLiA spec 1.5 * added text checking for all 1.5 documents and up * added offset and ref checking for Text in all 1.5 documents and up * 'empty' text inside TextContent, PhonContent and Textmarkup is significant * better version checking * text checking can be dis/enabled using FOLIATEXTCHECK environment variable * added submetadata mechanism * implemented aliases for annotation setnames * added an xmlstring() serializer for Document * bug fixes: - in LineBreak serializing - XmlComment is textless. - miscellaneous small fixes

- C++
Published by kosloot over 8 years ago

libfolia - v1.9

Bug fix release * accept ICU 50 too (was 52) to make CentOS happy * XmlComment INSIDE <t> lead to crashes. fixed. * code changes in code that is only executed for documents in folia 1.5 format (that shouldn't exist in the wild)

- C++
Published by kosloot almost 9 years ago

libfolia - v1.8

Implements FoLiA spec 1.4.3

API changed. Bumped library version to 6.2.0

  • added textclass attribute
  • added experimental textchecking code. only working for FoLiA documents according to spec 1.5. NOT RELEASED YET! Work in Progress
  • fix in generateid. AUTOGENERATE_ID property was ignored.
  • numerous small bug fixes

- C++
Published by kosloot almost 9 years ago

libfolia - v1.7

API changed so bumped library version to 6.1.0 [Ko van der Sloot] * textcontent() and phoncontent() return const pointers, and also work for TexContent and PhonContent elements now * some refactoring, as suggested by CPPCHECK * typos * added dangerous functions to manipulate the class of a TextContent * added reference counting on annotations. This allows to remove unneeded declarations. * small bug fixes: - str() should never throw. - avoid memory leak

[maarten van Gompel] * fixes in folia_properties for FoliA spec 1.4.1

- C++
Published by kosloot about 9 years ago

libfolia - v1.6

This release implements FoLiA spec 1.4 - ABI breakage. .so name bumped to 6.0.0 reason: - new properties added - implementation of generateId() is changed - enhancements to folialint. Saving a document with --strip also implies canonical output (--kanon) - some bug fixes

- C++
Published by kosloot over 9 years ago

libfolia - v1.5

  • Bumped the .so name. Should have been done in 1.4!
  • addition: text() mebmer for document-
  • minor bug fixes:
    • isNCname test now conforms to XML definition
    • improved am error messag in Document
    • check empty attributes in Feature() construction

- C++
Published by kosloot over 9 years ago

libfolia - v1.4

This version implements FoLiA spec: 1.3.2 - multiple ForeignData nodes - added more Feature nodes, like Polarity, Strenght - Source, Target, Relation, Predicate, Sentiment Statement, Observation Annotations and Layers. - Comment node - better version checking.(and a bit relaxed too)

some bug fixes and code improvement. - str() works more as expected - fixup ref 'id' vs. 'xml:id' - improved sanity check to better test errors in the specs. - added a language getter and setter.

- C++
Published by kosloot over 9 years ago

libfolia - v1.3.1

Very minor release update to facilitate debian packaging ( + updated README)

- C++
Published by proycon almost 10 years ago

libfolia - v1.3

Maintenance release: - long options --help and --version added - fix in LineBreak: text() generates a newline

- C++
Published by kosloot almost 10 years ago

libfolia - v1.2

This release adds new features from the FoLiA 1.2 specification. This includes: - ForeignData nodes - Foreign metadata node - less restrictions on aref/@type and ref/@type - fully implemented xlink:type="simple" and xlink:type="locator"

Also minor bug fixes and code improvements are included

- C++
Published by kosloot about 10 years ago

libfolia - v1.0.1

Bugfix release. Fixes linker and symbol lookup failure on Mac OS X (clang). Closes issue #3

- C++
Published by proycon about 10 years ago

libfolia - v1.0

Major release: Implemented on the common properties shared by both the Python and C++ versions.

From now on, both implementations will support exact the same tags, attributes and syntax.

- C++
Published by kosloot about 10 years ago

libfolia - v0.13

First release of libfolia from GIT

- C++
Published by kosloot over 10 years ago