Recent Releases of deepparse

deepparse - 0.9.13

0.9.13

Fix dependency issues with Gensim and Scipy versions.

- Python
Published by MAYAS3 over 1 year ago

deepparse - 0.9.12

0.9.12

Bug-fix the call to the BPEmb class instead of the BPEmbBaseURLWrapperBugFix to fix the download URL in download_models.

- Python
Published by davebulaval over 1 year ago

deepparse - 0.9.11

0.9.11

Fix Sentry version error in Docker Image.

- Python
Published by davebulaval over 1 year ago

deepparse - 0.9.10

Fix and improve documentation.
Remove fixed dependencies version.
Fix app errors.
Add data validation for 1) multiple consecutive whitespace and 2) newline.
Fixes some errors in tests.
Add an argument to the DatasetContainer interface to use a pre-processing data cleaning function before validation.
Hot-fix the issue with the BPEmb base URL download problem. See issue 221.
Fix the NumPy version due to a major release with breaking changes.
Fix the SciPy version due to breaking change with Gensim.
Fix circular import in the API app.
Fix deprecated max_request_body_size in Sentry.

- Python
Published by davebulaval over 1 year ago

deepparse - 0.9.9

Add version to Seq2Seq and AddressParser.
Add a Deepparse as an API using FastAPI.
Add a Dockerfile and a docker-compose.yml to build a Docker container for the API.
Bug-fix the default pre-processors that were not all apply but only the last one.

- Python
Published by davebulaval over 2 years ago

deepparse - 0.9.8 and weights release

Hot-Fix wheel install (See issue 196).

- Python
Published by davebulaval over 2 years ago

deepparse - 0.9.7

New models release with more meta-data.
Add a feature to use an AddressParser from a URI.
Add a feature to upload the trained model to a URI.
Add an example of how to use URI for parsing from and uploading to.
Improve error handling of path_to_retrain_model.
Bug-fix pre-processor error.
Add verbose override and improve verbosity handling in retrain.
Bug-fix the broken FastText installation using fasttext-wheel instead of fasttext ( see here and here).

- Python
Published by davebulaval almost 3 years ago

deepparse - 0.9.6

Add Python 3.11.
Add pre-processor when parsing addresses.
Add pin_memory=True when using a CUDA device to increase performance as suggested
by Torch documentation.
Add torch.nograd() context manager in _call__() to increase performance.
Reduce memory swap between CPU and GPU by instantiating Tensor directly on the GPU device.
Improve some Warnings clarity (i.e. category and message).
Bug-fix MacOS multiprocessing. It was impossible to use in multiprocess since we were not testing whether torch
multiprocess was set properly. Now, we set it properly and raise a warning instead of an error.
Drop Python 3.7 support since newer Python versions are faster
and Torch 2.0 does not support Python 3.7.
Improve error handling with wrong checkpoint loading in AddressParser retrain_path use.
Add torch.compile integration to improve performance (Torch 1.x still supported) with mode="reduce-overhead" as
suggested in the documentation. It
increases the performance by about 1/100.

- Python
Published by davebulaval almost 3 years ago

deepparse - 0.9.5

Fixed tags converter bug with data processor.

- Python
Published by davebulaval almost 3 years ago

deepparse - 0.9.4

Improve codebase.

- Python
Published by davebulaval about 3 years ago

deepparse - 0.9.3

Improve error handling.
Bug-fix FastText error not handled in test API.
Add feature to allow newpredictiontags to retrain CLI.

- Python
Published by davebulaval about 3 years ago

deepparse - 0.9.2

Improve Deepparse server error handling and error output
Remove deprecated argument savingdir in downloadfasttextmagnitudeembeddings and downloadfasttextembeddings functions
Add offline argument to remove verification of the latest version
Bug-fix cache handling in download model
Add download_models CLI function
https://github.com/GRAAL-Research/deepparse/issues/156

- Python
Published by davebulaval over 3 years ago

deepparse - 0.9.1

Hotfix cli.download_model attention model bug

- Python
Published by davebulaval over 3 years ago

deepparse - 0.9

Add save_model_weights method to AddressParser to save model weights (PyTorch state dictionary)
Improve CI
Added verbose flag for the test to activate or deactivate the test verbosity (it overrides the AddressParser verbosity)
Add Docker image
Add val_dataset to retrain API to allow the use of a specific val dataset for training
Remove deprecated download_from_url function
Remove deprecated dataset_container argument
Fixed error and docs
Added the UK retrain example

- Python
Published by davebulaval over 3 years ago

deepparse - Create Zenodo DOI

Create Zenodo DOI

- Python
Published by davebulaval over 3 years ago

deepparse - 0.8.2

Bug-fix retrain attention model naming parsing
Improve error handling when not a DatasetContainer is used in retrain and test API
Add DOI

- Python
Published by davebulaval over 3 years ago

deepparse - 0.8.1

Refactored function download_from_url to download_from_public_repository.
Add error management when retrain a FastText like model on Windows with a number of workers (num_workers) greater than 0.
Improve dev tooling
Improve CI
Improve code coverage and pylint
Add Codacy

- Python
Published by davebulaval over 3 years ago

deepparse - 0.8

Improve SEO.
Add cache_dir arg in all CLI functions.
Improve handling of HTTP error in models version verification.
Improve doc.
Add a note for parsing data cleaning (i.e. lowercase, commas removal, and hyphen replacing).
Add hyphen parsing cleaning step (with a bool flag to activate or not) to improve some country address parsing (see issue 137).
Add ListDatasetContainer for Python list dataset.

- Python
Published by davebulaval over 3 years ago

deepparse - 0.7.6

Re-release the version 0.7.5 into 0.7.6 due to manipulation error and change in PyPi (now delete does not delete release by yank does).

- Python
Published by davebulaval over 3 years ago

deepparse - 0.7.5

Bug-fix Poutyne version handling that causes a print error when a version is 1.11 when retraining
Add the option to create a named retrain parsing model using by default the architecture setting or using the user-given name
Hot-fix missing raise for DataError validation of address to parse when address is tuple
Bug-fix handling of string column name for CSVDatasetContainer that raised ValueError
Improve parse CLI doc and fix error in doc stating JSON format is supported as input data
Add batch_size to parse CLI
Add minimum version to Gensim 4.0.0.
Add a new CLI function, retrain, to retrain from the command line
Improve doc
Add cache_dir to the BPEmb embedding model and to AddressParser to change the embeddings cache directory and models weights cache directory
Change the saving_dir argument of download_fastext_embeddings and download_fasttext_magnitude_embeddings function to cache_dir. saving_dir is now deprecated and will be remove in version 0.8.
Add a new CLI function, test, to test from the command line

- Python
Published by davebulaval over 3 years ago

deepparse - 0.7.4

Improve parsed address print
Bug-fix #124: comma-separated list without whitespace in CSVDatasetContainer
Add a report when addresses to parse and tags list len differ
Add an example on how to fine-tune using our CSVDatasetContainer
Improve data validation for data to parse

- Python
Published by davebulaval almost 4 years ago

deepparse - 0.7.3

Add freeze layers parameters to freeze layers during retraining

- Python
Published by davebulaval almost 4 years ago

deepparse - 0.7.2

Added JSON output support
Add logging output of parse cli function
Hotfix Poutyne version handling

- Python
Published by davebulaval almost 4 years ago

deepparse - 0.7.1

Hotfix for missing dependency
Fixed bug with poutyne version handling

- Python
Published by MAYAS3 almost 4 years ago

deepparse - 0.7

Improved CLI
Fixed bug in CLI export dataset
Improved the doc of the CLI

- Python
Published by davebulaval about 4 years ago

deepparse - 0.6.7

Fixed errors in data validation
Improved doc over data validation
Bugfix data slicing error with data containers
Add an example on how to use a retrained model

- Python
Published by davebulaval about 4 years ago

deepparse - 0.6.6

Fixed errors in code examples
Improved doc of downloadfromurl
Improve error management of retrain and test

- Python
Published by davebulaval about 4 years ago

deepparse - 0.6.5

Improve error handling of empty data and whitespace-only data.
Parsing now include two validation on the data quality (not empty and not whitespace only)
DataContainer now includes data quality test (not empty, not whitespace only, tags not empty, tag the same len as an address, and data is a list of tuples)
New CSVDatasetContainer
DataContainer can now be used to predict using a flag.
Add a CLI to parse addresses from the command line.

- Python
Published by davebulaval about 4 years ago

deepparse - 0.6.4

Bugfix reloading of retraining attention model (PR #110)
Improve error handling
Improve doc

- Python
Published by davebulaval about 4 years ago

deepparse - 0.6.3

Fixed the printing capture to raise the error with Poutyne as of version 1.8. We keep the previous approach as for compatibilities with previous Poutyne version. Added a flag to disable or not Tensorboard during retraining.

- Python
Published by davebulaval about 4 years ago

deepparse - 0.6.2

Improved (slightly) code speed of data padding method as per PyTorch list or array to Tensor recommendation.
Improved doc for RuntimeError due to retraining FastText and BPEmb model in the same directory.
Added error handling RuntimeError when retraining.

- Python
Published by davebulaval about 4 years ago

deepparse -

Hot-fixed EOS bug #106

- Python
Published by davebulaval about 4 years ago

deepparse - 0.6

Added Attention mechanism models
Fixed EOS bug

- Python
Published by davebulaval about 4 years ago

deepparse - 0.5.1

Fixed address_comparer hint typing error
Fixed some docs errors
Retrain and test now have more defaults parameters
Various small code and tests improvements

- Python
Published by davebulaval over 4 years ago

deepparse - 0.5

Added Python 3.9
Added feature to allow a more flexible way to retrain
Added a feature to allow retrain of a new seq2seq architecture
Fixed prediction tags bug when parsing with new tags after retraining

- Python
Published by davebulaval over 4 years ago

deepparse - 0.4.4

Fixed ImportError.

- Python
Published by MAYAS3 over 4 years ago

deepparse - 0.4.3

Fixed typos in one name of a file.
Added tools to compare addresses (tagged or not).
Fixed some tests errors.

- Python
Published by davebulaval over 4 years ago

deepparse - 0.4.2

Added __eq__ method to FormattedParsedAddress.
Improved device management.
Improved testing.

- Python
Published by davebulaval over 4 years ago

deepparse - 0.4.1

Added method to specify the format of address components of a FormattedParsedAddress. Formatting can specify the field separator, the field to be capitalized, and the field to be upper case.

- Python
Published by davebulaval over 4 years ago

deepparse - 0.4

Added verbose flag to training and test base on the __init__ of address parser.
Added a feature to retrain our models with prediction tags dictionary different from the default one.
Added in-doc code examples.
Added code examples.
Small improvement of our model implementation.

- Python
Published by davebulaval over 4 years ago

deepparse - 0.3.6

Added a method for a dict conversion of parsed addresses for simpler Pandas integration.
Added examples for parsing addresses and how to convert them into a DataFrame.
Fixed error with download module.

- Python
Published by davebulaval over 4 years ago

deepparse - 0.3.5

Added verbose flag to training and test base on the init of address parser.
Breaking change Since SciPy 1.6 is released on Python 3.7+, we don't support Python 3.6.
Added management for Windows where the FastText model cannot be pickled. On Windows, we use Gensim fasttext model, which takes more RAM.

- Python
Published by davebulaval almost 5 years ago

deepparse - 0.3.4

Fixed a bug when use batched address. Since we were sorting the address during the forward pass, the output prediction tags were not aligned with the supposed parsed address. We have removed the sorting, and now the results are more aligned with our research.

- Python
Published by davebulaval almost 5 years ago

deepparse - 0.3.3

We have improved the loading of data during prediction. We now use a data loader.
Updated the performance table of the model with the data loader approach.
Fixed missing import in the parser module.
Bug fix of the max_len for the predictions.

- Python
Published by davebulaval almost 5 years ago

deepparse - 0.3.2

Fixed docs formatting

- Python
Published by MAYAS3 about 5 years ago

deepparse - 0.3.1

0.3

Added Libpostal time in doc Documentation improvement Added new models evaluation to doc Release new models to improve noisy data parsing (#57)

0.3.1

Fixed an error in the build

- Python
Published by davebulaval about 5 years ago

deepparse - 0.3

Added Libpostal time in doc
Documentation improvement
Added new models evaluation to doc
Release new models to improve noisy data parsing (#57)

- Python
Published by davebulaval about 5 years ago

deepparse - 0.2.3

Improved documentation

- Python
Published by davebulaval about 5 years ago

deepparse - 0.2.2

Fixed error with experiment and verbosity as logging trigger on or off

- Python
Published by davebulaval about 5 years ago

deepparse - 0.2.1

Fixed README

- Python
Published by davebulaval about 5 years ago

deepparse - 0.2

Added Fasttext light model using pymagnitude-light Added finetuning API to allow finetuning of our models (retrain) Added a method to test models (test) Added metric, loss and dataset container to facilitate finetuning training Added an example of finetuning Added way to load retrain model when instantiation of AddressParser

- Python
Published by davebulaval about 5 years ago

deepparse - 0.1.3.1

Added verbose flag for model weights download.

- Python
Published by davebulaval about 5 years ago

deepparse - 0.1.3

Added "contributing to" Added fix for comma problem (#56) Added content in Address Parser doc for tags definition Fixed Pylint bug with PyTorch 1.6 Fixed pack_padded cpu error with PyTorch new release (https://github.com/pytorch/pytorch/issues/43227)

- Python
Published by davebulaval over 5 years ago

deepparse - 0.1.2

Modification of assets URL Bugfix dictionary #51 Fixed logo Fixed deepParse -> deepparse Fixed setup authors

- Python
Published by davebulaval over 5 years ago

deepparse - Beta release

Initial release of the pre-trained models
Beta version of the API

We are working on a way to reduce the memory footprint for fastText model. We welcome comments and issues to improve the API.

- Python
Published by davebulaval over 5 years ago