Recent Releases of tensor2tensor
tensor2tensor - v1.15.7
- Multistep Adam Optimizer many thanks to @AgoloCuongHoang for contributing in #1773 !
- Residual Shuffle-Exchange Network thanks to @EmilsOzolins in #1805 !
- Not pinning the gym version.
- Python
Published by afrozenator about 6 years ago
tensor2tensor - v1.15.6
Added basic support for TF2 modeling in f65b5e4e0be50b284f9b21d56d3d2a46792cdecf thanks to @rjpower !
Other misc fixes: - Fixing feature encoder for tf.string variable length features. - adding hparam to make encoder self-attention optional. - Documentation update, thanks @w-hat
- Python
Published by afrozenator about 6 years ago
tensor2tensor - v1.15.5
PRs: * #1788 by @mzilinec adding an option to select different TPU zone.
Some more code cleanups, regarding tf.compat.v1
- Python
Published by afrozenator about 6 years ago
tensor2tensor - v1.15.4
- Flush out some more contrib remnants.
- Python
Published by afrozenator over 6 years ago
tensor2tensor - v1.15.3
- Some changes to handle 1.x to 2.x for tf contrib
- TODO(afrozm): Write more
- Python
Published by afrozenator over 6 years ago
tensor2tensor - v1.15.2
Some changes needed to be able to import problems with TF 2.0
- Python
Published by afrozenator over 6 years ago
tensor2tensor - v1.15.1
- Move away from tf.flags to absl-py's flags.
- Move away from std::string to tensorflow::string
- Python
Published by afrozenator over 6 years ago
tensor2tensor - v1.15.0
Final T2T major release
It is now in maintenance mode — we keep it running and welcome bug-fixes, but encourage users to use the successor library Trax.
PRs Merged
- #1724 by @Separius - use batchsize in _testimg2img_transformer thanks!
- #1726 by @senarvi - Fix decoding in prepend mode thanks!
- #1733 by @prasastoadi - En-Id untokenized parallel corpora thanks!
- #1748 by @gabegrand adding a Text2RealProblem class -- thanks a lot @gabegrand
Bug Fixes
- Fix features and decoding on TPUs by @mts42000
- @iansimon and Kristy Choi around shape assertions and modalities
- @superbobry fixed cases where tf.TensorShape was constructed with float dimensions
Misc
- Trax was moved into its own repo: https://github.com/google/trax
- Python
Published by afrozenator over 6 years ago
tensor2tensor - v1.14.1
PRs Merged
- #1720 thanks @przemb
- #1698 #1699 test/util file fixes thanks to @Vooblin
- Fix serving response from Cloud ML Engine (#1688) thanks to @evalphobia
- Refine automatic mixed precision support via hyper param (#1681) thanks @vinhngx
- correct return shape of relpos2abspos() (#1686) thanks to @Separius
- save attention weights for relative attention v2 (#1682) thanks to @Ghostvv
- Update generator_utils.py (#1674) thanks to @TanguyUrvoy
Docs
- Transformer tutorial (#1675) many thanks to @Styleoshin
Problems
- 4 new dialog problems by @ricsinaruto in #1642
Models
- Extend NeuralStack to support Dequeu by reading/writing in both directions, thanks @narphorium
TRAX
- Lots of work on SimPLe tuning hyperparameters by @koz4k , @lukaszkaiser and @afrozenator
- async data collection for RL in TRAX
- New memory efficient Transformer using Reversible layers, thanks to Nikita Kitaev, @lukaszkaiser and Anselm Levskaya
- Losses and metrics are layers now in trax, thanks to @lukaszkaiser
- Activations in TRAX thanks to @joaogui1 in #1684 and #1666
- Python
Published by afrozenator over 6 years ago
tensor2tensor - v1.14.0
Models / Layers:
- NeuralStack and NeuralQueue added, in https://github.com/tensorflow/tensor2tensor/commit/838aca4960f851cd759307481ea904038c1a1ab5 - thanks @narphorium !
- Open Sourcing the Search Space used in EvolvedTransformer - https://github.com/tensorflow/tensor2tensor/commit/4ce366131ce69d1005f035e14677609f7dfdb580
- Masked local n-D attention added in - https://github.com/tensorflow/tensor2tensor/commit/2da59d24eb9367cbed20c98df559beccd11b7582
Problems:
- Add English-Spanish translation problem (#1626) thanks @voluntadpear !
- MovingMNist added in https://github.com/tensorflow/tensor2tensor/commit/121ee60a3b57a092264aa5b5bf69ad194cafb118 thanks @MechCoder !
Bug Fixes:
- Loss twice multiplied with loss_coef (#1627) by @davidmrau - thanks a lot David!
- Fix log_prob accumulation during decoding, thanks @lmthang !
- Fixed high usage of TPU HBM "Arguments" during serving in https://github.com/tensorflow/tensor2tensor/commit/d38f3435ded822e585d1fc7136f3ece857a41c8d thanks @ziy !
- Should not generate summary during decoding in dotproductrelative_atention (#1618) thanks @phamthuonghai !
Misc changes:
- Implement sequence packing as a tf.data.Dataset transformation - https://github.com/tensorflow/tensor2tensor/commit/560c008f7d87502174765fac5ae3d822bbf6b243 thanks @robieta !
- Lots of work on t2t_distill and model exporting by @ziy - thanks @ziy !
RL:
Introduce Rainbow. (#1607) by @konradczechowski in #1607 Changes to MBRL by @konradczechowski , @koz4k in multiple PRs.
PRs:
- Adding automatic mixed precision support (#1637) thanks a lot to @vinhngx !
- Documentation for creating own model #1589 thanks @hbrylkowski !
- Adding extra linear to semantic hashing discretization bottleneck. #1578 thanks @martiansideofthemoon !
- Using partial targets at inference time. (#1596) thanks @EugKar !
- Updated link to DeepMind Math dataset (#1583) thanks @MaxSobolMark !
- Only strip end of line (#1577) thanks @funtion !
- correct typo in addtimingsignal_nd (#1651) many thanks to @Separius !
- fix decode bug (#1645) many thanks to @dong-s !
- Change confusing function name (#1669) thanks @lazylife7157 !
TRAX:
Base
- Forked optimizers from JAX and make them objects in https://github.com/tensorflow/tensor2tensor/commit/1c7c10c60abc31308b40ae6c850e5c9e363dd4a9
- Trax layers are now stateful and support custom gradients.
- Multi-device capability added.
- Memory efficient trainer added in https://github.com/tensorflow/tensor2tensor/commit/b2615aab938af99418ac0d1318338bf3030357fa ! Thanks Nikita Kitaev!
- Adafactor optimizer added in TRAX - https://github.com/tensorflow/tensor2tensor/commit/63c015f964c1166d181d8efd232abd856574fd83
- Demo Colab added in https://github.com/tensorflow/tensor2tensor/commit/cec26dbd782ea7e4c07377e8d1f9391eb0c5a65c thanks @levskaya
- Demo colab for trax layers - https://github.com/tensorflow/tensor2tensor/commit/7632ed01e739cd124c8bac85f121f0f49ddd86cf
- Transformer, TransformerLM, Reversible Transformer, PositionLookupTransformer and Resnet50 are some of the models that TRAX now supports.
RL
- Many PPO changes to be able to work on Atari.
- Distributed PPO where the envs can run in multiple parallel machines using gRPC
- SimulatedEnvProblem by @koz4k - a gym env that simulates a step taken by a trainer of a Neural Network in https://github.com/tensorflow/tensor2tensor/commit/2c761783a7aacd6800d445d10ad3676a56365514
- Implement SerializedSequenceSimulatedEnvProblem
by @koz4k
- https://github.com/tensorflow/tensor2tensor/commit/f7f8549a6421723154b366996b2c6559048ac3fb
- Transformer can be used as a policy now, thanks to @koz4k in https://github.com/tensorflow/tensor2tensor/commit/33783fd63bd0debe2138c5569698b31d9af350f6 !
- Python
Published by afrozenator almost 7 years ago
tensor2tensor - v1.13.4
Minor fix to 1.13.3, please see release notes there.
- Python
Published by afrozenator about 7 years ago
tensor2tensor - v1.13.3
TODO(afrozm): Document more.
- Various PRs.
- Development on TRAX
- Python
Published by afrozenator about 7 years ago
tensor2tensor - v1.13.2
- jax, jaxlib moved to extras in setup.py
PRs: fixed getstandardizedlayers spelling, thanks @cbockman in #1529 serving utils fixes - Thanks @Drunkar ! in #1495 Fixing a checkpoint name bug in #1487, thanks @lzhang10
Enhancements: * DeepMind Math dataset. * VideoGlow paper added to T2T Papers. * Mixture Transformer * A very basic PPO implementation in TRAX. * More TRAX and RL changes.
Bugs: Correct flat CIFAR modality to not consider 0 as padding
- Python
Published by afrozenator about 7 years ago
tensor2tensor - v1.13.0
** Modalities refactor: Thanks to Dustin, all modalities are now an enum and just functions, making it easier to understand what's happening in the model. Thanks Dustin!
Model-Based Reinforcement Learning for Atari using T2T, please find a nice writeup in at https://github.com/tensorflow/tensor2tensor/blob/master/tensor2tensor/rl/README.md -- thanks a lot to all the authors! @lukaszkaiser @mbz @piotrmilos @blazejosinski Roy Campbell @konradczechowski @doomie Chelsea Finn @koz4k Sergey Levine @rsepassi George Tucker and @henrykmichalewski !
TRAX = T2T + [JAX](https://github.com/google/jax) - please try out and give us feedback at #1478
New Models: * Evolved Transformer, thanks @stefan-it for adding the paper in #1426 * textCNN model by @ybbaigo in #1421
Documentation and Logging: * MultiProblem by @cwbeitel in #1399 * ML Enginge logging in #1390 by @lgeiger
Thanks again @cwbeitel and @lgeiger -- good docs and logging goes a long way for understandability.
Bugs fixed: * t2t_decoder checkpoint fix in #1471 by @wanqizhu * xrange fix for py3 by in #1468 @lgeiger * Fixing COCO dataset in #1466 by @hbrylkowski * Fix math problems by @artitw * Decoding rev problems enzh by @googlehjx on #1389 * And honourable mentions to @qixiuai , #1440
Many many thanks @wanqizhu @lgeiger @hbrylkowski @artitw @googlehjx and @qixiuai for finding and fixing these and sorry for missing anyone else -- this is really really helpful.
Code Cleanups: * Registry refactor and optimizer registry by @jackd in #1410 and #1401 * Numerous very nice cleanup PRs ex: #1454 #1451 #1446 #1444 #1424 #1411 #1350 by @lgeiger
Many thanks for the cleanups @jackd and @lgeiger -- and sorry if I missed anyone else.
- Python
Published by afrozenator about 7 years ago
tensor2tensor - v1.13.1
Bug Fixes: * RL fixes for Model Based RL in #1505 - thanks @koz4k * Serving util corrections in #1495 by @Drunkar -- thanks! * Fix step size extraction in checkpoints by @lzhang10 in #1487 -- thanks!
- Python
Published by afrozenator about 7 years ago
tensor2tensor - v.1.12.0
Summary of changes:
PRs: * A lot of code cleanup thanks a ton to @lgeiger ! This goes a long way with regards to code maintainability and is much appreciated. Ex: PR #1361 , #1350 , #1344 , #1346 , #1345 , #1324 * Fixing LM decode, thanks @mikeymezher - PR #1282 * More fast decoding by @gcampax, thanks! - PR #999 * Avoid error on beam search - PR #1302 by @aeloyq , thanks! * Fix invalid list comprehension, unicode simplifications, py3 fixes #1343, #1318 , #1321, #1258 thanks @cclauss ! * Fix isgeneratepersplit hard to spot bug, thanks a lot to @kngxscn in PR #1322 * Fix py3 compatibility issues in PR #1300 by @ywkim , thanks a lot again! * Separate train and test data in MRPC and fix broken link in PR #1281 and #1247 by @ywkim - thanks for the hawk eyed change! * Fix universal transformer decoding by @artitw in PR #1257 * Fix babi generator by @artitw in PR #1235 * Fix transformer moe in #1233 by @twilightdema - thanks! * Universal Transformer bugs corrected in #1213 by @cfiken - thanks! * Change beam decoder stopping condition, makes decode faster in #965 by @mirkobronzi - many thanks! * Bug fix, problem0_steps variable by @senarvi in #1273 * Fixing a typo, by @hsm207 in PR #1329 , thanks a lot!
New Model and Problems: * New problem and model by @artitw in PR #1290 - thanks! * New model for scalar regression in PR #1332 thanks to @Kotober * Text CNN for classification in PR #1271 by @ybbaigo - thanks a lot! * en-ro translation by @lukaszkaiser ! * CoNLL2002 Named Entity Recognition problem added in #1253 by @ybbaigo - thanks!
New Metrics: * Pearson Correlation metrics in #1274 by @luffy06 - thanks a lot! * Custom evaluation metrics, this was one of the most asked features, thanks a lot @ywkim in PR #1336 * Word Error Rate metric by @stefan-falk in PR #1242 , many thanks! * SARI score for paraphrasing added.
Enhancements: * Fast decoding !! Huge thanks to @aeloyq in #1295 * Fast GELU unit * Relative dot product visualization PR #1303 thanks @aeloyq ! * New MTF models and enhacements, thanks to Noam, Niki and the MTF team * Custom eval hooks in PR #1284 by @theorm - thanks a lot !
RL: Lots of commits to Model Based Reinforcement Learning code by @konradczechowski @koz4k @blazejosinski @piotrmilos - thanks all !
- Python
Published by afrozenator over 7 years ago
tensor2tensor - v1.11.0
PRs: * Bug fixes in the insight server thanks to @haukurb ! * Fix weights initialization in #1196 by @mikeymezher - thanks ! * Fix Universal Transformer convergence by @MostafaDehghani and @rllin-fathom in #1194 and #1192 - thanks ! * Fix add problem hparams after parsing the overrides in #1053 thanks @gcampax ! * Fixing error of passing wrong dir in #1185 by @stefan-falk , thanks !
New Problems: * Wikipedia Multiproblems by @urvashik - thanks ! * New LM problems in de, fr, ro by @lukaszkaiser - thanks !
RL: * Continual addition to Model Based RL by @piotrmilos , @konradczechowski @koz4k and @blazejosinski !
Video Models: * Many continual updates thanks to @mbz and @MechCoder - thanks all !
- Python
Published by afrozenator over 7 years ago
tensor2tensor - v1.10.0
NOTE: - MTF code in Tensor2Tensor has been moved to github.com/tensorflow/mesh - thanks @dustinvtran
New Problems: - English-Setswana translation problem, thanks @jaderabbit
New layers, models, etc: - Add Bayesian feedforward layer, thanks @dustinvtran - Lots of changes to the RL pipeline, thanks @koz4k , @blazejosinski , @piotrmilos , @lukaszkaiser , @konradczechowski - Lots of work on video mdoels, thanks @mbz , @MechCoder - Image transformer with local1d and local 2d spatial partitioning, thanks @nikiparmar @vaswani
Usability: - Support DistributionStrategy in Tensor2Tensor for multi-GPU, thanks @smit-hinsu ! - Pass datadir to featureencoders, thanks @stefan-falk - variablescope wrapper for avgcheckpoints, thanks @Mehrad0711 - Modalities cleanup, thanks @dustinvtran - Avoid NaN while adding sinusoidal timing signals, thanks @peakji - Avoid a ascii codec error in CNN/DailyMail, thanks @shahzeb1 - Allow exporting T2T models as tfhub modules, thanks @cyfra
- Python
Published by afrozenator over 7 years ago
tensor2tensor - v1.9.0
PRs accepted: Cleaning up the code for gru/lstm as transition function for universal transformer. Thanks @MostafaDehghani ! Clipwrapper by @piotrmilos ! Corrected transformer spelling mistake - Thanks @jurasofish! Fix to universal transformer update weights - Thanks @cbockman and @cyvius96 ! Common Voice problem fixes and refactoring - Thanks @tlatkowski ! Infer observation datatype and shape from the environment - Thanks @koz4k !
New Problems / Models:
* Added a simple discrete autoencoder video model. Thanks @lukaszkaiser !
* DistributedText2TextProblem, a base class for Text2TextProblem for large-datasets. Thanks @afrozenator!
* Stanford Natural Language Inference problem added StanfordNLI in stanford_nli.py. Thanks @urvashik !
* Text2TextRemotedir added for problems with a persistent remote directory. Thanks @rsepassi !
* Add a separate binary for vocabulary file generation for subclasses of Text2TextProblem. Thanks @afrozenator!
* Added support for non-deterministic ATARI modes and sticky keys. Thanks @mbz !
* Pretraining schedule added to MultiProblem and reweighting losses. Thanks @urvashik !
* SummarizeWikiPretrainSeqToSeq32k and Text2textElmo added.
* AutoencoderResidualVAE added, thanks @lukaszkaiser !
* Discriminator changes by @lukaszkaiser and @aidangomez
* Allow scheduled sampling in basic video model, simplify default video modality. Thanks @lukaszkaiser !
Code Cleanups: * Use standard vocab naming and fixing translate data generation. Thanks @rsepassi ! * Replaced manual ops w/ dotproductattention in maskedlocalattention_1d. Thanks @dustinvtran ! * Eager tests! Thanks @dustinvtran ! * Separate out a video/ directory in models/. Thanks @lukaszkaiser ! * Speed up RL test - thanks @lukaszkaiser !
Bug Fixes: * Don't daisy-chain variables in Universal Transformer. Thanks @lukaszkaiser ! * Corrections to mixing, dropout and sampling in autoencoders. Thanks @lukaszkaiser ! * WSJ parsing only to use 1000 examples for building vocab. * Fixed scoring crash on empty targets. Thanks David Grangier! * Bug fix in transformer_vae.py
Enhancements to MTF, Video Models and much more!
- Python
Published by afrozenator almost 8 years ago
tensor2tensor - v1.8.0
Introducing MeshTensorFlow - this enables training really big models O(Billions) of parameters.
Models/Layers:
* Layers Added: NAC and NALU from https://arxiv.org/abs/1808.00508 Thanks @lukaszkaiser !
* Added a sparse graph neural net message passing layer to tensor2tensor.
* Targeted dropout added to ResNet. Thanks @aidangomez !
* Added VQA models in models/research/vqa_*
* Added Weight Normalization layer from https://arxiv.org/abs/1602.07868.
Datasets/Problems:
* MSCoCo paraphrase problem added by @tlatkowski - many thanks!
* VideoBairRobotPushingWithActions by @mbz !
Usability: * Code cleaup in autoencoder, works both on image and text. Thanks @lukaszkaiser * Set the default value of Text2TextProblem.maxsubtokenlength to 200, this prevents very long vocabulary generation times. Thanks @afrozenator * Add examples to distributedtraining.md, update support for async training, and simplify runstdserver codepath. Thanks @rsepassi ! * Store variable scopes in T2TModel; add T2TModel.initializefromckpt. Thanks @rsepassi ! * Undeprecate exporting the model from the trainer Thanks @gcampax ! * Doc fixes, thanks to @stefan-it :) * Added t2tprune: simple magnitude-based pruning script for T2T Thanks @aidangomez ! * Added task sampling support for more than two tasks. Thanks @urvashik !
Bug Fixes:
* Override servinginputfn for video problems.
* StackWrapper eliminates problem with repeating actions. Thanks @blazejosinski !
* Calculated lengths of sequences using raw in lstm.py
* Update universaltransformer_util.py to fix TypeError Thanks @zxqchat !
Testing: * Serving tests re-enabled on Travis using Docker. Thanks @rsepassi !
Many more fixes, tests and work on RL, Glow, SAVP, Video and other models and problems.
- Python
Published by afrozenator almost 8 years ago
tensor2tensor - v1.7.0
- Added a MultiProblem class for Multitask Learning. Thanks @urvashik !
- Added decoding option to pass through the features dictionary to predictions. Thanks @rsepassi !
- Enabled MLEngine path to use Cloud TPUs. Thanks @rsepassi !
- Added a simple One-Hot Symbol modality. Thanks @mbz !
Added Cleverhans integration. Thanks @aidangomez !
Problem definitions added for:
- Allen Brain Atlas problems. Thanks @cwbeitel !
- LSUN Bedrooms dataset.
- Added various NLP datasets. Thanks @urvashik !
- MSR Paraphrase Corpus,
- Quora Question Pairs,
- Stanford Sentiment Treebank,
- Question Answering NLI classification problems,
- Recognizing Textual Entailment,
- Corpus of Linguistic Acceptability,
- Winograd NLI.
- Added a data generator for WSJ parsing.
Model additions:
- Implemented Targeted Dropout for Posthoc Pruning. Thanks @aidangomez !
- Added self attention to VQA attention model.
- Added fast block parallel transformer model
- Implemented auxiliary losses from Stochastic Activation Pruning for Robust Adversarial Defense. Thanks @alexyku !
- Added probability based scheduled sampling for SV2P problem. Thanks @mbz !
- Reimplementated Autoencoder and Eval. Thanks @piotrmilos !
- Relative memory efficient unmasked self-attention.
Notable bug fixes:
- bug with data_gen in style transfer problem Thanks @tlatkowski !
- wmt_enfr dataset should not use vocabulary based on "small" dataset. Thanks @nshazeer !
Many more fixes, tests and work on Model based RL, Transfomer, Video and other models and problems.
- Python
Published by afrozenator almost 8 years ago
tensor2tensor - v1.6.6
- added Mozilla common voice as Problem and style transfer one others!
- improvements to ASR data preprocessing (thanks to jarfo)
- decoding works for Transformer on TPUs and for timeseries problems
- corrections and refactoring of the RL part
- Removed deprecated Experiment API code, and support SessionRunHooks on TPU.
- many other corrections and work on video problems, latent variables and other
Great thanks to everyone!
- Python
Published by lukaszkaiser almost 8 years ago
tensor2tensor - v1.6.5
registry.hparamsnow returns anHParamsobject instead of a function that returns anHParamsobject- New
MultistepAdamOptimizerthanks to @fstahlberg - New video models and problems and improvements to
VideoProblem - Added
pylintrcand lint tests to Travis CI - Various fixes, improvements, and additions
- Python
Published by rsepassi about 8 years ago
tensor2tensor - v1.6.3
--random_seedis unset by default now. Set it to an integer value to get reproducible results.- bAbI text understanding tasks added
- Have the ML Engine and TPU codepaths use TF 1.8
- Various cloud-related bug fixes
WikisumWebdata generation fixes- Various other fixes
- Python
Published by rsepassi about 8 years ago
tensor2tensor - v1.6.2
- Lambada and wikitext103 datasets.
- ASR model with Transformer and iPython notebook.
- Many other improvements including RL code, autoencoders, the latent transformer (transformer_vae) and more.
- Python
Published by lukaszkaiser about 8 years ago
tensor2tensor - v1.6.1
- Release scripts to generate the data for Generating Wikipedia by Summarizing Long Sequences
- New
RTransformermodel, a recurrent Transformer - New English-Estonian translation dataset thanks to @stefan-it
- New
ROC_AUCmetric thanks to @jjtan - Various fixes, improvements, additions, etc.
- Python
Published by rsepassi about 8 years ago
tensor2tensor - v1.6.0
--problemscommand-line flag renamed to--problemhparams.problemsrenamed tohparams.problem_hparamsandhparams.problem_instancesrenamed tohparams.problem(and neither are lists now)- Dropped support for TensorFlow 1.4
- Various additions, fixes, etc.
- Python
Published by rsepassi about 8 years ago
tensor2tensor - v1.5.7
- Distillation codepath added
- Improved support for serving language models
- New
TransformerScorermodel which return log prob of targets oninfer - Support for
bfloat16weights and activations on TPU - SRU gate added to
common_layers --checkpoint_pathsupported in interactive decoding- Improved support for multiple outputs
VideoProblembase class- Various fixes, additions, etc.
- Python
Published by rsepassi about 8 years ago
tensor2tensor - v1.5.6
- Scalar summary support on TPUs
- New
SquadandSquadConcatproblem for question answering (and relevant base class) - New video problems
bfloat16support forTransformeron TPUs- New
SigmoidClassLabelModalityfor binary classification - Support batch prediction with Cloud ML Engine
- Various fixes, improvements, additions
- Python
Published by rsepassi about 8 years ago
tensor2tensor - v1.5.5
- Updates to experimental RL codebase
ImageTransformeron TPU- Various updates, fixes, additions, etc.
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.5.4
- Updates to the RL codebase
- Tests updated to use TensorFlow 1.6
- Various fixes, additions, etc.
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.5.3
- More flexible Cloud ML Engine usage thanks to @bbarnes52
- Fixes thanks to @stefan-it @wes-turner @deasuke @bwilbertz
- Various other additions, fixes, etc.
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.5.2
Note: The Text2TextProblem has been refactored so if you have subclassed it you may need to rename some methods. Some vocabulary files may need to be renamed as well.
Text2TextProblem,Text2ClassProblemandText2SelfProblembase classes make specifying new text-based problems easy. See text_problems.py.- New models and problems, including for image generation and speech-to-text
- Various bug fixes, feature additions, improvements, etc.
- Test model export and serving for Python 2.7 and TensorFlow 1.5
- Update Travis tests to test against TensorFlow version 1.4, 1.5, and 1.6
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.5.1
- TF 1.4 compatibility bug fix for Cloud ML Engine
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.5.0
- Launch training on Cloud TPUs
- Launch training and hyperparameter tuning on Cloud ML Engine
- New
models/researchsubdirectory for more experimental models - Some documentation updates
- Bug fixes
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.4.4
- Cloud ML Engine support added
- New experimental RL module thanks to @piotrmilos
- Various bug fixes, improvements, etc.
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.4.3
Note: Tensor2Tensor now requires TensorFlow 1.5.
- Working
t2t-bleuthanks to @martinpopel - Improvements to image models:
resnet,revnet, andshake_shake - Image problems refactor: faster input pipeline, richer ImageNet data preprocessing. Note that
ImageModality.bottomno longer normalizes images; that's now done in the input pipeline. - Improvements for running on Google's Cloud TPUs, coming to you soon...
- Various bug fixes, improvements, and additions
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.4.2
- New export method for exporting to TensorFlow Serving
- Script for BLEU evaluation thanks to @martinpopel
- Better TensorBoard metrics (what was removed has returned), with options to summarize gradients (
--hparams='summarize_grads=True') - Various bug fixes, doc updates, new features, as usual
Internals:
- Scripts in
bin/are now thin and executable - Main training utility library moved to
trainer_lib.py
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.4.1
- Support for multi-device evaluation
- Support for early stopping in distributed training
- Refactor Librispeech problem to use a new speech recognition base class
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.4.0
This release is a significant refactor of T2T internals.
T2TModelsubclasses now have the ability to override the entire Estimator model function with theestimator_model_fnmethod, making them much more flexible. Subclasses can also now overridebottom,body,top,loss, andoptimize.Problemsubclasses now have the ability to override the entire Estimator input function with theinput_fnmethod, making them much more flexible.- The key components of the trainer and decoder -
Experiment,Estimator,RunConfig,HParams- are all much more easily constructed and used by library callers throughtpu_trainer_lib.py. - We decided to drop support for MultiModel, i.e. training on multiple problems, because it added too much code complexity for the benefit gained. We will consider adding support back in a way that doesn't overcomplicate things too much if there's sufficient interest.
There are also the usual new models, feature improvements, bug fixes.
- New
image_fashion_mnistdataset - New
revnet104model, implementing a large Reversible Residual Network - Set
--decode_hparams=write_beam_scores=Trueto include beam scores when writing to a file - Beginnings of new interactive visualization server at insights/
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.3.2
- Small improvements for attention vizualization in colab.
- Python
Published by lukaszkaiser over 8 years ago
tensor2tensor - v1.3.1
- Improvements for TF Eager compatibility
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.3.0
- WARNING: Checkpoints produced with old versions break with this new release due to new variable scoping
- Various changes make T2T models and problems compatible with the new TF Eager mode - we'll have more on that soon
tpu_trainerbecoming more fully featured- Internal refactoring moving towards more flexibility in specifying the Estimator
input_fnandmodel_fn
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.2.8
- Batch norm should now work in T2T - fixed the custom variable getters
- Simplified
ImageModalityand removal ofSmallImageModality - Simplified
ClassLabelModalityand removal ofClassLabel1DModality - New modality with CTC loss
- New vanilla_gan model that's a good example of a simple GAN
- TPU advances: Xception, Resnet50, and Transformer verified to work, code path uses Experiment, usage doc for Cloud TPU alpha customers
- Various small fixes, improvements, features
- Python
Published by rsepassi over 8 years ago
tensor2tensor -
- Fixed data generators for translation tasks. Great thanks to @vince62s and @martinpopel for your PRs and reviews and all the help!
- Updated LSTM models and attention. Great thanks @kolloldas for the attention work and @epurdyf for pointing out initializer problems!
- Added some variations of the transformer model.
- Bug-fixes and cleanups.
- Python
Published by lukaszkaiser over 8 years ago
tensor2tensor - v1.2.6
- Refactored Translate problems courtesy of @vince62s
- Fast beam search decoding for the Transformer model (set
--decode_hparams='use_last_position=True'with thet2t-decoderto use it) - Various improvements and bug fixes
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.2.5
- Various bug fixes, improvements, and additions
- Checkpoint Breaking Note: We'd like to have good defaults as well as have immutable hparams sets and so we have an updating naming scheme we're trying, starting with Transformer hparams.
transformer_baseis now an alias name that points to a versioned hparams set, i.e.transformer_basenow callstransformer_base_v2. The previous version oftransformer_baseis nowtransformer_base_v1(so if you have an old checkpoint, use hparams settransfomer_base_v1). This way if you're just trying something out, you can usetransformer_baseand know that you have a set of up-to-date good defaults. If you want to maintain reproducibility across T2T versions, use one of the versioned names, e.g.transformer_base_v2.
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.2.4
- Various cleanups, fixes, and feature additions - see commit history
- More robust Travis CI tests
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.2.3
- Transformer now supports fast decoding! The decoding path used to recompute the entire sequence on each additional timestep, but now caches as it goes.
- We now support
SavedModelexports - New more thorough documentation
- Travis builds for all PRs and commits for Python 2 and 3!
- The decoding flags for
t2t_decoderhave all been merged into a singleHParamsobject that can be modified with the flag--decode_hparams - Various feature additions, bug fixes, and improvements
- Note: Parameter checkpoints for the Transformer model may be broken because of a bug with variable sharing in
layer_norm
- Python
Published by rsepassi over 8 years ago
tensor2tensor - v1.2.2
- Easy modular dataset construction with
Problem.dataset - New
ClassLabelEncoderfor classification problems - Decoding now only loads the model once for all decodes (no more reloads across batches)
- Various other fixes, improvements, including a move to
tf.estimator.Estimator
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.2.1
- New
t2t-decoderbinary for decoding.t2t-trainerno longer does decoding - Various fixes and improvements to code and documentation
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.2.0
- We've updated our TensorFlow requirement to version 1.3.0, which allowed us to move our input pipeline to the new
tf.contrib.data.DatasetAPI - Attention visualizations available by iPython notebook
- Multiple new layers and utilities, bug fixes, etc.
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.1.9
- Many problems have been renamed to make the hierarchy more clear; you will need to rename input files
- Various model and layer improvements and additions
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.1.8
- Simplified mixture of experts code - breaks checkpoints that use them
- New models and attention functions
- Other cleanups and corrections
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.1.7
- Improved residual block structure; note that this breaks old checkpoints (sorry!)
- Multiple improvements in utilities (
T2TModel, evaluation, new modalities, new metrics)
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.1.5
Large code moves:
* common_*.py and modalities.py have moved from models/ to the new folder layers/. models/ folder is now just for T2TModel definitions.
* trainer_utils.py has been broken up into several files (devices.py, decoding.py, input_fn_builder.py, model_builder.py.
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.1.1
- One-line datagen/train/decode!
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.1.0
- Pluggable
Problems for both data generation and training. - Multiple new problems and models added, including a Chinese-English translation task
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.0.14
- Datasets/problems can now be registered so no need to edit the T2T codebase to add one
- Distributed training has been fixed
- New Chinese-English dataset
wmt_zhen_tokens_32k - New optimizer available:
YellowFinOptimizer
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.0.13
- Corrected
TF_CONFIGfor distributed training make_tf_configs.pyis now an installed scriptt2t-make-tf-configs
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - 1.0.11
Correcting text encoder problems and other merges.
- Python
Published by lukaszkaiser almost 9 years ago
tensor2tensor - v1.0.7
- Improved Python 3 compatibility in data generation
- New BlueModel with shake-shake
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.0.6
- Small improvements to data generation (download progress bar, atomic download to destination file to avoid partially downloaded data corruption)
- Small bug fixes and doc updates
- Python
Published by rsepassi almost 9 years ago
tensor2tensor - v1.0.5
- Enable users to register models and hparams without editing the code
- Parameter attention
- We have a gitter room!
- Python
Published by rsepassi almost 9 years ago