What's Changed

Features

fix imports by @jonas-eschle in https://github.com/zfit/zfit/pull/653

Full Changelog: https://github.com/zfit/zfit/compare/0.27.0...0.27.1

- Python
Published by jonas-eschle 11 months ago

zfit - Major speedups and fixes for binned PDF

0.27.0 (10 Jul 2025)

Major Features and Improvements

significantly improved performance, especially when using a GPU for certain fits.

Breaking changes

default padding of KDE switched to 0.1 instead of False. This should only improve KDEs at the boundary but technically changes the behavior

Bug fixes and small changes

Fix KDE1DimExact and ExponentialTFP incorrectly handling label parameter: both classes now properly store and return the label attribute separately from name
Fix AttributeError when calling freeze() method twice on FitResult
Add PositivePDF functor and to_positive() method to BasePDF for ensuring PDF output values are always positive with a minimum epsilon (default is 1e-100). This also handles NaN values by replacing them with epsilon.
Add clipping functionality to parameter setting methods: set_value and set_values now accept a clip parameter that clips values to parameter limits instead of raising errors when values are outside bounds
Remove conditional numeric checks in favor of unconditional assertions: some internal checks that were only performed when run.numeric_checks=True are now always performed. To re-enable the old conditional behavior, set zfit.run.numeric_checks = True for debugging numerical issues
Clean up code by removing commented debug code, unused variables, and duplicate imports
Simplify assertion handling in numerical integration to properly filter out None operations
Add FitResult.x attribute to access the best fit values of the parameters directly.
Add OptimizeResultMixin to FitResult providing full scipy.optimize.OptimizeResult compatibility with attributes like success, fun, x, jac, hess, hess_inv, nfev, njev, nhev, nit, and maxcv
Fix critical bugs in binned PDF extended normalization methods:
- _auto_ext_pdf incorrectly used ext_normalization() instead of normalization(), causing extended PDF values to be multiplied by yield squared instead of yield
- _auto_ext_log_pdf incorrectly used znp.log(ext_normalization(norm)) instead of log_normalization(norm)
add to_cached to unbinned PDFs, returning a cached PDF. These do not support analytic gradients, an error will be raised if attempted.
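The effect described for to_positive() in the list above can be pictured with a small stand-alone sketch. It is not zfit's implementation: the helper floor_positive and its argument names are hypothetical and it works on plain Python floats. Only the default epsilon of 1e-100 and the replacement of NaN values are taken from the note above; flooring values between zero and epsilon is an assumption.

```python
import math

EPSILON = 1e-100  # default minimum value quoted in the release note


def floor_positive(values, epsilon=EPSILON):
    """Return values with NaNs and anything below epsilon replaced by epsilon.

    Hypothetical helper for illustration only, not part of zfit.
    """
    cleaned = []
    for value in values:
        if math.isnan(value) or value < epsilon:
            cleaned.append(epsilon)
        else:
            cleaned.append(value)
    return cleaned


raw = [0.3, 0.0, -1e-12, float("nan"), 2.5]
safe = floor_positive(raw)
# Every entry is now strictly positive, so taking the log is well defined.
log_values = [math.log(v) for v in safe]
```

The point of such a floor is that downstream code, for example a log-likelihood, never sees zero, negative, or NaN densities.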

- Python
Published by jonas-eschle 11 months ago

Major Features and Improvements

asymptotic uncertainties for weighted fits have been optimized and fixed, working for yields and constraints correctly

Breaking changes

renamed effsize weight correction to actual sumw2

Bug fixes and small changes

Allow BinnedSamplerData to be instantiated from a histogram and fix variance handling if not given.
Enhance the precision of binned loss functions
make adaptive bandwidth default in KDE

- Python
Published by jonas-eschle about 1 year ago

Major Features and Improvements

New weighted corrections for Hesse uncertainty calculation. The corrections are now selectable by specifying weightcorr=..., allowing for (currently) three values: "asymptotic" (current, and default), False (no correction), and "effsize" (new correction). effsize scales the uncertainties by the "effective size" of the dataset, a significantly faster, yet not asymptotically correct method. Useful for a quick estimate of the uncertainties.
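As a rough illustration of the "effective size" idea, the sketch below computes the Kish effective sample size, n_eff = (sum of w)^2 / (sum of w^2), and a factor by which an uncorrected uncertainty would grow if the data only carried n_eff events' worth of information. Whether zfit applies exactly this scaling is an assumption; the function names are hypothetical and the code uses only the standard library.

```python
import math


def effective_size(weights):
    """Kish effective sample size: (sum of weights)^2 / (sum of squared weights)."""
    sum_w = sum(weights)
    sum_w2 = sum(w * w for w in weights)
    return sum_w * sum_w / sum_w2


def uncertainty_scale(weights):
    """Factor applied to a naive (uncorrected) uncertainty.

    Assumption for illustration: the naive covariance scales like 1 / sum(w),
    while the data only carry n_eff events' worth of information, so the
    uncertainty grows by sqrt(sum(w) / n_eff).
    """
    return math.sqrt(sum(weights) / effective_size(weights))


unit_weights = [1.0] * 100
mixed_weights = [0.5] * 50 + [2.0] * 50

n_eff_unit = effective_size(unit_weights)    # equals the number of entries
n_eff_mixed = effective_size(mixed_weights)  # smaller than the number of entries
scale_mixed = uncertainty_scale(mixed_weights)
```

With unit weights nothing changes, while strongly varying weights reduce the effective size and inflate the estimated uncertainty.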

Bug fixes and small changes

allow the usage of ZfitData objects (binned and unbinned version respectively) in update_data method of sampler objects zfit.data.SamplerData and zfit.data.BinnedSamplerData (for example, the objects that are returned by create_sampler)
add an attribute num_entries to the Data object that returns the number of entries in the data and an attribute samplesize reflecting the sum of the weights, deprecating the ambiguous nevents attribute. The former can be used to define the array shape, the latter matters in statistical tests.
The EDM calculation was optimized; additionally, it would error if the matrix was singular. The behavior is now changed to return 999'999 in this case. This could be reconsidered in the future to provide a "best fit" instead.
A simple bug in the LevenbergMarquardt minimizer was fixed that would error due to a wrong return shape of an internal result.

- Python
Published by jonas-eschle over 1 year ago

zfit - Relax TF and TFP version

Relax TF requirement to >= 2.16 (instead of 2.18) and equivalent TensorFlow-Probability to >= 0.24 (instead of 0.25)

- Python
Published by jonas-eschle over 1 year ago

zfit - Minor fix in minuit

Fixes the case if no covariance is available from minuit

- Python
Published by jonas-eschle over 1 year ago

zfit - Upgrade to new TF version

0.24.0 (10 Nov. 2024)

Upgrade to new TF version

Requirement changes

TensorFlow ~2.18
TensorFlow Probability ~0.25

- Python
Published by jonas-eschle over 1 year ago

zfit - Minor fixes

retain the order of some internal parameters to increase full reproducibility
the minuit minimizer could have no covariance despite advertising one, leading to an error in some cases

- Python
Published by jonas-eschle over 1 year ago

zfit - New minimizers and a better arbitrary function loss

Major Features and Improvements

Minimizers can use the new SimpleLoss.from_any method that allows other libraries to hook into the minimization. For example, using zfit-physics, minimizers can directly minimize RooFit RooNllVar (as created by createNLL described here <https://root.cern.ch/doc/master/classRooAbsPdf.html#a24b1afec4fd149e08967eac4285800de>_).
Added the well performing LevenbergMarquardt minimizer, a new implementation of the Levenberg-Marquardt algorithm.
New BFGS minimizer implementation of Scipy, ScipyBFGS.
Reactivate a few minimizers: ScipyDogleg, ScipyNCG, ScipyCOBYLA and ScipyNewtonCG

Breaking changes

removed multiple, old deprecated methods and arguments

Deprecations

use stepsize instead of step_size in the zfit.Parameter constructor

Bug fixes and small changes

add possibility to not jit by using force_eager in tf.function or raise a z.DoNotCompile error
SimpleLoss can now be added together with another SimpleLoss
get_params supports an autograd argument to filter parameters that do not support automatic differentiation. An object with parameters can advertise, which parameters are differentiable (with autograd_params); by default, all parameters are assumed to be differentiable, the same effect as True. If autograd is performed on parameters that do not support it, an error is raised.
Use Kahan sum for larger likelihoods by default to improve numerical stability
Using the same zfit.Parameter for multiple arguments (i.e. to specify a common width in a PDF with a different width for left and right) could cause a crash due to some internal caching. This is now fixed.
Minimizers have now been renamed without the trailing V1. The old names are still available but will be removed in the future.
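The Kahan (compensated) summation mentioned in the list above can be sketched in a few lines of plain Python. This is a generic illustration of the algorithm, not zfit's TensorFlow code; the function names are hypothetical.

```python
import math


def naive_sum(values):
    """Plain left-to-right floating point accumulation."""
    total = 0.0
    for value in values:
        total += value
    return total


def kahan_sum(values):
    """Sum floats while carrying a running compensation for rounding error."""
    total = 0.0
    compensation = 0.0  # low-order bits lost in previous additions
    for value in values:
        corrected = value - compensation
        new_total = total + corrected
        # (new_total - total) is what was actually added; its difference to
        # `corrected` is the rounding error of this step.
        compensation = (new_total - total) - corrected
        total = new_total
    return total


# Many small per-event terms, as in a large likelihood sum.
terms = [0.1] * 1_000_000
naive = naive_sum(terms)
compensated = kahan_sum(terms)
reference = math.fsum(terms)  # correctly rounded reference from the stdlib
```

The compensated result stays much closer to the correctly rounded reference than the naive loop, which is why the technique pays off for sums over many events.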

- Python
Published by jonas-eschle over 1 year ago

0.22.0 22 Aug 2024

Bug fixes and small changes

change the truncated PDF with a yield to reflect a dynamic change in shape

Requirement changes

Upgrade from Pydantic V1 to V2

- Python
Published by jonas-eschle almost 2 years ago

Bug fixes and small changes

the full argument for binned NLLs was not working properly and returned a partially optimized loss value.
jit compile all methods of the loss (gradient, hessian) to avoid recompilation every time. This can possibly speed up different minimizers significantly.

- Python
Published by jonas-eschle almost 2 years ago

zfit - Minor fixes and the JohnsonSU

0.21.0 (2 Jul 2024)

Major Features and Improvements

add JohnsonSU PDF, the Johnson SU distribution.

Bug fixes and small changes

increase reliability of zfit.dill.dump and zfit.dill.dumps with an additional verify argument that reloads the dumped object to verify it was correctly dumped and retries if it wasn't.
fix missing imported namespaces
fixed a memory leak when creating multiple parameters
add data loaders to zfit.data namespace
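The verify-and-retry idea behind the new verify argument can be sketched with the standard library. This is only an illustration of the pattern: the helper dumps_verified is hypothetical, pickle stands in for dill, and comparing the reloaded object by equality is an assumption about what "verify" means.

```python
import pickle


def dumps_verified(obj, max_retries=3):
    """Serialize obj, reload it, and retry if the round trip does not match.

    Hypothetical helper: pickle and an equality check are stand-ins for the
    dill-based dumping and whatever check zfit performs internally.
    """
    last_error = None
    for _ in range(max_retries):
        try:
            payload = pickle.dumps(obj)
            if pickle.loads(payload) == obj:
                return payload
            last_error = ValueError("reloaded object differs from the original")
        except pickle.PicklingError as error:
            last_error = error
    raise RuntimeError(f"could not dump object after {max_retries} attempts") from last_error


fit_summary = {"params": {"mu": 1.2, "sigma": 0.4}, "valid": True}
payload = dumps_verified(fit_summary)
restored = pickle.loads(payload)
```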

Thanks

Davide Lancierini for finding and helping to debug the dill dumping issue
James Herd for finding and reproducing the memory leak

- Python
Published by jonas-eschle almost 2 years ago

Bug fixes and small changes

enhanced loss: simple loss can take a gradient and hesse function and the default base loss provides fallbacks that work correctly between value_gradient and gradient. This may matter if you've implemented a custom loss and should fix any issues with it.
multiprocessing would get stuck due to an upstream bug in TensorFlow <https://github.com/tensorflow/tensorflow/issues/66115>_. Working around it by disabling an unused piece of code.

Thanks

acampoverde for finding the bug in the multiprocessing

- Python
Published by jonas-eschle about 2 years ago

0.20.2 (16 Apr 2024)

Two small bugfixes:

fix backwards incompatible change of sampler
detect if a RegularBinning has been transformed, raise error

- Python
Published by jonas-eschle about 2 years ago

zfit - Minor fixes for stability

For a guide to the 0.20 changes, see here

0.20.1 (14 Apr 2024)

Major Features and Improvements

fix dumping and add convenience wrapper zfit.dill to dump and load objects with dill (a more powerful pickle). This way, any zfit object can be saved and loaded, such as FitResult that contains all other important objects to recreate the fit.
improved performance for numerical gradient calculation, fixing also a minor numerical issue.

Bug fixes and small changes

running binned fits without a graph could deadlock, fixed.

- Python
Published by jonas-eschle about 2 years ago

zfit - Usability overhaul and a ton of new PDFs

0.20.0 (12 Apr 2024)

Complete overhaul of zfit with a focus on usability and a variety of new pdfs!

Major Features and Improvements

Parameter behavior has changed, multiple parameters with the same name can now coexist! The NameAlreadyTakenError has been successfully removed (yay!). The new behavior only enforces that names and matching parameters within a function/PDF/loss are unique, as otherwise inconsistent expectations appear (for the full discussion on this, see here <https://github.com/zfit/zfit/discussions/342>_).
Space and limits have a complete overhaul in front of them, in short, these overcomplicated objects get simplified and the limits become more usable, in terms of dimensions. The full discussion and changes can be found here <https://github.com/zfit/zfit/discussions/533>_ .
add an unbinned Sampler to the public namespace under zfit.data.Sampler: this object is returned in the create_sampler method and allows to resample from a function without recreating the compiled function, i.e. loss. It has an additional method update_data to update the data without recompiling the loss and can be created from a sample only. Useful to have a custom dataset in toys.
allow to use pandas DataFrame as input where zfit Data objects are expected
Methods of PDFs and loss functions that depend on parameters take now the value of a parameter explicitly as arguments, as a mapping of str (parameter name) to value.
Python 3.12 support
add GeneralizedCB PDF which is similar to the DoubleCB PDF but with different standard deviations for the left and right side.
Added functor for PDF caching CachedPDF: pdf, integrate PDF methods can be cacheable now
add faddeeva_humlicek function under the zfit.z.numpy namespace. This is an implementation of the Faddeeva function, combining Humlicek's rational approximations according to Humlicek (JQSRT, 1979) and Humlicek (JQSRT, 1982).
add Voigt profile PDF which is a convolution of a Gaussian and a Cauchy distribution.
add TruncatedPDF that allows to truncate in one or multiple ranges (replaces "MultipleLimits" and "MultiSpace")
add LogNormal PDF, a log-normal distribution, which is a normal distribution of the logarithm of the variable.
add ChiSquared PDF, the standard chi2 distribution, taken from tensorflow-probability implementation <https://www.tensorflow.org/probability/api_docs/python/tfp/distributions/Chi2>_.
add StudentT PDF, the standard Student's t distribution, taken from tensorflow-probability implementation <https://www.tensorflow.org/probability/api_docs/python/tfp/distributions/StudentT>_.
add GaussExpTail and GeneralizedGaussExpTail PDFs, which are a Gaussian with an exponential tail on one side and a Gaussian with different sigmas on each side and different exponential tails on each side respectively.
add QGauss PDF, a distribution that arises from the maximization of the Tsallis entropy under appropriate constraints, see here <https://en.wikipedia.org/wiki/Q-Gaussian_distribution>_.
add BifurGauss PDF, a Gaussian distribution with different sigmas on each side of the mean.
add Bernstein PDF, which is a PDF defined by a linear combination of Bernstein polynomials given their coefficients.
add Gamma PDF, the Gamma distribution.
Data has now a with_weights method that returns a new data object with different weights and an improved with_obs that allows to set obs with new limits. These replace the set_weights and set_data_range methods for a more functional approach.
add label to different objects (PDF, Data, etc.) that allows to give a human-readable name to the object. This is used in the plotting and can be used to identify objects. Notably, Parameters have a label that can be arbitrary. Space has one label for each observable if the space is a product of spaces. Space.label is a string and only possible for one-dimensional spaces, while Space.labels is a list of strings and can be used for any, one- or multi-dimensional spaces.
add zfit.data.concat(...) to concatenate multiple data objects into one along the index or along the observables. Similar to pd.concat.
PDFs now have a to_truncated method that allows to create a truncated version of the PDF, possibly with different and multiple limits. This allows to easily create a PDF with disjoint limits.
Data and PDF that take obs in the initialization can now also take binned observables, i.e. a zfit.Space with binning=... and will return a binned version of the object (zfit.data.BinnedData or zfit.pdf.BinnedFromUnbinned, where the latter is a generic wrapper). This is equivalent of calling to_binned on the objects)
zfit.Data can be instantiated directly with most data types, such as numpy arrays, pandas DataFrames etc. instead of using the dedicated constructors from_numpy, from_pandas etc. The constructors may still provide additional functionality, but overall, the switch should be seamless.
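To make the idea behind TruncatedPDF and to_truncated in the list above concrete, here is a minimal sketch of a Gaussian restricted to several disjoint ranges and renormalized over them. It uses only the standard library; all function and argument names are hypothetical and none of this is zfit code.

```python
import math


def gauss_pdf(x, mu=0.0, sigma=1.0):
    """Density of a normal distribution."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def gauss_cdf(x, mu=0.0, sigma=1.0):
    """Cumulative distribution of a normal distribution."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def truncated_gauss_pdf(x, ranges, mu=0.0, sigma=1.0):
    """Gaussian restricted to disjoint (low, high) ranges and renormalized.

    Illustration only; the function and argument names are hypothetical.
    """
    inside = any(low <= x <= high for low, high in ranges)
    if not inside:
        return 0.0
    norm = sum(gauss_cdf(high, mu, sigma) - gauss_cdf(low, mu, sigma)
               for low, high in ranges)
    return gauss_pdf(x, mu, sigma) / norm


def integrate(func, low, high, steps=20_000):
    """Midpoint-rule integral, good enough to check the normalization."""
    width = (high - low) / steps
    return width * sum(func(low + (i + 0.5) * width) for i in range(steps))


# Two sidebands around an excluded central region.
sidebands = [(-3.0, -1.0), (1.0, 3.0)]
total = sum(integrate(lambda x: truncated_gauss_pdf(x, sidebands), low, high)
            for low, high in sidebands)
```

The density is zero in the excluded region and integrates to one over the union of the ranges, which is the property a PDF with disjoint limits needs.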

Breaking changes

This release contains multiple "breaking changes"; however, the vast majority, if not all, apply only to edge cases and undocumented functions.

a few arguments are now keyword-only arguments. This can break existing code if the arguments were given as positional arguments. Just use the appropriate keyword arguments instead. (Example: instead of using zfit.Space(obs, limits) use zfit.Space(obs, limits=limits)). This was introduced to make the API more robust and to avoid errors due to the order of arguments, with a few new ways of creating objects.
Data.from_root: deprecated arguments branches and branch_aliases have been removed. Use obs and obs_aliases instead.
NameAlreadyTakenError was removed, see above for the new behavior. This should not have an effect on any existing code except if you relied on the error being thrown.
Data objects had an intrinsic, TensorFlow V1 legacy behavior: they were actually cut when the data was retrieved. This is now changed and the data is cut when it is created. This should not have any impact on existing code and just improve runtime and memory usage.
Partial integration used to use some broadcasting tricks that could potentially fail. It uses now a dynamic while loop that could be slower but works for arbitrary PDFs. This should not have any impact on existing code and just improve stability (but technically, the data given to the PDF if doing partial integration is now "different", in the sense that it's now not different anymore from any other call)
if a tf.Variable was used to store the number of sampled values in a sampler, it was possible to change the value of that variable to change the number of samples drawn. This is no longer possible and the number of samples should be given as an argument n to the resample method, as has been possible for a long time.
create_sampler has a breaking change for fixed_params: when the argument was set to False, any change in the parameters would be reflected when resampling. This highly state-based behavior was confusing and is now removed. The argument is now called params and behaves as expected: the sampler will remember the parameters at the time of creation, possibly updated with params, and will not change anymore. To sample from a different set of parameters, the params have to be passed to the resample method explicitly.
the default names of the uncertainty methods hesse and errors are now hesse and errors, respectively, independent of the method used. This was deprecated a while ago and both names were available for backwards compatibility. The old names are now removed. If you get an error that minuit_hesse or minuit_minos was not found, just replace it with hesse and errors.
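The keyword-only change listed above relies on a standard Python mechanism: everything after a bare * in a signature must be passed by name. The sketch below uses a hypothetical make_space function as a stand-in; it is not the zfit.Space constructor.

```python
def make_space(obs, *, limits=None, binning=None):
    """Hypothetical stand-in for a constructor with keyword-only arguments.

    Everything after the bare `*` must be passed by name.
    """
    return {"obs": obs, "limits": limits, "binning": binning}


# Passing by keyword works and stays correct if new arguments are added later.
space = make_space("x", limits=(-5, 5))

# Passing positionally fails loudly instead of binding to the wrong argument.
try:
    make_space("x", (-5, 5))
except TypeError as error:
    positional_error = str(error)
else:
    positional_error = None
```

Code that breaks on upgrade will therefore raise a TypeError at the call site, and the fix is to add the argument name.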

Deprecations

result.fminfull is deprecated and will be removed in the future. Use result.fmin instead.
Data.set_data_range is deprecated and will be removed in the future. Use with_range instead.
Space has many deprecated methods, such as rect_limits and quite a few more. The full discussion can be found here <https://github.com/zfit/zfit/discussions/533>_.
fixed_params in create_sampler is deprecated and will be removed in the future. Use params instead.
fixed_params attribute of the Sampler is deprecated and will be removed in the future. Use params instead.
uncertainties in GaussianConstraint is deprecated and will be removed in the future. Use either explicitly sigma or cov.
the ComposedParameter and ComplexParameter argument value_fn is deprecated in favor of the new argument func. Identical behavior.
zfit.run(...) is deprecated and will be removed in the future. Simply removing it should work in most cases (if an explicit numpy cast, not just array-like, is needed, use np.asarray(...), but usually this is not needed). This function is an old relic from the past TensorFlow 1.x, tf.Session times and is not needed anymore. We all remember well these days :)

Bug fixes and small changes

complete overhaul of partial integration that used some broadcasting tricks that could potentially fail. It uses now a dynamic while loop that could be slower but works for arbitrary PDFs and no problems should be encountered anymore.
FitResult can now be used as a context manager, which will automatically set the values of the parameters to the best fit values and reset them to the original values after the context is left. A new method update_params allows to update the parameters with the best fit values explicitly.
result.fmin now returns the full likelihood, while result.fminopt returns the optimized likelihood with potential constant subtraction. The latter is mostly used by the minimizer and other libraries. This behavior is consistent with the behavior of other methods in the loss that return by default the full, unoptimized value.
serialization only allowed for one specific limit (space) of each obs. Multiple, independent limits can now be serialized.
Increased numerical stability: this was compromised due to some involuntary float32 conversions in TF. This has been fixed.
arguments sigma and cov are now used in GaussianConstraint, both mutually exclusive, to ensure the intent is clear.
improved hashing and precompilation in loss, works now safely also with samplers.
seed setting is by default completely randomized. This is a change from the previous behavior where the seed was set to a more deterministic value. Use seeds only for reproducibility and not for real randomness, as some strange correlations between seeds have been observed. To guarantee full randomness, just call zfit.run.set_seed() without arguments.
zfit.run.set_seed now returns the seed that was set. This is useful for reproducibility.
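The difference between a full loss value (as in result.fmin) and an optimized value with a constant subtracted (as in result.fminopt) can be illustrated with a toy Gaussian likelihood. This is a sketch under assumptions: the choice of constant (the full value at a reference point) and all names are hypothetical, and zfit's internal choice may differ.

```python
import math


def nll_full(data, mu, sigma):
    """Full negative log-likelihood of a Gaussian model."""
    log_norm = math.log(sigma * math.sqrt(2.0 * math.pi))
    return sum(0.5 * ((x - mu) / sigma) ** 2 + log_norm for x in data)


def make_nll_optimized(data, mu_ref, sigma_ref):
    """Return an NLL shifted by a constant, plus that constant.

    Assumption for illustration: the constant is the full NLL at a reference
    point, which keeps the numbers a minimizer works with close to zero.
    """
    offset = nll_full(data, mu_ref, sigma_ref)

    def nll_optimized(mu, sigma):
        return nll_full(data, mu, sigma) - offset

    return nll_optimized, offset


data = [0.8, 1.1, 1.3, 0.9, 1.4, 1.0]
nll_opt, offset = make_nll_optimized(data, mu_ref=1.0, sigma_ref=0.2)

# Differences between two parameter points are identical for both versions,
# so the minimum is the same; only the absolute value differs.
delta_full = nll_full(data, 1.1, 0.2) - nll_full(data, 1.0, 0.2)
delta_opt = nll_opt(1.1, 0.2) - nll_opt(1.0, 0.2)
```

A minimizer only needs differences, so the shifted value is enough for it, while statistical tests that compare absolute likelihood values need the full one.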

Experimental

a simple plot mechanism has been added with pdf.plot.plotpdf to plot PDFs. This is simple and fully interacts with matplotlib, allowing to plot quickly in a more interactive way.
zfit.run.experimental_disable_param_update: this is an experimental feature that allows to disable the parameter update in a fit as is currently done whenever minimize is called. In conjunction with the new method update_params(), this can be used as result = minimizer.minimize(...).update_params() to keep the same behavior as currently. Alternatively, the context manager of FitResult can be used to achieve the same behavior (with minimizer.minimize(...) as result: ...).
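The set-then-restore behavior of the FitResult context manager described in these notes follows a common Python pattern. The sketch below shows that pattern with a toy parameter class; ToyParameter and best_fit_values are hypothetical stand-ins, not zfit objects.

```python
from contextlib import contextmanager


class ToyParameter:
    """Minimal stand-in for a fit parameter (hypothetical, not zfit.Parameter)."""

    def __init__(self, name, value):
        self.name = name
        self.value = value


@contextmanager
def best_fit_values(params, best_fit):
    """Temporarily set params to their best-fit values, then restore them."""
    original = {param.name: param.value for param in params}
    try:
        for param in params:
            param.value = best_fit[param.name]
        yield params
    finally:
        # Runs even if the body raises, so the old values always come back.
        for param in params:
            param.value = original[param.name]


mu = ToyParameter("mu", 0.0)
sigma = ToyParameter("sigma", 1.0)
fitted = {"mu": 1.23, "sigma": 0.45}

with best_fit_values([mu, sigma], fitted):
    inside = (mu.value, sigma.value)
outside = (mu.value, sigma.value)
```

Inside the block the parameters carry the fitted values; after it they are back at their previous state, so code outside the block is unaffected.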

Requirement changes

upgrade to TensorFlow 2.16 and TensorFlow Probability 0.24

Thanks

huge thanks to @iasonkrommydas for the addition of various PDFs and to welcome him on board as a new contributor!
@anjabeck for the addition of the ChiSquared PDF

- Python
Published by jonas-eschle about 2 years ago

zfit - Add missing dep

Hotfix release to add missing deps attrs

- Python
Published by jonas-eschle over 2 years ago

zfit - Bug fixes in randomness and improved caching

0.18.1 (22 Feb 2024)

Major Features and Improvements

reduced the number of graph caching resets, resulting in significant speedups in some cases

Bug fixes and small changes

use random generated seeds for numpy and TF, as they can otherwise have unwanted correlations

Thanks

@anjabeck for the bug report and the idea to use random seeds for numpy and TF
@acampoverde for reporting the caching issue

- Python
Published by jonas-eschle over 2 years ago

zfit - TF 2.15 upgrade, drop Python 3.8

Upgrade to TensorFlow 2.15 (from 2.12/2.13 previously) and TensorFlow-Probability 0.23
Drop Python 3.8 support

- Python
Published by jonas-eschle over 2 years ago

zfit - Fixes in uncertainty corrections

0.17.0

Major Features and Improvements

add correct uncertainty for unbinned, weighted fits with constraints and/or that are extended.
allow mapping in zfit.param.set_values for values

Bug fixes and small changes

fix issues where EDM goes negative, set to 999
improved stability of the loss evaluation
improved uncertainty calculation accuracy with zfit_error

Thanks

Daniel Craik for the idea of allowing a mapping in set_values

- Python
Published by jonas-eschle over 2 years ago

zfit - 0.16.0 Minor fixes and improvements

0.16.0 (26 July 2023)

Major Features and Improvements

add full option to loss call of value, which returns the unoptimized value allowing for easier statistical tests using the loss. This is the default behavior and should not break any backwards compatibility, as the "not full loss" was arbitrary.
changed the FitResult to print both loss values, the unoptimized (full) and optimized (internal)

Bug fixes and small changes

bandwidth preprocessing was ignored in KDE
unstack_x with an obs as argument did return the wrong shape

Thanks

@schmitse for reporting the bug in the KDE bandwidth preprocessing
@lorenzopaolucci for bringing up the absolute value of the loss in the fitresult as an issue

- Python
Published by jonas-eschle over 2 years ago

zfit - Minor fixes for binned fits

0.15.5

fix a bug in histmodifier that would not properly take into account the yield of the wrapped PDF

- Python
Published by jonas-eschle almost 3 years ago

0.15.2 (20 July 2023)

Fix missing attrs dependency

Major Features and Improvements

add option full in loss to return the full, unoptimized value (currently not default), allowing for easier statistical tests using the loss

- Python
Published by jonas-eschle almost 3 years ago

Requirement changes

TensorFlow upgraded to ~=2.13.0
as TF 2.13.0 ships with the arm64 macos wheels, the requirement of tensorflow_macos is removed

Thanks

Iason Krommydas for helping with the macos requirements for TF

- Python
Published by jonas-eschle almost 3 years ago

zfit - Pin pydantic dependency to < V2

Major Features and Improvements

zfit broke with the upgrade to pydantic 2.

Requirement changes

restrict pydantic to <2.0.0

- Python
Published by jonas-eschle almost 3 years ago

What's Changed

Python 3.11 and TF 2.12 by @jonas-eschle in https://github.com/zfit/zfit/pull/462
fix param caching by @jonas-eschle in https://github.com/zfit/zfit/pull/472

- Python
Published by jonas-eschle almost 3 years ago

zfit - Minor fixes in CB and caching

0.13.2 (15. June 2023)

Bug fixes and small changes

fix a caching problem with parameters (could cause issues with larger PDFs as params would be "remembered" wrongly)
more helpful error message when jacobian (as used for weighted corrections) is analytically asked but fails
make analytical gradient for CB integral work

- Python
Published by jonas-eschle about 3 years ago

0.13.1 (20 Apr 2023)

Bug fixes and small changes

array bandwidth for KDE works now correctly

Requirement changes

fixed uproot for Python 3.7 to <5

Thanks

@schmitse for reporting and solving the bug in the KDE bandwidth with arrays

- Python
Published by jonas-eschle about 3 years ago

Version 0.13.0

Major Features and Improvements

last Python 3.7 version

Bug fixes and small changes

SampleData is not used anymore, a Data object is returned (for simple sampling). The create_sampler will still return a SamplerData object though as this differs from Data.

Experimental

Added best-effort support for human-readable serialization of objects including an HS3-like representation, find a tutorial on serialization here. Most built-in unbinned PDFs are supported. This is still experimental and not yet fully supported. Dumping can be performed safely, loading may easily break (also between versions), so do not rely on it yet. Everything else, apart from trying to dump, should only be used for playing around and for giving feedback.

Requirement changes

allow uproot 5 (remove previous restriction)

Thanks

to Johannes Lade for the amazing work on the serialization, which made this HS3 implementation possible!

- Python
Published by jonas-eschle about 3 years ago

zfit - Reproducibility fix

Fix some issues with setting seed globally and locally, fits should now be reproducible.

For all changes, see the changelog

Thanks to @schmitse for discovering the bug

- Python
Published by jonas-eschle about 3 years ago

zfit - Binned and sampling fixes

Many smaller fixes that are crucial, most notably to avoid a bias in sampling.

Bug fixes and small changes

create_extended added None to the name, removed.
SimpleConstraint now also takes a function that has an explicit params argument.
add name argument to create_extended.
adding binned losses would error due to the removed fit_range argument.
setting a global seed made the sampler return constant values, fixed (unoptimized but correct). If you ran a fit with a global seed, you might want to rerun it.
histogramming and limit checks failed due to a stricter Numpy check, fixed.

Thanks

@P-H-Wagner for finding the bug in SimpleConstraint.
Dan Johnson for finding the bug in the binned loss that would fail to sum them up.
Hanae Tilquin for spotting the bug with TensorFlow's changed behavior of random states inside a tf.function, leading to biased samples whenever a global seed was set.

- Python
Published by jonas-eschle over 3 years ago

zfit - Reduced memory consumption and convenience functions

Major Features and Improvements

reduce the memory footprint on (some) fits, especially repetitive (loops) ones. Reduces the number of cached compiled functions. The cachesize can be set with zfit.run.set_cache_size(int) and specifies the number of compiled functions that are kept in memory. The default is 10, but this can be tuned. Lower values can reduce memory usage, but potentially increase runtime.

Bug fixes and small changes

Enable uniform binning for n-dimensional distributions with integer(s).
Sum of histograms failed when calling the pdf method (possibly indirectly), as it integrated over the wrong axis.
Binned PDFs expected binned spaces for limits, now unbinned limits are also allowed and automatically converted to binned limits using the PDFs binning.
Speedup sampling of binned distributions.
add to_binned and to_unbinned methods to PDF

Thanks

Justin Skorupa for finding the bug in the sum of histograms and the missing automatic conversion of unbinned spaces to binned spaces.

- Python
Published by jonas-eschle almost 4 years ago

zfit - Binned and mixed fits

Public release of binned fits and upgrade to Python 3.10 and TensorFlow 2.9.

While binned fits are now supported, they still lack some stability, public testing and features. Feedback is, as usual, very welcome!

Major Features and Improvements

improved data handling in constructors from_pandas (which now allows weights as columns and dataframes that are a superset of the obs) and from_root (obs can now be spaces and therefore cuts can be directly applied)
add hashing of unbinned datasets with a hashint attribute. None if no hash was possible.

Bug fixes and small changes

SimpleLoss correctly supports both functions with implicit and explicit parameters, also if they are decorated.
extended sampling errored for some cases of binned PDFs.
ConstantParameter errored when converted to numpy.
Simultaneous binned fits could error with different binning due to a missing sum over a dimension.
improved stability in loss evaluation of constraints and poisson/chi2 loss.
reduce gradient evaluation time in errors for many parameters.
Speedup Parameter value assignment in fits, which is most notable when the parameter update time is comparably large to the fit evaluation time, such as is the case for binned fits with many nuisance parameters.
fix ipyopt was not pickleable in a fitresult
treat parameters sometimes as "stateless", possibly reducing the number of retraces and reducing the memory footprint.

Requirement changes

nlopt and ipyopt are now optional dependencies.
Python 3.10 added
TensorFlow ~= 2.9.0 is now required and the corresponding TensorFlow-Probability version ~= 0.17.0

Thanks

@YaniBion for discovering the bug in the extended sampling and testing the alpha release
@ResStump for reporting the bug with the simultaneous binned fit

- Python
Published by jonas-eschle almost 4 years ago

Major Features and Improvements

Save results by pickling, unpickling a frozen (FitResult.freeze()) result and using zfit.param.set_values(params, result) to set the values of params.

Deprecations

the default name of the uncertainty methods hesse and errors depended on the method used (such as 'minuithesse', 'zfiterrors' etc.) and would be the exact method name. New names are now 'hesse' and 'errors', independent of the method used. This reflects better that the methods, while internally different, produce the same result. To update, use 'hesse' instead of 'minuithesse' or 'hessenp' and 'errors' instead of 'zfiterrors' or 'minuitminos' in order to access the uncertainties in the fitresult. Currently, the old names are still available for backwards compatibility. If a name was explicitly chosen in the error method, nothing changed.

Bug fixes and small changes

KDE datasets are now correctly mirrored around observable space limits
multinomial sampling would return wrong results when invoked multiple times in graph mode due to a non-dynamic shape. This is fixed and the sampling is now working as expected.
increase precision in FitResult string representation and add that the value is rounded

Thanks

schmitse for finding and fixing a mirroring bug in the KDEs
Sebastian Bysiak for finding a bug in the multinomial sampling

- Python
Published by jonas-eschle over 4 years ago

0.8.2 (20 Sep 2021)

Bug fixes and small changes

fixed a longstanding bug in the DoubleCB implementation of the integral.
remove outdated deprecations

- Python
Published by jonas-eschle over 4 years ago

zfit - KDEs for large data and numerical integration

Kernel Density Estimation in 1 dimension for large data sets.

Overview on KDEs

Introduction tutorial notebook

Major Features and Improvements

allow FitResult to freeze(), making it pickleable. The parameters are replaced by their name, the objects such as loss and minimizer as well.
improve the numerical integration by adding a one dimensional efficient integrator, testing for the accuracy of multidimensional integrals. If there is a sharp peak, this may fail to integrate and the number of points has to be raised manually
add highly performant kernel density estimation (mainly contributed by Marc Steiner) in 1 dimension, which allows for the choice of arbitrary kernels, supports boundary mirroring of the data and allows for large numbers (millions) of data samples:
- zfit.pdf.KDE1DimExact for the normal density estimation
- zfit.pdf.KDE1DimGrid using a binning
- zfit.pdf.KDE1DimFFT using a binning and FFT
- zfit.pdf.KDE1DimISJ using a binning and an algorithm (ISJ) to solve the optimal bandwidth

For an introduction, see either :ref:sec-kernel-density-estimation or the tutorial :ref:sec-components-model

add windows in CI
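The boundary mirroring mentioned for the new KDEs above can be illustrated with a small pure-Python Gaussian KDE. This is a generic sketch of the technique under simplifying assumptions (all data mirrored at both limits, fixed bandwidth); the function names are hypothetical and it is not how the zfit classes are implemented.

```python
import math


def gauss_kernel(u):
    """Standard normal kernel."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)


def kde(x, data, bandwidth):
    """Plain Gaussian kernel density estimate at x."""
    return sum(gauss_kernel((x - d) / bandwidth) for d in data) / (len(data) * bandwidth)


def kde_mirrored(x, data, bandwidth, low, high):
    """KDE with the data reflected at both boundaries of [low, high].

    The reflected copies give back the density that the kernels would
    otherwise leak outside the limits. Illustration only, not zfit code.
    """
    if not low <= x <= high:
        return 0.0
    mirrored = list(data) + [2 * low - d for d in data] + [2 * high - d for d in data]
    # Normalize by the number of original points, not the mirrored ones.
    return sum(gauss_kernel((x - d) / bandwidth) for d in mirrored) / (len(data) * bandwidth)


# Evenly spread points on [0, 1]: the true density is flat and equal to 1.
data = [(i + 0.5) / 200 for i in range(200)]
plain_at_edge = kde(0.0, data, bandwidth=0.1)
mirrored_at_edge = kde_mirrored(0.0, data, bandwidth=0.1, low=0.0, high=1.0)
mirrored_at_center = kde_mirrored(0.5, data, bandwidth=0.1, low=0.0, high=1.0)
```

Without mirroring the estimate at the boundary drops to roughly half of the true density, because half of each kernel lies outside the limits; with mirroring it stays close to the flat value.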

Breaking changes

the numerical integration improved with more sensible values for tolerance. This means however that some fits will greatly increase the runtime. To restore the old behavior globally, do for each instance pdf.update_integration_options(draws_per_dim=40_000, max_draws=40_000, tol=1). This will integrate regardless of the chosen precision and it may be non-optimal. However, the precision estimate in the integrator is also not perfect and may overestimate the error, so that the integration by default takes longer than necessary. Feel free to play around with the parameters and report back.

Bug fixes and small changes

Double Crystal Ball: move a minus sign down, vectorize the integral, fix wrong output shape of pdf
add a minimal value in the loss to avoid NaNs when taking the log of 0
improve feedback when taking the derivative with respect to a parameter that a function does not depend on or if the function is purely Python.
make parameters deletable, especially it works now to create parameters in a function only and no NameAlreadyTakenError will be thrown.

Requirement changes

add TensorFlow 2.6 support (now 2.5 and 2.6 are supported)

Thanks

Marc Steiner for contributing many new KDE methods!

- Python
Published by jonas-eschle almost 5 years ago

zfit - Python 3.9 and TF 2.5 support

Major Features and Improvements

add Python 3.9 support
upgrade to TensorFlow 2.5