Recent Releases of dbscan

dbscan - dbscan 1.2.2

dbscan 1.2.2 (2025-01-24)

Changes

  • Removed dependence on the /bits/stdc++.h header.

dbscan 1.2.1 (2025-01-23)

Changes

  • Various refactoring by m-muecke

New Features

  • HDBSCAN gained parameter clusterselectionepsilon to implement clusters selected from Malzer and Baum (2020).
  • Functions ncluster() and nnoise() were added.
  • hullplot now() marks noise as x.
  • Added clplot().
  • pointdensity now also accepts a dist object as input and has the new type "gaussian" to calculate a Gaussian kernel estimate.
  • Added the DBCV index.

Bugfixes

  • extractFOCS: Fixed total_score.
  • Rewrote minimal spanning tree code.

- C++
Published by mhahsler over 1 year ago

dbscan -

New Features

  • dbscan has now tidymodels tidiers (glance, tidy, augment).
  • kNNdistplot can now plot a range of k/minPts values.
  • added stats::nobs methods for the clusterings.
  • kNN and frNN now contains the used distance metric.

Changes

  • dbscan component dist was renamed to metric.
  • Removed redundant sort in kNNdistplot (reported by Natasza Szczypien).
  • Refactoring use more performant anyNA(x) instead of any(is.na(x)) and many more (by m-muecke).
  • Reorganized the C++ source code.
  • README now uses bibtex.
  • Tests use now testthat edition 3 (m-muecke).

- C++
Published by mhahsler almost 2 years ago

dbscan -

New Features

  • is.corepoint() for DBSCAN.
  • coredist() and mrdist() for HDBSCAN.
  • find connected components with comps().

Changes

  • reachability plot now shows all undefined distances as a dashed line.

Bugfix

  • memory leak in mrd calculation fixed.

- C++
Published by mhahsler over 4 years ago

dbscan -

dbscan 1.1-9 (2022-01-10)

Changes

  • We use now roxygen2.

New Features

  • Added predict for hdbscan (as suggested by moredatapls)

- C++
Published by mhahsler over 4 years ago

dbscan -

dbscan 1.1-8 (2021-04-26)

Bugfixes

  • LOF: fixed numerical issues with k-nearest neighbor distance on Solaris.

dbscan 1.1-7 (2021-04-21)

Bugfixes

  • Fixed description of k in knndistplot and added minPts argument.
  • Fixed bug for tied distances in lof (reported by sverchkov).

Changes

  • lof: the density parameter was changes to minPts to be consistent with the original paper and dbscan. Note that minPts = k + 1.

- C++
Published by mhahsler about 5 years ago

dbscan -

Improvements

  • Improved speed of LOF for large ks (following suggestions by eduardokapp).
  • kNN: results is now not sorted again for kd-tree queries which is much faster (by a factor of 10).
  • ANN library: annclose() is now only called once when the package is unloaded. This is in preparation to support persistent kd-trees using external pointers.
  • hdbscan lost parameter xdist.

Bugfixes

  • removed dependence on methods.
  • fixed problem in hullplot for singleton clusters (reported by Fernando Archuby).
  • GLOSH now also accepts data.frames.
  • GLOSH returns now 0 instead of NaN if we have k duplicate points in the data.

- C++
Published by mhahsler over 5 years ago

dbscan -

New Features

  • kNN and frNN gained parameter query to query neighbors for points not in the data.
  • sNN gained parameter jp to decide if the shared NN should be counted using the definition by Jarvis and Patrick.

- C++
Published by mhahsler over 6 years ago

dbscan -

Bugfixes

  • kNNdist now correctly returns the distances to the kth neighbor (reported by zschuster).
  • dbscan: check eps and minPts parameters to avoid undefined results (reported by ArthurPERE).

New Features

  • kNNdist gained parameter all to indicate if a matrix with the distance to all nearest neighbors up to k should be returned.

- C++
Published by mhahsler almost 7 years ago

dbscan -

Bugfix

  • pointdensity was double counting the query point (reported by Marius Hofert).

- C++
Published by mhahsler over 7 years ago

dbscan -

New Features

  • OPTICS now calculates eps if it is omitted.

Bugfix

  • Example now only uses igraph conditionally since it is unavailable on Solaris (reported by B. Ripley).

- C++
Published by mhahsler about 8 years ago

dbscan - dbscan_1.1-0

New Features

  • HDBSCAN was added.
  • extractFOSC (optimal selection of clusters for HDBSCAN) was added.
  • GLOSH outlier score was added.
  • hullplot uses now filled polygons as the default.
  • hullplot now used PCA if the data has more than 2 dimensions.
  • Added NN superclass for kNN and frNN with plot and with adjacencylist().
  • Added shared nearest neighbor clustering as sNNclust() and sNN to calculate the number of shared nearest neighbors.
  • Added pointdensity function.
  • Unsorted kNN and frNN can now be sorted using sort().
  • kNN and frNN now also accept kNN and frNN objects, respectively. This can be used to create a new kNN (frNN) with a reduced k or eps.
  • Datasets added: DS3 and moon.

Interface Changes

  • Improved interface for dbscan() and optics(): ... it now passed on to frNN.
  • OPTICS clustering extraction methods are now called extractDBSCAN and extractXi.
  • kNN and frNN are now objects with a print function.
  • dbscan now also accepts a frNN object as input.
  • jpclust and sNNclust now return a list instead of just the cluster assignments.

- C++
Published by mhahsler about 9 years ago

dbscan - dbscan_1.0-0

New Features

  • The package has now a vignette.
  • Jarvis-Patrick clustering is now available as jpclust().
  • Improved interface for dbscan() and optics(): ... is now passed on to frNN.
  • OPTICS clustering extraction methods are now called extractDBSCAN and extractXi.
  • hullplot uses now filled polygons as the default.
  • hullplot now used PCA if the data has more than 2 dimensions.
  • kNN and frNN are now objects with a print function.
  • dbscan now also accepts a frNN object as input.

- C++
Published by mhahsler over 9 years ago

dbscan - dbscan_0.9-8

Changes in version 0.9-8 (2016-08-05)

  • OPTICS: added a predecessor correction step that is used by the ELKI implementation (Matt Piekenbrock).
  • Added hullplot to plot a scatter plot with added convex cluster hulls.
  • Fixed a memory problem in frNN (reported by Yilei He).

- C++
Published by mhahsler almost 10 years ago

dbscan -

Changes in version 0.9-7 (2016-04-14)

  • OPTICSXi is now implemented (thanks to Matt Piekenbrock).
  • DBSCAN now also accepts MinPts (with a capital M) to be compatible with the fpc version.
  • DBSCAN objects are now also of class db scan_fast to avoid clashes with fpc.
  • DBSCAN and OPTICS have now predict functions.
  • Added test for unhandled NAs.
  • Fixed LOF for more than k duplicate points (reported by Samneet Singh).

- C++
Published by mhahsler about 10 years ago

dbscan -

  • OPTICS: fixed second bug reported by Di Pang
  • all methods now also accept dist objects and have a search method "dist" which precomputes distances.

- C++
Published by mhahsler over 10 years ago

dbscan - CRAN Release 0.9-5

  • OPTICS: fixed bug with first observation reported by Di Pang
  • OPTICS: clusterings can now be extracted using optics_cut

- C++
Published by mhahsler over 10 years ago

dbscan - CRAN Release 0.9-4

  • added tests (testthat).
  • input data is now checked if it can safely be coerced into a numeric matrix (storage.mode double).
  • fixed self matches in kNN and frNN (now returns the first NN correctly).

- C++
Published by mhahsler over 10 years ago