Recent Releases of trace

trace - TRACE: Text Reuse Analysis and Comparison Engine

Revoke global initialization of the SentenceTransformer model to improve speed.

- Python
Published by kreeedit over 2 years ago

trace - TRACE: Text Reuse Analysis and Comparison Engine

TRACE is a simple Python script that measures the similarity between text files, even when they are multilingual or paraphrased, using two methods: MinHash and SentenceTransformer. You can specify the directory containing the text (.txt) files, the size of the text windows to compare, the step size by which each new window advances, and the similarity threshold. TRACE also writes a network graph of the text similarities (textsimilaritynetwork.gexf) that shows how the texts relate to one another. The analysis results are stored in a JSON file (result.json).
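The windowed MinHash comparison described above can be sketched roughly as follows. This is an illustrative, standard-library-only sketch, not TRACE's actual code: the function names (`windows`, `shingles`, `minhash_signature`, `estimated_jaccard`, `compare`), the 3-word shingle size, and the default parameter values are all assumptions, and the SentenceTransformer comparison is left out entirely.

```python
import hashlib
import re


def windows(words, size, step):
    """Yield word windows of length `size`, advancing `step` words each time."""
    last_start = max(len(words) - size, 0)
    for start in range(0, last_start + 1, step):
        yield words[start:start + size]


def shingles(words, k=3):
    """Return the set of k-word shingles in one window (k is an assumed value)."""
    return {" ".join(words[i:i + k]) for i in range(max(len(words) - k + 1, 1))}


def minhash_signature(items, num_perm=64):
    """For each of `num_perm` salted hash functions, keep the smallest hash value."""
    signature = []
    for seed in range(num_perm):
        salt = seed.to_bytes(8, "big")
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(item.encode("utf-8"), digest_size=8, salt=salt).digest(),
                "big",
            )
            for item in items
        ))
    return signature


def estimated_jaccard(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching signature slots."""
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


def compare(text_a, text_b, size=10, step=5, threshold=0.5):
    """Return (window_a, window_b, score) for window pairs at or above `threshold`."""
    words_a = re.findall(r"\w+", text_a.lower())
    words_b = re.findall(r"\w+", text_b.lower())
    # Precompute signatures for the second text so each is built only once.
    sigs_b = [
        (j, minhash_signature(shingles(win)))
        for j, win in enumerate(windows(words_b, size, step))
    ]
    hits = []
    for i, win_a in enumerate(windows(words_a, size, step)):
        sig_a = minhash_signature(shingles(win_a))
        for j, sig_b in sigs_b:
            score = estimated_jaccard(sig_a, sig_b)
            if score >= threshold:
                hits.append((i, j, score))
    return hits
```

In this sketch, a smaller step produces more overlapping windows and so more comparisons, while a higher threshold keeps only near-identical passages. MinHash works on shared word sequences, so it would not be expected to catch paraphrased or translated reuse; that is presumably the role of the SentenceTransformer method.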

- Python
Published by kreeedit over 2 years ago