https://github.com/ctuavastlab/jsongrinder.jl

Machine learning with Mill.jl for JSON documents

https://github.com/ctuavastlab/jsongrinder.jl

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: ieee.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.3%) to scientific vocabulary

Keywords

flux hierarchical-data json julia machine-learning multi-instance-learning

Keywords from Contributors

the-human-brain hybrid-differential-equations neural-sde surrogate matrix-exponential pde jacobians ida numerics control
Last synced: 5 months ago · JSON representation

Repository

Machine learning with Mill.jl for JSON documents

Basic Info
Statistics
  • Stars: 50
  • Watchers: 3
  • Forks: 9
  • Open Issues: 9
  • Releases: 69
Topics
flux hierarchical-data json julia machine-learning multi-instance-learning
Created over 7 years ago · Last pushed about 1 year ago
Metadata Files
Readme Contributing License

README.md

JsonGrinder.jl logo JsonGrinder.jl logo


License Docs Build Status codecov

JsonGrinder.jl is a library that facilitates processing of JSON documents into Mill.jl structures for machine learning. It provides functionality for JSON schema inference, extraction of JSON documents to a suitable representation for machine learning, and constructing a model operating on this data.

Watch our introductory talk from JuliaCon 2021.

Installation

Run the following in REPL:

julia ] add JsonGrinder

Getting Started

Citation

Kindly cite our work with the following entries if you find it interesting, please:

@article{Mandlik2022, author = {{\v{S}}imon Mandl{\'{i}}k and Mat{\v{e}}j Ra{\v{c}}insk{\'{y}} and Viliam Lis{\'{y}} and Tom{\'{a}}{\v{s}} Pevn{\'{y}}}, issn = {1533-7928}, issue = {298}, journal = {Journal of Machine Learning Research}, pages = {1-5}, title = {JsonGrinder.jl: automated differentiable neural architecture for embedding arbitrary JSON data}, volume = {23}, url = {http://jmlr.org/papers/v23/21-0174.html}, year = {2022}, }

@article{Mandlik2024, author = {{\v{S}}imon Mandl{\'{i}}k and Tom{\'{a}}{\v{s}} Pevn{\'{y}} and V{\'{a}}clav {\v{S}}m{\'{i}}dl and Luk{\'{a}}{\v{s}} Bajer}, journal = {IEEE Transactions on Information Forensics and Security}, title = {Malicious Internet Entity Detection Using Local Graph Inference}, year = {2024}, volume = {19}, pages = {3554-3566}, doi = {10.1109/TIFS.2024.3360867} }

  • this implementation (fill in the used version)

@software{JsonGrinder, author = {{\v{S}}imon Mandl{\'{i}}k and Tom{\'{a}}{\v{s}} Pevn{\'{y}} and Mat{\v{e}}j Ra{\v{c}}insk{\'{y}}}, title = {JsonGrinder.jl: a flexible library for automated feature engineering and conversion of JSONs to Mill.jl structures}, url = {https://github.com/CTUAvastLab/JsonGrinder.jl}, version = {...}, }

Contribution guidelines

If you want to contribute to JsonGrinder.jl, be sure to review the contribution guidelines.

We use GitHub issues for tracking requests and bugs.

Owner

  • Name: Joint research lab of Czech Technical University in Prague and Avast
  • Login: CTUAvastLab
  • Kind: organization
  • Location: Prague

GitHub Events

Total
  • Create event: 6
  • Commit comment event: 14
  • Release event: 6
  • Watch event: 4
  • Issue comment event: 6
  • Push event: 39
Last Year
  • Create event: 6
  • Commit comment event: 14
  • Release event: 6
  • Watch event: 4
  • Issue comment event: 6
  • Push event: 39

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 675
  • Total Committers: 11
  • Avg Commits per committer: 61.364
  • Development Distribution Score (DDS): 0.317
Past Year
  • Commits: 38
  • Committers: 1
  • Avg Commits per committer: 38.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Matěj Račinský m****y@a****m 461
Pevnak p****k@g****m 93
Šimon Mandlík s****k@g****m 91
github-actions[bot] 4****] 17
horakka5 3****5 4
SundaraRaman R s****n@g****m 2
Viliam Lisy v****y@P****m 2
CompatHelper Julia c****y@j****g 2
kozvojtex 5****x 1
Julia TagBot 5****t 1
Michal Najman m****n@a****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 58
  • Total pull requests: 68
  • Average time to close issues: 7 months
  • Average time to close pull requests: 2 months
  • Total issue authors: 6
  • Total pull request authors: 7
  • Average comments per issue: 1.45
  • Average comments per pull request: 0.53
  • Merged pull requests: 37
  • Bot issues: 0
  • Bot pull requests: 42
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • racinmat (39)
  • simonmandlik (11)
  • pevnak (3)
  • horakka5 (2)
  • reubano (1)
  • JuliaTagBot (1)
Pull Request Authors
  • github-actions[bot] (44)
  • racinmat (16)
  • simonmandlik (4)
  • ondraveres (2)
  • horakka5 (2)
  • digital-carver (1)
  • drvojtex (1)
Top Labels
Issue Labels
enhancement (15)
Pull Request Labels
hacktoberfest-accepted (3)

Packages

  • Total packages: 1
  • Total downloads:
    • julia 1 total
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 70
juliahub.com: JsonGrinder

Machine learning with Mill.jl for JSON documents

  • Versions: 70
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 1 Total
Rankings
Dependent repos count: 9.9%
Stargazers count: 16.3%
Forks count: 19.4%
Average: 21.1%
Dependent packages count: 38.9%
Last synced: 6 months ago