Projects

Scientific Software

Updated 10 months ago

Multi-view-AE — Peer-reviewed • Rank 5.5 • Science 100%

Multi-view-AE: A Python package for multi-view autoencoder models - Published in JOSS (2023)

autoencoder multi-modal multi-modal-autoencoder multi-modal-variational-autoencoder multivae multiview multiviewae mvae representation-learning variational-autoencoder

Mathematics

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

MM-PoE — Peer-reviewed • Rank 5.8 • Science 93%

MM-PoE: Multiple Choice Reasoning via. Process of Elimination using Multi-Modal Models - Published in JOSS (2025)

inference llm multi-modal question-answering

Artificial Intelligence and Machine Learning

Scientific Software · Peer-reviewed

Updated 10 months ago

IncrementalInference • Rank 11.1 • Science 77%

Clique recycling non-Gaussian (multi-modal) factor graph solver; also see Caesar.jl.

bayes bayes-network bayes-tree belief-propagation caesar chapman-kolmogorov factor-graphs filtering-algorithm inference isam julia-language multi-hypothesis multi-modal nonparametric optimization parametric robotics slam state-estimation sum-product

Updated 10 months ago

Caesar • Rank 10.5 • Science 77%

Robust robotic localization and mapping, together with NavAbility(TM). Reach out to info@wherewhen.ai for help.

caesar database isam julia multi-modal non-parametric parametric-navigation-solutions robotics slam

Updated 10 months ago

deepke • Rank 18.1 • Science 64%

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

attribute-extraction chinese deep-learning deepke document-level few-shot information-extraction instructie kg knowledge-graph knowprompt lightner low-resource multi-modal named-entity-recognition ner nlp prompt pytorch relation-extraction

Updated 10 months ago

fastrag • Rank 14.7 • Science 54%

Efficient Retrieval Augmentation and Generation Framework

benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal nlp question-answering semantic-search sentence-transformers summarization transformers

Updated 10 months ago

https://github.com/modelscope/data-juicer • Rank 19.2 • Science 46%

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

data data-analysis data-pipeline data-processing data-science data-visualization foundation-models instruction-tuning large-language-models llm llms multi-modal pre-training synthetic-data

Updated 10 months ago

deeplake • Rank 25.6 • Science 36%

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

ai computer-vision cv data-science datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops multi-modal python pytorch tensorflow vector-database vector-search

Updated 10 months ago

https://github.com/bytedance/salmonn • Rank 9.0 • Science 36%

SALMONN family: A suite of advanced multi-modal LLMs

audio audio-processing audio-visual-understanding bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university video video-understanding

Updated 10 months ago

farmvibes-ai • Rank 8.8 • Science 36%

FarmVibes.AI: Multi-Modal GeoSpatial ML Models for Agriculture and Sustainability

agriculture ai geospatial geospatial-analytics multi-modal remote-sensing stac sustainability weather

Updated 10 months ago

https://github.com/kyegomez/aoa-torch • Rank 3.2 • Science 26%

Implementation of Attention on Attention in Zeta

ai artificial-intelligence gpt4 machine-learning multi-modal multi-modality research

Updated 10 months ago

https://github.com/924973292/top-reid • Science 23%

【AAAI2024】TOP-ReID: Multi-spectral Object Re-Identification with Token Permutation

aaai24 missing-modal-retrieval msvr310 multi-modal multi-modal-retrieval object-reid person-reid rgbnt100 rgbnt201 vehicle-reid

Updated 10 months ago

https://github.com/924973292/editor • Science 23%

【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification

cvpr2024 frequency-analysis msvr310 multi-modal multi-modal-learning person-reid reid rgbnt100 rgbnt201 token-selection vehicle-reidentification

Updated 10 months ago

https://github.com/openbmb/minicpm-v • Science 36%

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and Video Understanding on Your Phone

minicpm minicpm-v multi-modal

Updated 10 months ago

https://github.com/ammarlodhi255/metadata-augmented-neural-networks-for-wild-animal-classification • Science 13%

This repository contains the implementation code for the paper "Metadata Augmented Neural Networks For Wild Animal Classification": https://www.sciencedirect.com/science/article/pii/S1574954124003479.

deep-learning fusion-techniques metadata metadata-fusion multi-modal multi-modal-learning wild-animal-classification wild-life-monitoring

Updated 10 months ago

https://github.com/amazon-science/contrastive_emc2 • Science 23%

Code the ICML 2024 paper: "EMC^2: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence"

contrastive-learning deep-neural-networks machine-learning machine-learning-algorithms mcmc-sampling multi-modal multi-modal-learning

Updated 10 months ago

https://github.com/amazon-science/fmcore • Science 26%

Running GenAI models at every scale, on every modality

ai-research artificial-intelligence computer-vision foundation-models genai machine-learning multi-modal natural-language-processing scientific-computing

Updated 10 months ago