Projects | Open Source Science

Updated 11 months ago

imageio • Rank 33.2 • Science 59%

Python library for reading and writing image data

animated-gif dicom imageio python scientific-formats video webcam-capture

Updated 11 months ago

scenedetect • Rank 25.8 • Science 54%

🎥 Python and OpenCV-based scene cut/transition detection program & library.

analysis image-processing opencv python python-opencv scene-detection scene-recognition video video-processing

Updated 11 months ago

awesome-virtual-try-on • Rank 11.4 • Science 67%

A curated list of awesome research papers, projects, code, datasets, workshops, etc. related to virtual try-on.

2d-virtual-try-on 3d-virtual-try-on awesome-list awesome-lists cloth collection curated curated-list fashion fashion-tech garment image image-based multi-pose multi-pose-guided try-on video video-virtual-try-on virtual-try-on vton

Updated 11 months ago

yuview • Rank 11.3 • Science 62%

The Free and Open Source Cross Platform YUV Viewer with an advanced analytics toolset

analysis ffmpeg h265 hevc player raw-video video video-coding video-player yuv yuv-player

Updated 11 months ago

aravis • Rank 16.9 • Science 54%

A vision library for genicam based cameras

c camera genicam gige glib gobject gobject-introspection gstreamer gtk3 meson usb3 video vision

Updated 11 months ago

sensorium • Rank 3.3 • Science 67%

NeurIPS | 1st place solution for Sensorium 2023 Competition

deep-learning neural-prediction neuroscience pytorch video visual-cortex

Updated 11 months ago

pliers • Rank 14.0 • Science 54%

Automated feature extraction in Python

audio feature-extraction python video

Updated 11 months ago

https://github.com/bytedance/xgplayer • Rank 25.7 • Science 36%

An HTML5 video player with a parser that saves traffic

dash flv flv-parser fmp4 hls hls-player html5-video html5-video-player mp4 mp4box player video video-player videoplayer

Updated 11 months ago

egyptiancoffins • Rank 4.7 • Science 54%

Egyptian Coffins website - Jekyll, material, bootstrap, jquery

3d-models ct-data egyptian-coffins egyptian-coffins-website fitzwilliam-museum gcrf jekyll medical-imaging research template-site video

Updated 11 months ago

contivqaexp • Rank 0.7 • Science 57%

A software tool for designing, conducting, and analyzing continuous subjective video quality assessment experiments.

analyzer experiment quality video

Updated 11 months ago

bombuscv-rs • Rank 11.6 • Science 44%

OpenCV based motion detection/recording software built for research on Bumblebees.

bee bee-detection bees bumblebee bumblebee-detection bumblebees insect-detection insects-detection motion-detection open-cv opencv raspberry-pi rust rust-lang video video-recording

Updated 11 months ago

https://github.com/altunenes/scramblery • Rank 6.4 • Science 49%

Desktop app for image and video scrambling/blurring with various methods, including Fourier phase scrambling, applied to the entire image/video or just the detected facial area.

desktop-app experimental-psychology face face-detection fourier fourier-transform gstreamer psychology scramble scramble-face tauri video video-processing

Updated 11 months ago

https://github.com/opencast/opencast • Rank 19.3 • Science 36%

The free and open source solution for automated video capture and distribution at scale.

capture-screen capture-video hacktoberfest opencast recording video video-management video-processing video-production video-publication

Updated 11 months ago

skelly_synchronize • Rank 11.1 • Science 44%

Synchronization tool for videos of the same event. Uses audio cross correlation to synchronize.

cross-correlation ffmpeg synchronization video video-processing

Updated 11 months ago

tuber • Rank 18.4 • Science 36%

🍠 Access YouTube from R

access-youtube caption video youtube youtube-api youtube-oauth

Updated 11 months ago

https://github.com/bytedance/salmonn • Rank 9.0 • Science 36%

SALMONN family: A suite of advanced multi-modal LLMs

audio audio-processing audio-visual-understanding bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university video video-understanding

Updated 11 months ago

https://github.com/cgohlke/vidsrc • Rank 7.2 • Science 36%

Video frameserver for NumPy.

format-reader python video windows

Updated 11 months ago

https://github.com/ai-forever/kandinsky-4 • Rank 7.2 • Science 36%

Text and image to video generation: Kandinsky 4.0 (2024)

distillation image-to-video kandinsky text-to-video video video-distillation video-generation video-to-audio

Updated 11 months ago

https://github.com/akamhy/videohash • Rank 16.9 • Science 23%

Near Duplicate Video Detection (Perceptual Video Hashing) - Get a 64-bit comparable hash-value for any video.

duplicate-detection duplicate-video-finder duplicate-videos ffmpeg find-similar-videos-by-content ndvd ndvr near-duplicate-video near-duplicate-video-clip-detection python video video-deduplication video-similarity-search visual-claim

Updated 11 months ago

https://github.com/simleek/displayarray • Rank 16.5 • Science 23%

An OpenCV interface to display tensors, multiple cameras, and so on.

display-tensors numpy python pytorch tensorflow video webcam

Updated 11 months ago

https://github.com/vpalmisano/webrtcperf • Rank 12.6 • Science 26%

WebRTC performance and quality evaluation tool.

chromium concurrent-webrtc-sessions edumeet janus jitsi load mediasoup nodejs performance statistics test test-automation video webrtc webrtcperf

Updated 11 months ago

vidstab • Rank 15.4 • Science 23%

A Python package to stabilize videos using OpenCV

computer-vision opencv python video video-stabilization

Updated 11 months ago

https://github.com/100mslive/100ms-flutter • Rank 7.8 • Science 26%

Flutter Live Streaming, Video Conferencing SDK & Sample App

audio chat conference dart ffmpeg flutter hacktoberfest hls live meeting player rtmp streaming video webrtc

Updated 11 months ago

https://github.com/SocAIty/media-toolkit • Rank 7.3 • Science 26%

Web-ready standardized file processing and serialization. Read, write, convert and send files. Including image, audio, video and any other file. Easily convert between numpy, base64, bytes and more.

base64 bytesio fast-task-api httpx image-processing json numpy serialization video

Updated 11 months ago

ball-action-spotting • Rank 4.8 • Science 26%

SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023

action-recognition action-spotting deep-learning football pytorch soccer soccernet video video-processing

Updated 11 months ago

scribesalad • Rank 4.4 • Science 26%

A collection of YouTube video transcripts: podcasts (Joe Rogan Experience, Tim Ferris, Jocko podcast, ..), lectures (YaleCourses, MIT lectures, ..). A big transcript salad spanning history, geography, science, politics, filmmaking and more.

artificial-intelligence history joe-rogan-experience jordan-b-peterson multilingual politics science social-issues transcripts video yalecourses youtube-video youtube-videos-transcripts

Updated 11 months ago

https://github.com/alexkranias/sketchit • Rank 1.1 • Science 26%

SketchIt is an interactive media manipulation program that applies fundamental computer vision/edge detection algorithms to media for both educational and artistic purposes.

desktop-app digital-art edge-detection image image-processing java-8 javafx sketch video video-processing

Updated 11 months ago

homepage • Rank 5.8 • Science 13%

A simple flask web app that pulls the audio track from almost any internet video using ytdl

downloader flask flask-application hacktoberfest python video webapp wsgi-application youtube-dl youtube-dl-wrapper

Updated 11 months ago

evan • Rank 3.0 • Science 10%

🔍 Python package to evaluate GANs for video generation

deep-learning deep-neural-networks gan gans generative-adversarial-network video video-generation videos

Updated 11 months ago

image-processing-matlab • Science 67%

Time-lapse images acquired from the IncuCyte® and processed with MATLAB to create a time-lapse video of one particular aggregate of bacteria

incucyte insets matlab matlab-processing video

Updated 11 months ago

https://github.com/amirzenoozi/aparat-videos-dataset • Science 13%

Some Simple Information About Aparat Videos for DataScientists

aparat cli crawler data-science data-science-projects pandas python python3 sdk-python sqlite3 video

Updated 11 months ago

stnls • Science 54%

Space-Time Attention with a Shifted Non-Local Search

autograd cuda differentiable non-local non-local-search pytorch video

Updated 10 months ago

https://github.com/hcmlab/nova • Science 39%

NOVA is a tool for annotating and analyzing behaviours in social interactions. It supports annotators with machine learning during the coding process. It also features both discrete labels and continuous scores, as well as a visualization of streams recorded with the SSI Framework.

active-learning annotation annotations annotator automated-analysis behaviour coding collaboration cooperative database interactive kinect labeling lightning-network machine-learning multimedia neural-network nova social-interactions video

Updated 11 months ago

2d3mf • Science 36%

Code and models for the paper "2D3MF: Deepfake Detection using Multi Modal Middle Fusion"

audio deep-learning deepfake-detection machine-learning multimodal pytorch video

Updated 11 months ago

https://github.com/awslabs/speke-reference-server • Science 13%

Secure Packager and Encoder Key Exchange (SPEKE) is part of the AWS Elemental content encryption protection strategy for media services customers. SPEKE defines the standard for communication between our media services and digital rights management (DRM) system key servers. This project provides the basic framework that partners can specialize and extend to support their specific method of Digital Rights Management while utilizing AWS' video streaming solutions.

drm encryption media mediaconnect mediapackage serverless speke video

Updated 11 months ago

manga-reader • Science 26%

Generate a video recap of any manga volume PDF with GPT Vision and Elevenlabs narration. Discord: https://discord.gg/MMqcuDe2WZ

accessibility comic comics comics-reader elevenlabs manga manga-reader manhwa manhwa-reader openai summarization video vision

Updated 11 months ago

open-in-mpv • Science 44%

Host-side of the extension to open any link or page URL in mpv via the browser context menu.

audio browser-extension mpv multimedia video

Updated 11 months ago

batch-video-converter • Science 31%

video video-converter video-processing

Updated 11 months ago

https://github.com/amazon-science/gluonmm • Science 36%

A library of transformer models for computer vision and multi-modality research

computer-vision iccv-2021 multimodality pytorch transformer video

Updated 11 months ago

https://github.com/alexkranias/photessera • Science 13%

Photessera is a Java-based video and image manipulation program that allows users to create video and image mosaics constructed from a separate group of user-selected images. The software supports the following file types as input: MP4, MOV, PNG, JPG. As of now the software is only verified as functional on Windows devices. A video explanation of the software's functionality can be found at https://youtu.be/ftKO35jiCHQ

art digital-art image image-manipulation image-processing video video-effects video-processing windows

Updated 11 months ago

https://github.com/altunenes/gstreamer-parallelism-study • Science 26%

Tech experiment on parallel video decoding for my blog post (CPU-based, using crossbeam)

experimental gstreamer video video-processing

Updated 11 months ago

bmt • Science 54%

Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)

activitynet-captions audio bi-modal-encoder bi-modal-transformer bmt bmvc bmvc20 dense-video-captioning i3d multimodal-fusion proposal-generator pytorch temporal-action-proposals transformer video video-features

Updated 11 months ago

https://github.com/chenzhaiyu/anomaly-detection • Science 36%

Intrusion detection and displacement monitoring from video sequences

anomaly displacement video

Updated 11 months ago

argan • Science 39%

[Open Source]. ARGAN - The improved version of AnimeGAN. Landscape photos/videos to anime

animation anime argan gan generative-adversarial-network resnet-101 resnet-18 resnet-34 resnet-50 style-transfer tensorflow tensorflow2 vgg19 vide-generation video

Updated 11 months ago

avocodo • Science 26%

A MATLAB-based GUI that streamlines audio/video coding of EEG data.

audio coding eeg video