batchalign_app

A front-end web app for Batchalign2, with enhancements for use by the School of Psychology at Trinity College Dublin.

https://github.com/ma-haozhe/batchalign_app

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 3 DOI reference(s) in README
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (14.9%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

A front-end web app for Batchalign2, with enhancements for use by the School of Psychology at Trinity College Dublin.

Basic Info

Host: GitHub
Owner: ma-haozhe
License: bsd-3-clause
Language: HTML
Default Branch: main
Homepage:
Size: 9.63 MB

Statistics

Stars: 0
Watchers: 1
Forks: 1
Open Issues: 1
Releases: 0

Created over 1 year ago · Last pushed 12 months ago

Metadata Files

Readme License Citation

batchalign_app

A front-end web app for Batchalign2, with enhancements for use by the School of Psychology at Trinity College Dublin.

Project Overview

This is a Django web application that provides a user-friendly interface for the Batchalign library, specifically tailored for the School of Psychology's needs. The app allows users to upload audio files (both single and batch uploads) and processes them using Batchalign for transcription and diarization with CHAT format support.

Current Implementation

Core Features

Single file audio upload functionality
Batch folder upload support
Audio file processing using Batchalign
Transcript generation in both raw and CHAT formats
Interactive speaker mapping interface
Support for CHAT format headers and metadata
Automatic speaker role assignment
Speaker diarization with customizable mapping
Export functionality for CHAT format files
Basic error handling and validation

Project Structure

batch_processor/: Main Django application
- models.py: Database models for AudioFile, Transcript, and SpeakerMap
- views.py: View logic for file handling, processing, and CHAT format generation
- urls.py: URL routing
- templates/: HTML templates for user interface
- tests.py: Test cases for functionality

Features

Audio Processing

Support for various audio file formats
Automatic speaker diarization
Transcript generation with speaker identification

CHAT Format Support

Automatic CHAT format generation
Customizable speaker role mapping
CHAT header metadata generation
Support for participant information
Export to .cha files

User Interface

Interactive file upload interface
Real-time speaker mapping controls
Toggle between raw and CHAT formats
Batch processing status indicators
Error feedback and validation

Installation and Setup

Prerequisites

Python 3.8 or higher
Django 3.2 or higher
Batchalign library
Rev.ai API key for speech recognition

Installation Steps

Clone the repository: bash git clone [repository-url]
Install dependencies: bash pip install -r requirements.txt
Configure environment variables:
- Set up Rev.ai API key
- Configure Django settings
Run migrations: bash python manage.py migrate
Start the development server: bash python manage.py runserver

How to Run This Project

Quick Start Guide

To get the application running quickly:

Create a virtual environment (recommended): bash python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
Install dependencies: bash pip install -r requirements.txt
Run the development server: bash python manage.py runserver This will start the application usuallyon http://127.0.0.1:8000/

Project Structure Explained

The project follows Django's standard structure: - batchalign_app/: The main Django project container with settings and configuration - batch_processor/: The actual Django app that implements the functionality - media/: Directory where uploaded audio files are stored - staticfiles/: Compiled static files for production

Accessing the Application

After starting the server, you can access: - Home page/Upload: http://127.0.0.1:8000/ - File list: http://127.0.0.1:8000/list/ - Transcript viewer: http://127.0.0.1:8000/transcript/{transcript_id}/

Common Issues

If audio files fail to process, verify your Rev.ai API key is correctly configured
For large audio files, processing may take several minutes
Ensure your Python environment has all dependencies from requirements.txt installed

Usage

Upload Audio Files:
- Use single file upload for individual files
- Use batch upload for multiple files
- Supported formats: MP3, WAV
Process Files:
- Files are automatically processed using Batchalign
- Speaker diarization is performed
- Transcripts are generated in both raw and CHAT formats
Map Speakers:
- Assign roles to identified speakers
- Use standard CHAT format roles (e.g., MOT, CHI)
- Update speaker mappings as needed
Export Results:
- Download transcripts in CHAT format
- Files are saved with .cha extension
- Contains proper CHAT headers and metadata

Development

Setting Up Development Environment

Install development dependencies
Configure local settings
Set up test database

Running Tests

bash python manage.py test batch_processor

Future Improvements

Priority Features

Enhanced User Interface:
- Progress indicators for file processing
- Advanced error messaging
- Improved responsive design
Processing Enhancements:
- Additional audio format support
- Multi-language support
- Processing queue optimization
Data Management:
- Advanced transcript editing
- Bulk file operations
- Enhanced export options

Technical Improvements

Backend Optimization:
- Advanced error handling
- Expanded test coverage
- Background task processing
Frontend Enhancement:
- Modern UI framework integration
- Real-time updates
- Advanced validation

License

This project is licensed under the BSD-3-Clause license - see the LICENSE file for details.

Attribution

This web application was developed by Haozhe Ma under the supervision of Dr. Jean Quigley at the School of Psychology, Trinity College Dublin. It is based on Batchalign2 by TalkBank, developed by Brian MacWhinney (Carnegie Mellon University) and Houjun Liu (Stanford University).

Citation

If you use this software in your research, please cite both this application and the original Batchalign2 software. The preferred citation for Batchalign2 is:

Liu, H., MacWhinney, B., Fromm, D., & Lanzi, A. (2023). Automation of Language Sample Analysis. Journal of Speech, Language, and Hearing Research, 66(7), 2421-2433. DOI: 10.1044/2023_JSLHR-22-00642

For more information, see the CITATION.cff file.

Owner

Name: Haozhe Ma
Login: ma-haozhe
Kind: user

Repositories: 1
Profile: https://github.com/ma-haozhe

Computer Science BSc. @ University of Galway Computer Science MSc. AR/VR @ Trinity College Dublin

Citation (CITATION.cff)

cff-version: 1.2.0
title: Batchalign App
message: >-
  If you use this software, please cite both this app and the 
  original Batchalign2 software using the metadata from this file.
type: software
authors:
  - given-names: Haozhe
    family-names: Ma
    affiliation: School of Psychology, Trinity College Dublin
  - given-names: Jean
    family-names: Quigley
    affiliation: School of Psychology, Trinity College Dublin
repository-code: 'https://github.com/ma-haozhe/batchalign_app'
url: 'https://github.com/ma-haozhe/batchalign_app'
abstract: >-
  A front-end web app for Batchalign2, with enhancements for use 
  by the School of Psychology at Trinity College Dublin. This app
  provides a user-friendly interface for the Batchalign library,
  specifically tailored for the School of Psychology's needs.
license: BSD-3-Clause
references:
  - type: software
    title: Batchalign2
    authors:
      - given-names: Brian
        family-names: MacWhinney
        email: macw@cmu.edu
        affiliation: Carnegie Mellon University
      - given-names: Houjun
        family-names: Liu
        email: houjun@stanford.edu
        affiliation: Stanford University
    repository-code: 'https://github.com/TalkBank/batchalign2'
    url: 'https://github.com/TalkBank/batchalign2'
    license: BSD-3-Clause
preferred-citation:
  type: article
  authors:
  - family-names: "Liu"
    given-names: "Houjun"
  - family-names: "MacWhinney"
    given-names: "Brian"
  - family-names: "Fromm"
    given-names: "Davida"
  - family-names: "Lanzi"
    given-names: "Alyssa"
  doi: "10.1044/2023_JSLHR-22-00642"
  journal: "Journal of Speech, Language, and Hearing Research"
  month: 7
  start: 2421 # First page number
  end: 2433 # Last page number
  title: "Automation of Language Sample Analysis"
  issue: 7
  volume: 66
  year: 2023

GitHub Events

Total

Delete event: 1
Push event: 15
Pull request event: 1
Fork event: 1
Create event: 3

Last Year

Delete event: 1
Push event: 15
Pull request event: 1
Fork event: 1
Create event: 3

batchalign_app

Science Score: 57.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

batchalign_app

Project Overview

Current Implementation

Core Features

Project Structure

Features

Audio Processing

CHAT Format Support

User Interface

Installation and Setup

Prerequisites

Installation Steps

How to Run This Project

Quick Start Guide

Project Structure Explained

Accessing the Application

Common Issues

Usage

Development

Setting Up Development Environment

Running Tests

Future Improvements

Priority Features

Technical Improvements

License

Attribution

Citation

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year