Nena Data Collector

React.js, TypeScript, Tailwind CSS, FastAPI, Python, Audio Processing, ML/AI

An interactive tool for dataset creation and curation workflows, particularly for linguistic and machine learning audio corpora, developed at Sartify Company for their Pawa Models division. You can view the tweet here.

Enables seamless YouTube video and audio downloads through a FastAPI backend, with support for both video and audio-only playback using resilient loading mechanisms.
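One common way to make media loading resilient is to retry a failed request with exponential backoff. The sketch below illustrates that general technique only; the names `backoffDelayMs` and `loadWithRetry`, the default delays, and the attempt count are assumptions for illustration and are not taken from the project's code.

```typescript
// Hypothetical helper: delay before retry number `attempt` (0-based),
// doubling each time and capped at `maxMs`.
function backoffDelayMs(attempt: number, baseMs = 500, maxMs = 8000): number {
  return Math.min(baseMs * Math.pow(2, attempt), maxMs);
}

// Hypothetical helper: call `load` until it succeeds or `maxAttempts` is reached.
// `sleep` is injectable so the wait can be replaced in tests.
async function loadWithRetry<T>(
  load: () => Promise<T>,
  maxAttempts = 4,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await load();
    } catch (err) {
      lastError = err;
      // Wait before the next attempt, but not after the final failure.
      if (attempt < maxAttempts - 1) {
        await sleep(backoffDelayMs(attempt));
      }
    }
  }
  throw lastError;
}
```

In a component, `load` could wrap a `fetch` call to the backend's download endpoint, with the final thrown error surfaced to the user as a status message.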

Features an interactive timeline with zoom, seek, and selection controls for precisely cutting and managing audio segments for research purposes.
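The zoom, seek, and selection behaviour of a timeline like this usually reduces to converting between seconds and pixels. The following is a minimal sketch of that mapping, assuming a viewport described by a start time and a pixels-per-second zoom level; `Viewport`, `Segment`, and all function names are hypothetical and do not come from the project.

```typescript
// Hypothetical types; not taken from the Nena Data Collector source.
interface Viewport {
  startSec: number; // time shown at the left edge of the timeline
  pxPerSec: number; // zoom level: pixels drawn per second of audio
}

interface Segment {
  startSec: number;
  endSec: number;
}

// Convert a time in seconds to an x offset in pixels within the viewport.
function timeToPx(t: number, view: Viewport): number {
  return (t - view.startSec) * view.pxPerSec;
}

// Convert an x offset in pixels back to a time (for example, to seek on click).
function pxToTime(x: number, view: Viewport): number {
  return view.startSec + x / view.pxPerSec;
}

// Zoom by `factor` while keeping the time under the cursor at the same pixel.
function zoomAt(view: Viewport, cursorPx: number, factor: number): Viewport {
  const anchorSec = pxToTime(cursorPx, view);
  const pxPerSec = view.pxPerSec * factor;
  return { startSec: anchorSec - cursorPx / pxPerSec, pxPerSec };
}

// Turn a drag between two pixel positions into a segment,
// ordered and clamped to the media duration.
function selectionFromDrag(
  x0: number,
  x1: number,
  view: Viewport,
  durationSec: number,
): Segment {
  const clamp = (t: number) => Math.min(Math.max(t, 0), durationSec);
  const a = clamp(pxToTime(x0, view));
  const b = clamp(pxToTime(x1, view));
  return { startSec: Math.min(a, b), endSec: Math.max(a, b) };
}
```

Keeping these as pure functions means the same arithmetic can drive rendering, click-to-seek, and segment cutting without depending on any particular React component.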

Includes batch and single-segment transcription capabilities, providing real-time progress feedback and contextual status messaging throughout the workflow.
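As an illustration of how batch progress and status messages might be derived, the sketch below tracks a status per segment and summarises it after each transcription call. It is an assumption-laden example: `SegmentStatus`, `BatchProgress`, `summarize`, `transcribeBatch`, the message wording, and the injected `transcribeOne` callback are all hypothetical, and the real tool may process segments differently (for example, in parallel).

```typescript
// Hypothetical types and names; not taken from the project.
type SegmentStatus = "pending" | "running" | "done" | "failed";

interface BatchProgress {
  total: number;
  finished: number; // segments that are done or failed
  failed: number;
  percent: number;
  message: string;
}

// Derive counts and a human-readable status message from per-segment statuses.
function summarize(statuses: SegmentStatus[]): BatchProgress {
  const total = statuses.length;
  const failed = statuses.filter((s) => s === "failed").length;
  const finished = statuses.filter((s) => s === "done").length + failed;
  const percent = total === 0 ? 0 : Math.round((finished / total) * 100);
  let message: string;
  if (total === 0) {
    message = "No segments selected";
  } else if (finished < total) {
    message = `Transcribing segment ${finished + 1} of ${total}`;
  } else if (failed > 0) {
    message = `Finished with ${failed} failed segment(s)`;
  } else {
    message = "All segments transcribed";
  }
  return { total, finished, failed, percent, message };
}

// Transcribe segments one at a time, reporting progress after each one.
// `transcribeOne` stands in for whatever request the backend expects.
async function transcribeBatch(
  segmentIds: string[],
  transcribeOne: (id: string) => Promise<string>,
  onProgress: (progress: BatchProgress) => void,
): Promise<Map<string, string>> {
  const statuses: SegmentStatus[] = segmentIds.map(() => "pending");
  const results = new Map<string, string>();
  onProgress(summarize(statuses));
  for (let i = 0; i < segmentIds.length; i++) {
    const id = segmentIds[i];
    statuses[i] = "running";
    try {
      results.set(id, await transcribeOne(id));
      statuses[i] = "done";
    } catch {
      // A failed segment is recorded but does not abort the batch.
      statuses[i] = "failed";
    }
    onProgress(summarize(statuses));
  }
  return results;
}
```

Single-segment transcription is then the same call with a one-element list, so both modes can share the progress and messaging logic.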

Built with a modular architecture of reusable contexts and hooks, with configurable backend settings persisted in localStorage for flexible deployment.
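Persisting backend settings in localStorage can be sketched roughly as follows. The storage key, the `BackendSettings` fields, the default values, and the helper names are assumptions made for this example; the project's actual settings shape is not documented here. The store is passed in as a parameter so the same functions work with `window.localStorage` in the browser and with an in-memory stand-in elsewhere.

```typescript
// Hypothetical settings shape and key; not taken from the project.
interface BackendSettings {
  baseUrl: string;
  timeoutMs: number;
}

// Minimal subset of the Web Storage interface that the helpers rely on.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

const SETTINGS_KEY = "nena.backendSettings"; // assumed key name
const DEFAULT_SETTINGS: BackendSettings = {
  baseUrl: "http://localhost:8000", // assumed local FastAPI address
  timeoutMs: 30000,
};

// Read settings, falling back to defaults for missing, malformed, or partial data.
function loadSettings(store: KeyValueStore): BackendSettings {
  const raw = store.getItem(SETTINGS_KEY);
  if (raw === null) return { ...DEFAULT_SETTINGS };
  try {
    const parsed = JSON.parse(raw) as Partial<BackendSettings>;
    return {
      baseUrl:
        typeof parsed.baseUrl === "string" ? parsed.baseUrl : DEFAULT_SETTINGS.baseUrl,
      timeoutMs:
        typeof parsed.timeoutMs === "number" ? parsed.timeoutMs : DEFAULT_SETTINGS.timeoutMs,
    };
  } catch {
    return { ...DEFAULT_SETTINGS };
  }
}

// Write settings as JSON under the shared key.
function saveSettings(store: KeyValueStore, settings: BackendSettings): void {
  store.setItem(SETTINGS_KEY, JSON.stringify(settings));
}

// In-memory store with the same interface, useful outside the browser.
function createMemoryStore(): KeyValueStore {
  const data: Record<string, string> = {};
  return {
    getItem: (key) => (key in data ? data[key] : null),
    setItem: (key, value) => {
      data[key] = value;
    },
  };
}
```

In a React app, a context provider could call `loadSettings(window.localStorage)` once on mount and `saveSettings` whenever the user changes the backend URL, so the choice survives page reloads.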

Developed as part of Sartify Company's Pawa Models initiative to streamline audio data collection and preprocessing for AI model training.

Nena Data Collector | Kelvin Hemu