Thanks for your interest in improving gemma-realtime. Here's how to help.
High Impact:
- More data extractors (Telegram, Discord, Signal, WhatsApp native API)
- CoreML/ANE optimization for the draft model (currently falls back to MLX)
- Windows/Linux support (currently macOS-focused due to Apple Silicon)
- Voice pipeline integration (STT + inference + TTS end-to-end)
Medium Impact:
- Better auto-detection of PLE-safe model variants
- Training data quality metrics and filtering
- Multi-GPU support for M-series Ultra chips
- Benchmark result archiving and regression tracking
Documentation:
- Guides for additional messaging platforms
- Video walkthrough of the full pipeline
- Troubleshooting guide for common issues
git clone https://github.com/sethdford/gemma-realtime.git
cd gemma-realtime
pip install mlx mlx-lm- Python: standard library preferred, minimal dependencies
- Shell: bash 3.2 compatible (macOS default), no associative arrays
- Scripts should be self-contained and runnable independently
# Verify all scripts parse correctly
python3 -m py_compile scripts/*.py
bash -n scripts/*.sh
# Run the benchmark
python3 scripts/voice-bench.py --rounds 5- Fork the repo
- Create a feature branch (
git checkout -b feat/telegram-extractor) - Make your changes
- Test on Apple Silicon if possible
- Open a PR with a clear description
Never commit:
- Training data (JSONL files with personal messages)
- Model adapters (.safetensors files)
- Benchmark results that contain message previews
- API keys or credentials
The .gitignore already excludes these, but please verify before committing.