Weekly Progress Report #13 - Josiah Ehrler
Summary of Work Completed
This week I focused on optimization and a major new feature: voice chat mode for MedLlama2. The optimization work targeted memory usage and inference latency on the Jetson Nano and reduced the average response time. The highlight of the week is the early-stage integration of a voice chat interface. Using the Web Speech API and lightweight wake-word detection, the system can now accept spoken inventory queries and respond with synthesized audio. Astronauts can ask "What medical supplies are low?" without touching a keyboard, a critical improvement for zero-gravity or gloves-on scenarios.
The voice pipeline consists of a browser-based speech recognition module that streams text to our backend LLM, which then returns a natural language answer. The text-to-speech component uses the browser's native synthesis for immediate feedback. For mission-critical reliability, we have cached fallback voices and implemented a push-to-talk toggle. Early testing shows strong accuracy on medical terminology recognition after customizing the speech model's vocabulary.
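The flow described above (recognized text in, LLM answer out, answer spoken aloud, all gated by push-to-talk) can be sketched in a few lines. This is a minimal illustration rather than the project's actual code: `VoicePipeline`, `ask_llm`, and `speak` are hypothetical names, and the LLM and speech hooks are replaced with stand-ins so the sketch runs without a browser or a model.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class VoicePipeline:
    """Push-to-talk gated pipeline: transcript -> LLM answer -> spoken reply."""
    ask_llm: Callable[[str], str]   # hypothetical backend call (e.g. to MedLlama2)
    speak: Callable[[str], None]    # hypothetical text-to-speech hook
    talk_enabled: bool = False      # push-to-talk state, off by default

    def set_push_to_talk(self, pressed: bool) -> None:
        self.talk_enabled = pressed

    def on_transcript(self, text: str) -> Optional[str]:
        """Handle one recognized utterance; return the answer, or None if ignored."""
        query = text.strip()
        if not self.talk_enabled or not query:
            return None  # ignore speech while the toggle is off or text is empty
        answer = self.ask_llm(query)
        self.speak(answer)
        return answer


# Stand-ins (assumptions) so the sketch runs on its own.
spoken: list[str] = []
pipeline = VoicePipeline(
    ask_llm=lambda q: f"(stub answer to: {q})",
    speak=spoken.append,
)
pipeline.on_transcript("What medical supplies are low?")  # ignored: toggle off
pipeline.set_push_to_talk(True)
pipeline.on_transcript("What medical supplies are low?")  # answered and spoken
```

Keeping the gate inside `on_transcript` means stray recognition results are dropped before they ever reach the model.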
FDR Preparation - Houston
With the Final Design Review approaching in Houston, our team has shifted focus toward polishing the system for the review. We have begun preparing the presentation and rehearsing the live demonstration. Voice mode will be a key differentiator, showcasing how our system reduces cognitive load for astronauts.
Challenges Encountered and Solutions Implemented
The biggest challenge was latency: the chain from speech recognition through LLM inference to text-to-speech introduced a noticeable delay. To address this, I implemented speculative response generation, where the system predicts likely follow-up queries and preloads their context.
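The idea behind speculative response generation can be illustrated with a small cache that is filled ahead of time. The sketch below is an assumption-laden simplification, not the implementation used in the project: `SpeculativeResponder`, `generate`, and `predict_followups` are hypothetical names, it caches whole answers rather than preloaded context, and prefetching runs inline where a real system would likely do it in the background while audio plays.

```python
from typing import Callable, Dict, List


class SpeculativeResponder:
    """Answers a query, then precomputes answers for predicted follow-ups."""

    def __init__(self,
                 generate: Callable[[str], str],
                 predict_followups: Callable[[str], List[str]]) -> None:
        self.generate = generate                    # hypothetical slow LLM call
        self.predict_followups = predict_followups  # hypothetical predictor
        self.cache: Dict[str, str] = {}
        self.hits = 0

    @staticmethod
    def _key(query: str) -> str:
        # Normalize so small transcription differences still hit the cache.
        return " ".join(query.lower().split()).rstrip("?.!")

    def answer(self, query: str) -> str:
        key = self._key(query)
        if key in self.cache:
            self.hits += 1
            response = self.cache[key]
        else:
            response = self.generate(query)
            self.cache[key] = response
        self.prefetch(query)
        return response

    def prefetch(self, query: str) -> None:
        # Generate answers for predicted follow-ups that are not cached yet.
        for followup in self.predict_followups(query):
            key = self._key(followup)
            if key not in self.cache:
                self.cache[key] = self.generate(followup)


# Stand-ins (assumptions) so the sketch runs without a model.
calls: List[str] = []

def fake_llm(q: str) -> str:
    calls.append(q)
    return f"(stub answer to: {q})"

def fake_predictor(q: str) -> List[str]:
    return ["When do they expire?"] if "low" in q.lower() else []

responder = SpeculativeResponder(fake_llm, fake_predictor)
responder.answer("What medical supplies are low?")
responder.answer("when do they expire")  # served from the prefetched cache
```

The trade-off is extra inference work for follow-ups that are never asked, which matters on a memory-constrained device like the Jetson Nano.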
Goals for the Upcoming Week
With FDR just weeks away, our focus is on stability and presentation readiness. Specific goals include:
- Complete final polish of voice chat mode and integrate it as a toggle within the main dashboard
- Expand voice commands to include restock requests and expiry alerts
- Conduct full system stress tests to ensure no crashes during the Houston demo
- Finalize FDR presentation and practice delivery as a team
- Prepare hardware for travel and create a backup plan in case of equipment failure
- Document all system APIs
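
As a rough sketch of how the planned restock and expiry voice commands might be routed, one option is a simple keyword match that runs before the LLM and falls back to it when nothing matches. The intent names and trigger patterns below are hypothetical placeholders, not the project's actual command set.

```python
import re
from typing import Optional

# Hypothetical intent names and trigger patterns; not taken from the project.
INTENT_PATTERNS = {
    "restock_request": re.compile(r"\b(restock|reorder|order more)\b"),
    "expiry_alert": re.compile(r"\b(expir\w*|out of date)\b"),
    "low_stock_query": re.compile(r"\b(low|running out)\b"),
}


def route_command(transcript: str) -> Optional[str]:
    """Return the first matching intent name, or None to fall back to the LLM."""
    text = transcript.lower()
    for intent, pattern in INTENT_PATTERNS.items():
        if pattern.search(text):
            return intent
    return None
```

Patterns are checked in insertion order, so an utterance that mentions both restocking and low stock is treated as a restock request.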