VAD vs event-triggered for AI speech-to-speech applications