• given an audio clip, it detects the presence or absence of human speech,