I do recall seeing something like that making use of html5 audio tags. Or maybe it was chapter markers and images. I suppose the images could be subtitles.

/