Empathy sessions are really cool, but in practice I have found that it would be great if we could hold the meeting over video and then have the transcript generate the current text-based empathy session interface.
Each comment could have a timestamp link, which would let you jump back into the video at that point and see the discussion at that moment. The 'save as idea' and starred comment options would still be present.
One option would be to allow uploading a Zoom recording together with its transcript file and generating an empathy session from the contents.
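To make the upload option a bit more concrete, here is a rough sketch of how a transcript could be turned into timestamped comments. It assumes the transcript is a WebVTT (.vtt) file with optional "Speaker: text" prefixes on each cue, and that the video player honours a `#t=<seconds>` fragment on the recording URL. The names `Comment`, `parse_vtt`, and `timestamp_link` are made up for illustration and are not part of any Aha! or Zoom API.

```python
import re
from dataclasses import dataclass

# Matches the start time of a WebVTT cue timing line, e.g.
# "00:01:05.250 --> 00:01:09.000" (the hours part is optional in WebVTT).
CUE_START = re.compile(r"(?:(\d+):)?(\d{2}):(\d{2})\.(\d{3})\s+-->")


@dataclass
class Comment:
    """One empathy-session comment derived from a transcript cue (hypothetical shape)."""
    start_seconds: float
    speaker: str
    text: str


def parse_vtt(vtt_text):
    """Turn WebVTT text into a list of Comment objects, one per cue."""
    comments = []
    # Cues are separated by blank lines.
    for block in re.split(r"\n\s*\n", vtt_text.strip()):
        lines = [line.strip() for line in block.splitlines() if line.strip()]
        for i, line in enumerate(lines):
            match = CUE_START.match(line)
            if not match:
                continue
            hours, minutes, seconds, millis = match.groups()
            start = (int(hours or 0) * 3600 + int(minutes) * 60
                     + int(seconds) + int(millis) / 1000)
            body = " ".join(lines[i + 1:])
            # Assumption: a "Name: " prefix marks the speaker. Ordinary text
            # that happens to contain ": " would be split the same way.
            speaker, sep, rest = body.partition(": ")
            if sep:
                comments.append(Comment(start, speaker, rest))
            else:
                comments.append(Comment(start, "", body))
            break
    return comments


def timestamp_link(video_url, start_seconds):
    """Build a link that jumps to the comment's moment in the recording.

    Assumes the player supports a '#t=<seconds>' media fragment.
    """
    return f"{video_url}#t={int(start_seconds)}"
```

Each resulting comment could then be shown in the existing text-based interface, with `timestamp_link` supplying the jump-to-video link next to it.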
Another option would be Aha! hosting their own video chat at the empathy session web page. It would be really cool if the transcripts could be recorded in real time, and the text populated as participants spoke.
We ran a Teams call alongside the empathy session so people could talk to each other. The two worked well together.