Thanks for your comment. Let me know if you end up implementing this on Android. I think this concept could be adaptable to other platforms or languages. I just happened to be working in Swift and so I started there.
It took about a week to prove the idea was viable and then another week or two to get the code into a state where I felt it would be useful to share.
The closest alternative, which offers only text-to-speech mode, is Speechify. However, they charge over $100 per year, which is significant. A lifetime license for Quickify is available for just $39, or $1.99 per month.
I don't want to seem assuming, but I haven't seen such a set of features in any application yet, really.
How long did it take you to implement this proof of concept?