No, you didn't. This doesn't match the original text.
0:47 Added in text: "Okay, here's the text prepared for reading aloud."
0:58
Original: "Okay, yes, it's a dumb idea,"
Audio: "Okay, yes, it's a bit of a strange idea"
1:08
Original: "Or do you, say, list off to the left some? What I want to ask you is: Can you find out? Hell no. You can see that, sure."
Audio: "Or do you drift off to the left a bit? The question is, can you figure it out? No, you can't. You can see that."
---
It appears you are using "Variational Lossy Autoencoder (VLAE)" as the basis for your website[1], which might be good for simplifying more complex things but defeats the purpose here. It's using more than four letters in words, and censoring out "dumb" and "hell"?
Why don't you try pointing that another explanation of the theory of relativity without this limitation? Seems like that'd be a more interesting exercise.
Ah, I just want to clarify that I'm very unhappy about the censoring of "dumb" and "hell".
I allow the text to get slightly optimized for audio experiences, e.g. page numbers or mathematical notation gets replaced. But I have think about that again.