If you want the raw data, you'll have to go dig in the archives to find the log books and card decks.
This[1] paper goes into some detail on how the digital records were constructed from the log books, card decks and such. This[2] paper deals with an update of those digital records, including new digitization efforts. You can download the raw digital data from ICOADS here[3].
Regardless, ascii encoding isn’t raw data. You’re making software engineer assumptions. Statistical noise is introduced 4-5 steps before the data is recorded digitally.
Even after it’s digitized, more noise is introduced through recording errors and normalization.
To understand the original distribution, the entire workflow needs to have been recorded