Two words: excessive complexity. It's always seemed strange that an application ...

13of40 · on April 3, 2021

Here's an interesting quirk in Windows: There are two APIs to execute external programs, CreateProcess and ShellExecute. CreateProcess is the older of the two and only runs executables. ShellExecute opens the target with whatever app is associated with the extension.

When they shoehorned the ShellExecute behavior into cmd.exe, they basically just said "if (!CreateProcess(foo)) {ShellExecute (foo)}"

As a result, if you take "foo.exe" and rename it "foo.txt" then try to run it like "C:\>foo.txt" from the command line, it will run as an executable instead of opening in Notepad like you would expect. Do the same with a real text file (that doesn't start with "MZ") and it opens in Notepad.
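A rough sketch of the fallback described above, for illustration only. It does not call the real Windows APIs; it just models "try to execute first, otherwise use the extension's association". The function names are hypothetical, and treating a leading "MZ" as the whole executable test is a simplification taken from the comment.

```python
import os

def looks_like_pe_executable(path):
    """Return True if the file begins with the 'MZ' signature."""
    with open(path, "rb") as f:
        return f.read(2) == b"MZ"

def how_it_would_launch(path):
    """Model the fallback: execute if possible, else use the extension's handler."""
    if looks_like_pe_executable(path):
        # Stands in for CreateProcess succeeding, whatever the extension says.
        return "execute"
    ext = os.path.splitext(path)[1].lower()
    # Stands in for ShellExecute looking up the associated application.
    return "open with handler for " + (ext or "(no extension)")
```

Under this model a renamed foo.txt that still starts with "MZ" is executed, while a real text file falls through to the .txt handler.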

COGlory · on April 3, 2021

This is a frustrating behavior on Windows, not because it's possible, but because it's default. I vastly prefer the way KDE performs. Whatever the default program to open that file type is attempts to open it. You can easily change what the default is.

It's frustrating when I instinctively change a file extension on Windows so I can do some other operation with it (say changing a configuration file to .txt to edit it) and Windows still doesn't know what to do with it.

I'm not averse to the behavior, I just wish I could control when it happens.

ectopod · on April 3, 2021

Being pedantic, CreateProcess is more fundamental but ShellExecute, dating back to 16-bit Windows, is older.

crazygringo · on April 3, 2021

It's a rich text editor by default. Rich text is still text.

Opening HTML files and converting them to rich text certainly does belong as a valid feature for a rich text editor. It'll open and convert Word files too, which is super useful.

The content-type autodetection, however, I agree was a bad idea. Still, this vulnerability presumably existed with an .html file opened in TextEdit.

fiddlerwoaroof · on April 3, 2021

I assume the content-type autodetection exists because of how downloading files occasionally appends a .txt extension (I think this is when the content type is text/plain). Postel’s law gets applied with the result of macOS attempting to make up for misconfigured servers.

Thorrez · on April 3, 2021

>Still, this vulnerability presumably existed with an .html file opened in TextEdit.

It wouldn't have been as bad though. From the article:

>Gatekeeper doesn’t quarantine TXT files

Wowfunhappy · on April 3, 2021

Text ≠ Plain Text. TextEdit defaults to rtf. It supports html as an alternative to rtf, which is to say it can do basic formatting and nothing else.

It's perfectly reasonable to expect a text editor to support more than literal unicode, and to work with a variety of commonly-used formats.

thitcanh · on April 3, 2021

It seems people think TextEdit is the macOS equivalent of Notepad.exe, when it’s really more like WordPad.

jaxn · on April 3, 2021

But is it reasonable to treat a .txt as anything other than plain text?

Wowfunhappy · on April 3, 2021

No, it’s definitely not, that’s a separate problem!

rualca · on April 3, 2021

You should read the article before commenting.

The blog post states that the contents of said "text file" were quite literally <!DOCTYPE HTML><html><head></head><body>

This is not a mere text file. At all. This is an HTML document that might be deemed valid by a very permissive validator.

Just because HTML might be stored in a text file does not mean that a noncompliant HTML file ceases to be an HTML file.

ben_w · on April 4, 2021

If the file extension is .txt, I always expect it to be opened as plain text. The file extension is, rightly or wrongly[0], the metadata declaring the file type — nobody would consider it reasonable for an .exe to remain executable if the extension is changed to .txt, after all.

One might, possibly, still argue about the text encoding of a .txt file (I’m old enough to remember Unicode being a new fancy alternative to ASCII), but that’s about it.

[0] Sometimes I reminisce about the good old days of classic Mac OS, with resource forks and separate file type metadata: https://en.wikipedia.org/wiki/Resource_fork

rualca · on April 5, 2021

> If the file extension is .txt, I always expect it to be opened as plain text. The file extension is, rightly or wrongly[0], the metadata declaring the file type — nobody would consider it reasonable for an .exe to remain executable if the extension is changed to .txt, after all.

That statement is quite wrong and shows a good dose of ignorance. To start off, in UNIX systems the extension means nothing regarding whether a file is an executable or not. All it takes is a +x flag and a file format (header, magic number) that can be executed.

Also, file extensions mean nothing. In fact, a popular and very basic trick to fool clueless users into running malware (and one which any anti-malware tool checks) is to sneak in executables with a different extension, because it only means something to clueless users.

And a file with a .txt file extension means nothing at all. The only thing that matters is the file content and its file permissions.
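A minimal sketch of that claim, assuming a POSIX system: whether a file looks runnable is decided by its permission bits and its first bytes, and the name never enters into it. The helper names are hypothetical, and the list of recognised headers (ELF and "#!" scripts) is deliberately incomplete.

```python
import os
import stat

# Headers this sketch recognises: ELF binaries and interpreter scripts.
MAGIC_PREFIXES = (b"\x7fELF", b"#!")

def has_exec_bit(path):
    """True if any of the user/group/other execute bits is set."""
    mode = os.stat(path).st_mode
    return bool(mode & (stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH))

def has_runnable_header(path):
    """True if the file starts with one of the known magic prefixes."""
    with open(path, "rb") as f:
        head = f.read(4)
    return head.startswith(MAGIC_PREFIXES)

def is_probably_runnable(path):
    # The file name and extension are never consulted.
    return has_exec_bit(path) and has_runnable_header(path)
```

A shell script saved as notes.txt with mode 755 passes this check; the same file with mode 644 does not.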

jaxn · on April 3, 2021

I read the article. I understand that. But what takes precedence, the first line or the extension?
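The two possible answers can be sketched side by side. This is a hypothetical illustration, not TextEdit's actual logic: `sniffs_as_html` and `choose_renderer` are made-up names, and the sniffing rule (a leading doctype or html tag) is an assumption.

```python
import os

def sniffs_as_html(data):
    """Guess from the first bytes whether the content is HTML."""
    head = data[:512].lstrip().lower()
    return head.startswith((b"<!doctype html", b"<html"))

def choose_renderer(filename, data, extension_wins):
    """Pick a renderer under one of the two precedence policies."""
    ext = os.path.splitext(filename)[1].lower()
    if extension_wins and ext == ".txt":
        # Policy 1: a .txt name is binding, the content is never sniffed.
        return "plain text"
    # Policy 2: the content decides, the name is only a hint.
    return "html" if sniffs_as_html(data) else "plain text"
```

With the file contents quoted from the article, the first policy yields plain text and the second yields HTML, which is the whole disagreement in this subthread.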

throwawayboise · on April 3, 2021

TextEdit dates back to NeXTStep, so it was originally written in the late 1980s probably. Guessing it didn't render HTML originally, but it always had RTF capability. Not that it's an excuse in 2021, but very few applications from that era would be considered "safe" today.

thought_alarm · on April 3, 2021

Edit.app is the original NeXTSTEP text editor from 1989. It supported plain text and rich text files. Famously, the first web browser was based on the rich text capabilities built into NeXTSTEP.

TextEdit.app is the OpenStep rewrite of Edit.app and dates to the mid 1990s. It was likely one of the first OpenStep apps. It supported the same rich text files as the original Edit.app.

Apple bought NeXT, OpenStep became Cocoa, TextEdit was ported to Java, and then back to garbage collected Objective-C, then ARC Objective-C, (then Swift, probably).

Along the way it picked up features for reading/writing/editing HTML and Microsoft Word documents.

Apple used to publish the source code for TextEdit as part of their Xcode sample code, but they stopped a few years ago.

fiddlerwoaroof · on April 3, 2021

Yeah, I think TextEdit.app is supposed to be a showcase for the Cocoa text system, really.

anaerobicover · on April 3, 2021

> Apple used to publish the source code for TextEdit as part of their Xcode sample code, but they stopped a few years ago.

The URL still works if you want it, but, yeah, it's obviously not up-to-date:

https://developer.apple.com/library/archive/samplecode/TextE...

> (then Swift, probably).

Not yet at least; there's no Swift symbols in the binary on Big Sur.

Wowfunhappy · on April 3, 2021

> TextEdit was ported to Java

Wait, what? Wow, that’s nuts!

thought_alarm · on April 4, 2021

Java was supposed to be the primary programming language for OS X. That's why they renamed OpenStep to Cocoa (Java and Cocoa go great together).

But AppKit was still pure Objective-C, and bridging between AppKit's Obj-C APIs and the Java language presented problems. 3rd-party developers (eventually) preferred to write directly in Objective-C and Apple dropped the Java bridge some years later.

mceachen · on April 3, 2021

An example of "unsafe defaults:"

NeXT used Display PostScript for the display manager. If you opened an email that had PostScript commands, the mail agent would happily, automatically, execute them.

A favorite payload sent around the computer lab would smear all pixels downward to "melt" whatever was rendered on your display.

Note that there weren't that many interesting things to exfiltrate back then, so this wasn't a terrible default: there wasn't (any!) online commerce, online banking was rare, and even passwords were never echoed to the terminal.

icedchai · on April 3, 2021

You don't need a password to be echoed to exfiltrate it. You just need the key codes. Not sure about NeXTStep, but regular old X let you sniff keys really easily.

Some systems (specifically, earlier versions of SGI IRIX) shipped with X authorization disabled by default. This is the equivalent of "xhost +". You could sniff a box as soon as it was plugged into the network, including capturing login session credentials, all terminal commands, and anything else. When they su'd to root, yes, you'd capture the root password.

In those days (mid 90's) almost nobody was running firewalls. At least, nobody in these parts. Putting your "office on the Internet" meant raw, unfiltered IP.

BlueTemplar · on April 3, 2021

These days too, IPv6 tends to be firewall-free. In theory there are protections though, like regularly changing suffixes.

Do MacOS and Ubuntu ship with firewalls?

icedchai · on April 3, 2021

Most consumer routers should at least be doing basic inbound connection filtering for IPv6. Are they not?

MacOS and Ubuntu ship with firewalls, though not sure if they're enabled by default.

BlueTemplar · on April 4, 2021

I checked one big ISP, boasting 99% IPv6 coverage, and the IPv6 firewall is opt-in, and considering how many people change their settings...

(For those who might not be aware of it: with IPv6, there's no NAT, since there's no need for it.)

simonh · on April 3, 2021

This was fixed in 2020 so no need for any excuses in 2021.

movedx · on April 3, 2021

Agreed. This problem exists because someone wrote a tool that should only do one (really well) but instead made it do five different things.

azinman2 · on April 3, 2021

According to you. I appreciate that TextEdit is a rich editor. I can use vim or countless other apps for plain text. Few do what TextEdit does with its simplicity.

movedx · on April 3, 2021

Aye. According to me. I have a preference for tools doing one thing, and one thing well. That attitude has served me very well.

Your opinion is that you like TextEdit for what it is.

Neither opinion/feeling is relevant.

fouric · on April 3, 2021

Neither is the opinion that "This problem exists because someone wrote a tool that should only do one (really well) but instead made it do five different things."

You can make security bugs in simple tools - this security bug is not purely a function of the number of target use-cases.

Nor do you have any rational basis for asserting that the given app "should only do one [thing]".

marcosdumay · on April 3, 2021

So, keeping the GP rationale, do you see any gain from TextEdit opening txt files?

ethbr0 · on April 3, 2021

That sound in the background is emacs laughing.

eurasiantiger · on April 3, 2021

You can press ctrl-super-meta-¥ to make it stop.

tsimionescu · on April 3, 2021

I don't think it will accept the command if it knows you learned it as anything other than C-S-M-¥.

digitaltrees · on April 3, 2021

Underrated.

bobbylarrybobby · on April 3, 2021

It seems like the only real issue here is that a file:// URL can make a network request. Who could ever think that that would be a good idea?

aaronharnly · on April 3, 2021

People who were building next-generation networked computers in the late 1980s and 1990s?

zimpenfish · on April 3, 2021

> a file:// URL can make a network request. Who could ever think that that would be a good idea?

Anyone with networked filesystems, I should imagine?

rualca · on April 3, 2021

>>Anyone with networked filesystems, I should imagine?

You're either missing the point made by GP or being disingenuous. Please keep in mind that you need to explicitly mount an NFS before you're able to open it, and mounting an NFS not only requires explicit authorization but also only provides access to a specific file system mounted in a specific point following specific permissions.

Accessing the whole internet through file:// without being prompted for permissions or consent or even awareness is an entirely different thing. For starters, the access is not explicit nor subjected to conditions.
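One narrow piece of this can be shown in code: a file:// URL has a host component, and a viewer could refuse any file URL that names a host other than the local machine. The sketch below uses Python's standard `urllib.parse`; the function name and the policy are assumptions, and it says nothing about local paths that the operating system itself maps onto a network mount.

```python
from urllib.parse import urlsplit

def file_url_names_remote_host(url):
    """True if a file:// URL carries a host other than the local machine."""
    parts = urlsplit(url)
    if parts.scheme.lower() != "file":
        raise ValueError("not a file URL: " + url)
    host = parts.hostname or ""
    # An empty host or 'localhost' means the local file system.
    return host not in ("", "localhost")
```

For example, `file:///etc/hosts` has an empty host and is treated as local, while `file://example.com/share/a.txt` names a host and would be flagged.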

Tepix · on April 3, 2021

No, the fact that .TXT files got interpreted as HTML is worse.

kergonath · on April 3, 2021

Rigidly interpreting documents depending on their file extension is worse than trying to figure out the type of a document before interpreting it. File extensions are a brittle and primitive system that does not fix any security issue.

lmm · on April 3, 2021

File extensions are simple and, crucially, visible and understandable to the user. They're far better than any proposed alternative.

fouric · on April 3, 2021

Optimizing for "simple" for the sake of robustness is exactly backward.

> visible and understandable

False. Something is neither visible nor understandable if it's misleading - which file extensions are. There are absolutely no guarantees that a file extension will match file contents, and that assumption can cause security risks - like in this article.

An actually good alternative is to encode file type as metadata, instead of inside the file contents or file-name, and then configure viewers to display it. That, while not "simple", is also visible and understandable to the user, while simultaneously being safe.

lmm · on April 3, 2021

> There are absolutely no guarantees that a file extension will match file contents, and that assumption can cause security risks

Only in software that ignores the extension.

> An actually good alternative is to encode file type as metadata, instead of inside the file contents or file-name, and then configure viewers to display it. That, while not "simple", is also visible and understandable to the user, while simultaneously being safe.

Metadata can be just as wrong as a file extension, and is generally far less visible.

kergonath · on April 5, 2021

> Only in software that ignores the extension

No, only in software that blindly follows file extensions.

lmm · on April 6, 2021

The problem is that the text editor ignored the extension of the txt file. That's what led to unsafe behaviour - the user thought the file was fine to open because the extension was txt, and improving users is not practical.

The exact same thing would happen with metadata - indeed file extensions are just a form of metadata - if the metadata says this is a text file but the application ignores it, we would have the exact same issue.

kergonath · on April 5, 2021

They are also trivial to get wrong, can be mangled when the files are moved around, and are easy to use as an attack vector.

They are not far better than the alternatives, it's just that no alternative reached a critical mass due to them not being how Windows works.

lmm · on April 6, 2021

> They are also trivial to get wrong, can be mangled when the files are moved around, and are easy to use as an attack vector.

On the contrary, they're the only kind of metadata that doesn't get mangled when files are moved around, and they're far less of an attack vector than other approaches. Of course you can set the wrong file type, but no approach avoids that problem.

Tepix · on April 4, 2021

I disagree, especially in the context of email attachments or web uploads. It isn't rocket science.

The problems occur when a file that is said to be of a certain filetype is erroneously treated as something else.

Qwertious · on April 3, 2021

If you don't want text editors to do non text-editing stuff, then people need to stop saying we should build development environments around text editors. "An IDE is just a text editor with bells and whistles", people say. Well if that's the case, it's not surprising if people "only ship the one text editor".

Railsify · on April 3, 2021

But not with file extensions of .txt. They should only do bells and whistles if the extension warrants some bells. .md, sure syntax highlight me. But opening .txt and treating it as html, that seems strange.

cyphar · on April 3, 2021

Well, on Unix file extensions are a convention and don't have any strict semantic meaning. Maybe this doesn't make sense in a world where most people do think in terms of file extensions (thanks to the popularity of Windows) but it shouldn't be surprising that non-Windows programs might not special-case file extensions.

(Though in fairness, text editors do usually have special casing for file extensions and these days tools like ls will colour filenames based on the extension.)

tsimionescu · on April 3, 2021

This is only a Unix vs Windows thing in terms of the application launcher and how it is implemented. File extensions are semantically meaningful for many unix tools, most notably gcc.

kergonath · on April 3, 2021

Also meaningless for vastly more UNIX tools. Also, Gnu’s not UNIX.

irq · on April 3, 2021

> But opening .txt and treating it as html, that seems strange.

I’d go as far as to call it negligent, not merely strange.

smoldesu · on April 3, 2021

I don't know anyone who says that an IDE is just a text editor with bells and whistles. Visual Studio Code is a text editor with more bells and/or whistles than a choo-choo train, but that doesn't make it any more of an IDE than nano and termux.

aksss · on April 3, 2021

It compiles and debugs, that seems like an IDE to me.

VS Code is cool and all, but it definitely is a lot more manual and laborious than VS. The tooling and automation in VS is missed if you’re used to it.

slashdev · on April 3, 2021

VS code is an IDE by any reasonable measure. Nano is not.

renox · on April 3, 2021

Yesterday grep didn't work because it 'autodetected' that the target file was a binary.. So I 1) cursed whoever made this non backward compatible change 2) used man to find the '-a' option..

indigodaddy · on April 3, 2021

Hah, looks like I’ve been needlessly typing quite a few extra keystrokes, as I’ve always done --binary-files=text. I should have looked at the man page more closely..
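A toy model of the behaviour being discussed, assuming the simplest possible heuristic: input containing a NUL byte counts as binary unless the caller opts out, as -a / --binary-files=text does. GNU grep's real detection is more involved; `looks_binary` and `tiny_grep` are hypothetical names.

```python
def looks_binary(data):
    """Simplified heuristic: any NUL byte marks the input as binary."""
    return b"\x00" in data

def tiny_grep(pattern, data, treat_as_text=False):
    """Return matching lines, or a one-line notice for binary input."""
    if looks_binary(data) and not treat_as_text:
        # Matches are reported but the matching lines are suppressed.
        return ["Binary file matches"] if pattern in data else []
    return [
        line.decode("utf-8", "replace")
        for line in data.split(b"\n")
        if pattern in line
    ]
```

Passing `treat_as_text=True` plays the role of -a here: the same input then yields the matching lines instead of the notice.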

dewlinedew2 · on April 3, 2021

shoulda used 'cat'

jclulow · on April 3, 2021

Using cat with arbitrary input exposes you to many terminal-side security issues. It is insufficiently complex.

cyphar · on April 3, 2021

Alternatively use "cat -v" and any terminal escape characters will be escaped.
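A sketch of what that escaping amounts to, written as an approximation of the ^ and M- notation rather than a copy of cat's source. The function name `cat_v` is made up for the example; TAB and newline are passed through unchanged.

```python
def cat_v(data):
    """Render bytes with control characters made visible (^X and M- notation)."""
    out = []
    for b in data:
        if b in (0x09, 0x0A):
            # TAB and LF are left as they are.
            out.append(chr(b))
            continue
        prefix = ""
        if b >= 0x80:
            # High-bit bytes get an M- prefix and are then treated as 7-bit.
            prefix, b = "M-", b - 0x80
        if b < 0x20:
            out.append(prefix + "^" + chr(b + 0x40))
        elif b == 0x7F:
            out.append(prefix + "^?")
        else:
            out.append(prefix + chr(b))
    return "".join(out)
```

The ESC byte that starts a terminal escape sequence comes out as the two harmless characters ^[ so the terminal never interprets the sequence.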

bawolff · on April 3, 2021

It can do some sketchy things and rewrite your terminal in weird confusing ways, but afaik most of the out-and-out malicious escape sequences have been patched out at least a decade ago.

cyberpunk · on April 3, 2021

Such as?

edgyquant · on April 3, 2021

Cat has long been known to expose you to code execution, among other things.

https://security.stackexchange.com/questions/56307/can-cat-i...

jclulow · on April 3, 2021

Any vulnerability in the escape sequence handling of the terminal emulator, and conceivably, depending on what sequences your terminal supports, access to facilities like local file generation or clipboard contents. There have been a number of issues with escape sequences injected into things people might copy and paste from a web page, or in git commit histories, that have done nefarious things.

aruggirello · on April 3, 2021

that wouldn't work nicely with the mouse :)

zrobotics · on April 3, 2021

That's what the on-screen keyboard is for. Even works with the touchscreen, no mouse required! Eminently usable.

A4ET8a8uTh0 · on April 3, 2021

I think this captures my sentiment on the matter as well. Applications today want to be a swiss army knife and do just about every job.. and do it poorly. I do expect that level of complexity from RStudio, but probably not from Notepad. I would probably kinda accept it in Notepad++.

So real question. Is TextEdit the default text editor on a Mac?

kergonath · on April 3, 2021

TextEdit is closer to WordPad than Notepad in terms of functionality. It supports rich text editing, and is more than a plain text editor.