I've been running Ubuntu Linux for a long time now (over a decade, started with 8.04). Linux still has its fair share of bugs, but I'll take having to deal with those over running Windows or MacOS any day.
For me the biggest thing is control. With Windows there are some things, like updates, that you have zero control over. It's the same issue with MacOS: you have more control than on Windows, but you're still at the whim of Apple's design choices every year when they decide to release a new OS update.
Linux, for all its issues, gives you absolute control over your system, and as a developer I've found this one feature outweighs pretty much all the issues and negatives of the OS. Updates don't run unless I tell them to run; the OS doesn't upgrade unless I tell it to. Even when it comes to bugs, at least you have the power to fix them instead of waiting on an update and hoping it will resolve the issue. Granted, in reality I wait for updates to fix various small issues, but for bigger ones that impact my workflow I will go through the trouble of fixing them myself.
I don't see regular users adopting Linux anytime soon, but I'm quickly seeing adoption pick up among the more technical community. Previously only a subset of technical folks actually ran Linux because Windows/MacOS just worked, but I see more and more of them jumping ship with how awful Windows and MacOS have become.
The control is both a blessing and a curse. It’s really easy to accidentally screw things up when e.g. trying to polish some of the rough edges or otherwise make the system function as desired. It also may not be of any help if the issue you’re facing is too esoteric for anybody else to have posted about it online (or for LLMs to be of any assistance).
It would help a lot if there were a distro that was polished and complete enough that most people – even those of us who are more technical and are more demanding – rarely if ever have any need to dive under the hood. Then the control becomes purely an asset.
I remember when Ubuntu decided to reroute apt installations into snap installs: you'd install a package via apt, and there was logic to decide whether to disregard your command and install a snap instead. Do they still do that?
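They did, at least via transitional deb packages whose whole job is to pull in the snap. If you want to check what your own release does, here's a rough sketch of one way to spot such a stub; the heuristic and the example package name are illustrative assumptions, not anything official:

    import subprocess

    # Rough sketch (assumption, not an official interface): Ubuntu's snap-backed
    # "transitional" debs tend to have a dependency list that is basically snapd,
    # plus a maintainer script that installs the snap. Checking the declared
    # dependencies is one crude way to spot them.

    def looks_like_snap_wrapper(package: str) -> bool:
        out = subprocess.run(
            ["apt-cache", "depends", package],
            capture_output=True, text=True, check=True,
        ).stdout
        return "snapd" in out

    # On releases where the firefox deb is such a stub, this prints True.
    print(looks_like_snap_wrapper("firefox"))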
Meh, I don't care much about control, I care more about getting my work done with the least amount of friction. Macs do that for me. Linux and Windows have too many barriers to make them a daily GUI driver.
> Unsure of the actual issues people run into at this point outside of very niche workflows or applications, to which, there are X11 fallbacks for.
I don't know if others have experienced this, but the biggest bug I see in Wayland right now is that sometimes, on an external monitor after waking the computer, a full-screen Electron window will crash the display (i.e. the display disconnects).
I can usually fix this by switching to another desktop and then logging out and logging back in.
Such a strange bug, because it only affects my external monitor and only affects Electron apps (I notice it with VSCode the most, but that's just because I have it running virtually 24/7).
If anyone has encountered this issue and figured out a solution, I am all ears.
This is probably worth reporting. I don't think I've ever heard of or run into something like that before. Most issues I ran into during the early rollout of Wayland desktop environments were broken or missing functionality in existing apps.
I don't live around any Amazon Fresh stores so I never saw them, though I did see the technology in use at several airports (I've never personally used it). IMO places like airports are the best fit for something like this: people are usually in a rush, so not having to wait in line to check out is nice, and you don't have to worry about security as much because everyone there is a ticketed passenger (I only saw them post-security), and even if someone did try stealing they wouldn't get very far.
I saw these in several different airports. They usually had multiple people staffed at the gates to get in and out, meanwhile most of the other snack vendors often had only a single person employed.
So you spend a few hundred thousand dollars extra on all the cameras, many millions on all the design, pay all the overseas contractors to manually review the transactions, and you still end up with twice the in-person staff of the average store in the airport.
I look at ReactOS largely as an exercise in engineering, and there's really nothing wrong with it being just that. Personally I think projects like Wine/Proton have made far more inroads in being able to run Windows software on non-Windows systems, but I still have to give props to the developers of ReactOS for sticking with it for 30 freaking years.
Yes. The unique point of ReactOS is driver compatibility. Wine is pretty great for Win32 API, Proton completes it with excellent D3D support through DXVK, and with these projects a lot of Windows userspace can run fine on Linux. Wine doesn't do anything for driver compatibility, which is where ReactOS was supposed to fill in, running any driver written for Windows 2000 or XP.
But by now, as I also wrote in the other thread on this, ReactOS should be seen as something more like GNU Hurd. An exercise in kernel development and reverse engineering, a project that clearly requires a high level of technical skill, but long past the window of opportunity for actual adoption. If Hurd had been usable by say 1995, when Linux just got started on portability, it would have had a chance. If ReactOS had been usable ten years ago, it would also have had a chance at adoption, but now it's firmly in the "purely for engineering" space.
"ReactOS should be seen as something more like GNU Hurd. An exercise in kernel development and reverse engineering, a project that clearly requires a high level of technical skill, but long past the window of opportunity for actual adoption."
I understand your angle, or rather the attempt to fit them into the same picture somehow. However, the differences between them far surpass the similarities. There was no meaningful user base for Unix/Hurd to speak of compared to the NT kernel. There's no real basis to assert the "kernel development" argument for both, as one was indeed a research project whereas the other is just a clean-room engineering march toward replicating an existing kernel. What ReactOS needs to succeed is to become more stable and complete (on the whole, not just the kernel). Once it can do that, covering the later Windows capabilities will be just a nice-to-have. Considering all the criticism the current version of Windows receives, switching to a stable and functional ReactOS, at least for individual use, becomes a no-brainer. Comparatively, there's nothing similar the Hurd kernel can do to get to where Linux is now.
Hurd was not a research project initially. It was a project to develop an actual, usable kernel for the GNU system, and it was supposed to be a free, copyleft replacement for the Unix kernel. ReactOS was similarly a project to make a usable and useful NT-compatible kernel, also as a free and copyleft replacement.
The key difference is that Hurd was not beholden to a particular architecture, it was free to do most things its own way as long as POSIX compatibility was achieved. ReactOS is more rigid in that it aims for compatibility with the NT implementation, including bugs, quirks and all, instead of a standard.
Both are long irrelevant to their original goals. Hurd because Linux is the dominant free Unix-like kernel (with the BSD kernel a distant second), ReactOS because the kernel it targets became a retrocomputing thing before ReactOS could reach a beta stage. And in the case of ReactOS, the secondary "whole system" goal is also irrelevant now because dozens of modern Linux distributions provide a better desktop experience than Windows 2000. Hell, Haiku is a better desktop experience.
"And in the case of ReactOS, the secondary «whole system» goal is also irrelevant now because dozens of modern Linux distributions provide a better desktop experience than Windows 2000. Hell, Haiku is a better desktop experience."
Yet, there are still too many desktop users who, despite the wishful thinking or blaming, still haven't switched to either Linux or Haiku. No matter how good Haiku or Linux distributions are, their incompatibility with existing Windows software simply disqualifies them as options for those desktop users. I bet we'll see people switching to ReactOS once it gets just stable enough, long before it gets as polished as either Haiku or any given quality Linux distribution.
No, people will never be switching to ReactOS. For some of the same reasons they don't switch to Linux, but stronger.
ReactOS aims to be a system that runs Windows software and looks like Windows. But, it runs software that's compatible with WinXP (because they target the 5.1 kernel) and it looks like Windows 2000 because that's the look they're trying to recreate. Plenty of modern software people want to run doesn't run on XP. Steam doesn't run on XP. A perfectly working ReactOS would already be incompatible with what current Windows users expect.
UI wise there is the same issue. Someone used to Windows 10 or 11 would find a transition to Windows 2000 more jarring than to say Linux Mint. ReactOS is no longer a "get the UI you know" proposition, it's now "get the UI of a system from twenty five years ago, if you even used it then".
"UI wise there is the same issue. Someone used to Windows 10 or 11 would find a transition to Windows 2000 more jarring than to say Linux Mint. ReactOS is no longer a «get the UI you know» proposition, it's now «get the UI of a system from twenty five years ago, if you even used it then»." "A perfectly working ReactOS would already be incompatible with what current Windows users expect."
That look and feel is the easy part; it can be addressed if it's really an issue. The hard part is compatibility (many parts are still missing) and stability (other parts are still defective). The targeted kernel matters, of course, but that is not set in stone. In fact, there is Windows Vista+ functionality being added and written about here: https://reactos.org/blogs/investigating-wddm although doing it properly would mean rewriting the kernel, bumping it to NT version 6.0.
I'm sure there will indeed be many users who find various ReactOS aspects jarring for as long as there are still defects, lack of polish, or dysfunction at the application and kernel (driver) level. However, considering the vast pool of Windows desktop users, it's reasonable to expect ReactOS to cover the limited needs of enough users at some point, which should bring attention, testing, polish, and funding to address anything still lacking, which in turn should further feed the adoption and improvement loop.
"No, people will never be switching to ReactOS. For some of the same reasons they don't switch to Linux, but stronger."
To me, this makes sense maybe for the corporate world. The reasons that made corporations stick with Windows have less to do with familiarity or application compatibility (given that a lot of corporate infrastructure is in web applications). Yes, there must be something else that governs corporate decisions, something to do with the way corporations function, and that will most likely prevent a switch to ReactOS just as it did to Linux-based distributions. But this is exactly why I intentionally specified "for individual use" when I said "switching to a stable and functional ReactOS, at least for individual use, becomes a no-brainer". For individual use, the reason that prevented people from switching to Linux is well known, and ReactOS's reason for being was aimed exactly at that.
> There was no meaningful user-base for Unix/Hurd so to speak of compared to NT kernel.
Sure, but that userbase also already has a way of using the NT kernel: Windows. The point is that both Hurd and ReactOS are trying to solve an interesting technical problem but lack any real reason to be used over the alternatives, which solve enough of the practical problems for most users.
While I think better Linux integration and improving WINE is probably better time spent... I do think there's some opportunity for ReactOS, but I feel it would have to at LEAST get to pretty complete Windows 7 compatibility (without the bug fixes since)... that seems to be the last Windows version most people remember relatively fondly, and a point before they really split-brained a lot of the configuration and settings.
With the contempt for a lot of the Win10/11 features, there's some chance it could see adoption, if that's an actual goal. But the effort is huge, and it would need to be sufficient for wide desktop installs sooner rather than later.
I think a couple of the Linux + WINE UI options, where the underlying OS is Linux and Wine is the UI/desktop layer on top (not too dissimilar from DOS/Win9x), might also gain some traction... not to mention distros that smooth out the use of WINE for new users.
Worth mentioning a lot of WINE is reused in ReactOS, so that effort is still useful and not fully duplicated.
> I do think there's some opportunity for ReactOS, but I feel it would have to at LEAST get to pretty complete Windows 7 compatibility
That's not going to happen in any way that matters. If ReactOS ever reaches Win7 compatibility, that would be at a time when Win7 is long forgotten.
The project has had a target of Windows 2000 compatibility, later changed to XP (which is a relatively minor upgrade kernel-wise). Now, as of 2026, ReactOS has limited USB 2.0 support and wholly lacks critical XP-level features like Wi-Fi, NTFS, or multicore CPU support. Development on the project has never been fast, but somewhere around 2018 it dropped even more; just looking at the commit history, there's now half the activity of a decade ago. So at current rates, it's another 5+ years away from beta-level support of NT 5.0.
ReactOS actually reaching decent Win2K/XP compatibility is a long shot but still possible. Upgrading to Win7 compatibility before Win7 itself is three plus decades old, no.
Maybe posts like this will move the needle. If I could withstand OS programming (or debugging, or...) I'd probably work on ReactOS. I did self-host it, which I didn't expect to work, so at least I know the toolchain works!
Basically, if you do the math, it means a whole generation got tired of working on the project and moved on to something else, and there is no new blood to make up for that.
That's the history of most FOSS projects after they've been running for a while.
This article goes more into the technical analysis of the stock rather than the underlying business fundamentals that would lead to a stock dump.
My 30k ft view is that the stock will inevitably slide as AI datacenter spending goes down. Right now Nvidia is flying high because datacenters are breaking ground everywhere but eventually that will come to an end as the supply of compute goes up.
The counterargument to this is that the "economic lifespan" of an Nvidia GPU is 1-3 years depending on where it's used, so there's a case to be made that Nvidia will always have customers coming back for the latest and greatest chips. The problem I have with this argument is that it's simply unsustainable to be spending that much every 2-3 years, and we're already seeing this as Google and others are extending their depreciation of GPUs to something like 5-7 years.
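On the depreciation point, the arithmetic is simple enough to sketch out; the fleet cost and the schedules below are made-up, illustrative numbers, not anyone's reported figures:

    # Back-of-the-envelope: how the chosen depreciation schedule changes the
    # annual expense booked for the same GPU fleet. All numbers are illustrative.

    FLEET_COST = 10_000_000_000  # assumed $10B of accelerators

    def annual_straight_line(cost: float, useful_life_years: int) -> float:
        """Straight-line depreciation: the same expense in every year of useful life."""
        return cost / useful_life_years

    for years in (3, 5, 7):
        expense = annual_straight_line(FLEET_COST, years)
        print(f"{years}-year schedule: ${expense / 1e9:.2f}B of depreciation per year")

    # Stretching the schedule from 3 toward 7 years roughly halves the annual
    # expense hit, which flatters reported earnings but doesn't change the cash
    # already spent or how long the chips stay competitive.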
I hear your argument, but short of major algorithmic breakthroughs I am not convinced the global demand for GPUs will drop any time soon. Of course I could easily be wrong, but regardless I think the most predictable cause for a drop in the NVIDIA price would be that the CHIPS act/recent decisions by the CCP leads a Chinese firm to bring to market a CUDA compatible and reliable GPU at a fraction of the cost. It should be remembered that NVIDIA's /current/ value is based on their being locked out of their second largest market (China) with no investor expectation of that changing in the future. Given the current geopolitical landscape, in the hypothetical case where a Chinese firm markets such a chip we should expect that US firms would be prohibited from purchasing them, while it's less clear that Europeans or Saudis would be. Even so, if NVIDIA were not to lower their prices at all, US firms would be at a tremendous cost disadvantage while their competitors would no longer have one with respect to compute.
All hypothetical, of course, but to me that's the most convincing bear case I've heard for NVIDIA.
People will want more GPUs, but will they be able to fund them? At what point do the venture capital and loans run out? People will not keep pouring hundreds of billions into this if the returns don't start coming.
There is a real chance that the Japanese carry trade will close soon, with the BoJ seeing rates move up to 4%. This means liquidity will drain from the US markets back into Japan. On the US side there is going to be a lot of inflation between money printing, refund checks, amortization changes, and a possible war footing. Who knows?
It doesn't even necessarily need to be CUDA compatible... there's OpenCL and Vulkan as well, and China will likely throw enough resources at the problem to bring the various libraries into closer alignment for ease of use/development.
I do think China is still 3-5 years from being really competitive, but even if they hit 40-50% of Nvidia's performance, depending on pricing and energy costs, they could still make significant inroads given legal pressure/bans, etc.
OpenCL is chronically undermaintained & undersupported, and Vulkan only covers a small subset of what CUDA does so far. Neither has the full support of the tech industry (though both are supported by Nvidia, ironically).
It feels like nobody in the industry wants to beat Nvidia badly enough, yet. Apple and AMD are trying to supplement raster hardware with inference silicon; both of them are afraid to implement a holistic compute architecture a-la CUDA. Intel is reinventing the wheel with OneAPI, Microsoft is doing the same with ONNX, Google ships generic software and withholds their bespoke hardware, and Meta is asleep at the wheel. All of them hate each other, none of them trust Khronos anymore, and the value of a CUDA replacement has ballooned to the point that greed might be their only motivator.
I've wanted a proper, industry-spanning CUDA competitor since high school. I'm beginning to realize it probably won't happen within my lifetime.
The modern successor to OpenCL is SYCL and there's been some limited convergence with Vulkan Compute (they're still based on distinct programming models and even SPIR-V varieties under the hood, but the distance is narrowing somewhat).
I suspect major algorithmic breakthroughs would accelerate the demand for GPUs instead of making it fall off, since the cost to apply LLMs would go down.
> The proposition that technological progress that increases the efficiency with which a resource is used tends to increase (rather than decrease) the rate of consumption of that resource.
There will always be an incentive to scale data centers. Better algorithms just mean more bang per gpu, not that “well, that’s enough now, we’ve done it”.
Even if LLMs didn't advance at all from this point onward, there's still loads of productive work that could be optimized / fully automated by them, at no worse output quality than the low-skilled humans we're currently throwing at that work.
Inference requires a fraction of the power that training does. According to the Villalobos paper, the median date is 2028. At some point we won't be training bigger and bigger models every month. We will run out of additional material to train on, things will continue commodifying, and then the amount of training happening will significantly decrease unless new avenues open for new types of models. But our current LLMs are much more compute-intensive than any other type of generative or task-specific model.
Run out of training data? They’re going to put these things in humanoids (they are weirdly cheap now) and record high resolution video and other sensor data of real world tasks and train huge multimodal Vision Language Action models etc.
The world is more than just text. We can never run out of pixels if we point cameras at the real world and move them around.
I work in robotics and I don’t think people talking about this stuff appreciate that text and internet pictures is just the beginning. Robotics is poised to generate and consume TONS of data from the real world, not just the internet.
While we may run out of human written text of value, we won't run out of symbolic sequences of tokens: we can trivially start with axioms and do random forward chaining (or random backward chaining from postulates), and then train models on 2-step, 4-step, 8-step, ... correct forward or backward chains.
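A toy sketch of that generation loop; the string-rewriting "axioms" and "rules" below are invented purely to illustrate the shape of "emit correct n-step chains, then train on them", not any real prover:

    import random

    # Made-up rewrite system: start from an axiom, apply sound rules at random,
    # and record the resulting n-step derivation as a training example.

    AXIOMS = ["A", "B"]
    RULES = {  # each rule rewrites a symbol into a longer derived expression
        "A": ["(A&B)", "(A|A)"],
        "B": ["(B|A)", "(B&B)"],
    }

    def random_chain(steps: int) -> list[str]:
        """Produce one correct forward-chaining derivation of the given length."""
        expr = random.choice(AXIOMS)
        chain = [expr]
        for _ in range(steps):
            sym = random.choice([s for s in RULES if s in expr])
            expr = expr.replace(sym, random.choice(RULES[sym]), 1)
            chain.append(expr)
        return chain

    # Curriculum of 2-step, 4-step, 8-step chains, each correct by construction.
    for n in (2, 4, 8):
        print(f"{n}-step example:", " -> ".join(random_chain(n)))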
Nobody talks about it, but ultimately the strongest driver for terascale compute will be mathematical breakthroughs in cryptography (not bruteforcing keys, but bruteforcing mathematical reasoning).
Yeah, another source of "unlimited data" is genetics. The human reference genome is about 6.5 GB, but these days, they're moving to pangenomes, wanting to map out not just the genome of one reference individual, but all the genetic variation in a clade. Depending on how ambitious they are about that "all", they can be humongous. And unlike say video data, this is arguably a language. We're completely swimming in unmapped, uninterpreted language data.
Inference leans heavily on GPU RAM and RAM bandwidth for the decode phase where an increasingly greater amount of time is being spent as people find better ways to leverage inference. So NVIDIA users are currently arguably going to demand a different product mix when the market shifts away from the current training-friendly products. I suspect there will be more than enough demand for inference that whatever power we release from a relative slackening of training demand will be more than made up and then some by power demand to drive a large inference market.
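To put rough numbers on the decode-phase point: each generated token streams (roughly) all of the active weights through memory, so per-request throughput is capped near bandwidth divided by model size. The figures below are assumptions for illustration, not vendor specs or benchmarks:

    # Rough sketch of the decode-phase bandwidth ceiling for a single request.
    # All numbers are illustrative assumptions.

    hbm_bandwidth_gb_s = 3350   # assumed memory bandwidth of one accelerator, GB/s
    model_params_b = 8          # assumed model size, billions of parameters
    bytes_per_param = 2         # fp16/bf16 weights

    model_bytes_gb = model_params_b * bytes_per_param  # ~16 GB of weights

    # Each decoded token reads roughly every weight once, so the upper bound is:
    max_tokens_per_s = hbm_bandwidth_gb_s / model_bytes_gb
    print(f"Bandwidth-bound ceiling: ~{max_tokens_per_s:.0f} tokens/s per request")

    # Batching many requests amortizes those weight reads, which is why inference
    # economics lean so hard on memory capacity and bandwidth rather than raw FLOPs.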
It isn’t the panacea some make it out to be, but there is obvious utility here to sell. The real argument is shifting towards the pricing.
> We will run out of additional material to train on
This sounds a bit silly. More training will generally result in better modeling, even for a fixed amount of genuine original data. At current model sizes, it's essentially impossible to overfit to the training data so there's no reason why we should just "stop".
You'd be surprised how quickly improvement of autoregressive language models levels off with epoch count (though, admittedly, one epoch is a LOT). Diffusion language models otoh indeed keep profiting for much longer, fwiw.
"On the other hand, training on synthetic data has shown much promise in domains where model outputs are relatively easy to verify, such as mathematics, programming, and games (Yang et al., 2023; Liu et al., 2023; Haluptzok et al., 2023)."
With the caveat that translating this success outside of these domains is hit-or-miss:
"What is less clear is whether the usefulness of synthetic data will generalize to domains where output verification is more challenging, such as natural language."
The main bottleneck for this neck of the woods will be (X := how many additional domains can be made easily verifiable). So long as (the rate of X) >> (training absorption rate), the road can be extended for a while longer.
How much of the current usage is productive work that's worth paying for vs personal usage / spam that would just drop off after usage charges come in? I imagine flooding youtube and instagram with slop videos would reduce if users had to pay fair prices to use the models.
The companies might also downgrade the quality of the models to make it more viable to provide as an ad supported service which would again reduce utilisation.
For any "click here and type into a box" job for which you'd hire a low-skilled worker and give them an SOP to follow, you can have an LLM-ish tool do it.
And probably for the slightly more skilled email jobs that have infiltrated nearly all companies too.
Is that productive work? Well if people are getting paid, often a multiple of minimum wage, then it's productive-seeming enough.
Exactly, the current spend on LLMs is based on extremely high expectations and the vendors operating at a loss. It’s very reasonable to assume that those expectations will not be met, and spending will slow down as well.
Nvidia’s valuation is based on the current trend continuing and even increasing, which I consider unlikely in the long term.
> Nvidia’s valuation is based on the current trend continuing
People said this back when Folding@Home was dominated by Team Green years ago. Then again when GPUs sold out for the cryptocurrency boom, and now again that Nvidia is addressing the LLM demand.
Nvidia's valuation is backstopped by the fact that Russia, Ukraine, China and the United States are all tripping over themselves for the chance to deploy it operationally. If the world goes to war (which is an unfortunate likelihood) then Nvidia will be the only trillion-dollar defense empire since the DoD's Last Supper.
China is restricting purchases of H200s. The strong likelihood is that they're doing this to promote their own domestic competitors. It may take a few years for those chips to catch up and enter full production, but it's hard to envision any "trillion dollar" Nvidia defense empire once that happens.
> short of major algorithmic breakthroughs I am not convinced the global demand for GPUs will drop any time soon
>> Or, you know, when LLMs don't pay off.
Heh, exactly the observation that a fanatic religious believer cannot possibly foresee. "We need more churches! More priests! Until a breakthrough in praying technique will be achieved I don't foresee less demand for religious devotion!" Nobody foresaw Nietzsche and the decline in blind faith.
But then again, like an atheist back in the day, the furious zealots would burn me at the stake if they could, for saying this. Sadly no longer possible so let them downvotes pour instead!
They aren’t yet because the big providers that paid for all of this GPU capacity aren’t profitable yet.
They continually leapfrog each other and shift customers around, which indicates that current capacity is already higher than what is required for what people actually pay for.
Yeah but OpenAI is adding ads this year for the free versions, which I'm guessing is most of their users. They are probably hedging on taking a big slice of Google's advertising monopoly-pie (which is why Google is also now all-in on forcing Gemini opt-out on every product they own, they can see the writing on the wall).
Google, Amazon, and Microsoft do a lot of things that aren't profitable in themselves. There is no reason to believe a company will kill a product line just because it makes a loss. There are plenty of other reasons to keep it running.
Do you think it's odd you only listed companies with already existing revenue streams and not companies that started with and only have generative algos as their product?
Algorithmic breakthroughs (increases in efficiency) risk Jevons Paradox. More efficient processes make deploying them even more cost effective and increases demand.
NVIDIA stock tanked in 2025 when people learned that Google used TPUs to train Gemini, which everyone in the community has known since at least 2021. So I think it's very likely that NVIDIA stock could crash for non-rational reasons.
The market is full of people trying to anticipate how other people are going to react and exploit that by getting there first.
There's a layer aimed at forecasting what that layer is going to do as well.
This was also on top of claims (Jan 2025) that Deepseek showed that "we don't actually need as much GPU, thus NVidia is less needed"; at least it was my impression this was one of the (now silly-seeming) reasons NVDA dropped then.
> I don't know if that's non-rational, or if people can't be expected to read the second sentence of an announcement before panicking.
These days you have AI bots doing sentiment-based trading.
If you ask me... all these excesses are a clear sign for one thing, we need to drastically rein in the stonk markets. The markets should serve us, not the other way around.
Google did not use TPUs for literally every bit of compute that led to Gemini. GCP has millions of high end Nvidia GPUs and programming for them is an order of magnitude easier, even for googlers.
Any claim from Google that all of Gemini (including previous experiments) was trained entirely on TPUs is a lie. What they are truthfully saying is that the final training run was done entirely on TPUs. The market shouldn't react heavily to this, but instead should react positively to the fact that Google is now finally selling TPUs externally and their fab yields are better than expected.
I really don't understand the argument that nvidia GPUs only work for 1-3 years. I am currently using A100s and H100s every day. Those aren't exactly new anymore.
It’s not that they don’t work. It’s how businesses handle hardware.
I worked at a few data centers on and off in my career. I got lots of hardware for free or on the cheap simply because the hardware was considered “EOL” after about 3 years, often when support contracts with the vendor ends.
There are a few things to consider.
Aging hardware produces more errors, and those errors cost, one way or another.
Rack space is limited. A perfectly fine machine that consumes 2x the power for half the output costs you money; it's cheaper to upgrade a perfectly working system simply because the replacement performs better per watt in the same space.
Lastly, there are tax implications in buying new hardware that can often favor replacement.
I agree that there is a lot of hyperbole thrown around here, and it's possible to keep using some hardware for a long time or to sell it and recover some cost, but my experience planning compute at large companies is that spending money on hardware and upgrading can often save money long term.
Even assuming your compute demands stay fixed, it's possible that a future generation of accelerator will be sufficiently more power/cooling efficient for your workload that upgrading is a positive return on investment, more so when you take into account that you can start depreciating them again.
If your compute demands aren't fixed you have to work around limited floor space/electricity/cooling capacity/network capacity/backup generators/etc and so moving to the next generation is required to meet demand without extremely expensive (and often slow) infrastructure projects.
Sure, but I don't think most people here are objecting to the obvious "3 years is enough for enterprise GPUs to become totally obsolete for cutting-edge workloads" point. They're just objecting to the rather bizarre notion that the hardware itself might physically break in that timeframe. Now, it would be one thing if that notion was supported by actual reliability studies drawn from that same environment - like we see for the Backblaze HDD lifecycle analyses. But instead we're just getting these weird rumors.
I agree that that is a strange notion which would require some evidence, and I do see it in some other threads, but looking at the parent comments going up, it seems people are discussing economic usefulness, so that is what I'm responding to.
A toy example: NeoCloud Inc builds a new datacenter full of the new H800 GPUs. It rents out a rack of them for $10/minute while paying $6/minute for electricity, interest, loan repayment, rent and staff.
Two years later, the H900 is released for a similar price but performs twice as many TFLOPs per watt. Now any datacenter using the H900 can offer the same performance as NeoCloud Inc at $5/minute, taking all their customers.
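Spelling out the toy example's arithmetic (same made-up numbers as above, plus an assumed market price once the newer parts are widely deployed):

    # Toy margin math for the NeoCloud example; all figures are the made-up ones above.

    revenue_per_min = 10.0  # what NeoCloud charges for a rack of the older GPUs
    cost_per_min = 6.0      # electricity, interest, loan repayment, rent, staff

    margin_before = revenue_per_min - cost_per_min   # $4/min while unchallenged

    # A rival with 2x TFLOPs/watt can sell the same compute for roughly half the
    # price and still cover similar costs, so the market price falls to ~$5/min.
    competitive_price = 5.0
    margin_after = competitive_price - cost_per_min  # -$1/min if NeoCloud matches it

    print(f"Margin before the new generation: ${margin_before:+.2f}/min")
    print(f"Margin once rivals deploy it:     ${margin_after:+.2f}/min")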
It's because they run 24/7 in a challenging environment. They will start dying at some point and if you aren't replacing them you will have a big problem when they all die en masse at the same time.
These things are like cars, they don't last forever and break down with usage. Yes, they can last 7 years in your home computer when you run it 1% of the time. They won't last that long in a data center where they are running 90% of the time.
A makeshift cryptomining rig is absolutely a "challenging environment" and most GPUs by far that went through that are just fine. The idea that the hardware might just die after 3 years' usage is bonkers.
Crypto miners undervolt GPUs for efficiency, and in general crypto mining is extremely lightweight on GPUs compared to AI training or inference at scale.
With good enough cooling they can run indefinitely! The vast majority of failures are either at the beginning due to defects or at the end due to cooling. It's as if the idea that hardware with no moving parts (except the HVAC) is somehow unreliable came out of thin air!
There’s plenty on eBay? But at the end of your comment you say “a rate cheaper than new” so maybe you mean you’d love to buy a discounted one. But they do seem to be available used.
Do you know how support contract lengths are determined? Seems like a path to force hardware refreshes with boilerplate failure data carried over from who knows when.
The common factoid raised in financial reports is GPUs used in model training will lose thermal insulation due to their high utilization. The GPUs ostensibly fail. I have heard anecdotal reports of GPUs used for cryptocurrency mining having similar wear patterns.
I have not seen hard data, so this could be an oft-repeated, but false fact.
It's the opposite, actually: most GPUs used for mining are run at a consistent temp and load, which is good for longevity. Peaky loads, where the GPU goes from cold to hot and back, lead to more degradation because of changes in thermal expansion. This has been known for some time now.
That is a commonly repeated idea, but it doesn't take into account the countless token farms that are smaller than a datacenter: basically anything from a single motherboard with 8 cards to a small shed full of rigs, all of which tend to disregard common engineering practices and run hardware into the ground to maximize output until the next police raid or difficulty bump. There are plenty of photos on the internet of crappy rigs like that, and no one guarantees where any given used GPU came from.
Another commonly forgotten issue is that many electrical components are rated by hours of operation, and cheaper boards tend to have components with smaller tolerances. That rated lifetime is actually a graph, where hours decrease with higher temperature. There have been instances of batches of cards failing due to failing MOSFETs, for example.
While I'm sure there are small amateur setups done poorly that push cards to their limits, this seems like a rarer and less efficient use. GPUs (even used) are expensive, and running them at maximum would mean large costs and time spent replacing them regularly, not to mention the increased cost of cooling and power.
Not sure I understand the police raid mentality - why are the police raiding amateur crypto mining setups ?
I can totally see cards used by casual amateurs being very worn / used though - especially your example of single mobo miners who were likely also using the card for gaming and other tasks.
I would imagine that anyone purposely running hardware into the ground would be running cheaper / more efficient ASICs rather than expensive Nvidia GPUs, since they are much easier and cheaper to replace. I would still be surprised, however, if most were not prioritising temps and cooling.
Miners usually don't overclock though. If anything underclocking is the best way to improve your ROI because it significantly reduces the power consumption while retaining most of the hashrate.
Exactly - more specifically undervolting. You want the minimum volts going to the card with it still performing decently.
Even in amateur setups the amount of power used is a huge factor (because of the huge draw from the cards themselves and AC units to cool the room) so minimising heat is key.
From what I remember most cards (even CPUs as well) hit peak efficiency when undervolted and hitting somewhere around 70-80% max load (this also depends on cooling setup). First thing to wear out would probably be the fan / cooler itself (repasting occasionally would of course help with this as thermal paste dries out with both time and heat)
The only amateurs I know doing this are trying to heat their garage for free. So long as the heat gain is paid for, they can afford to heat an otherwise unheated building.
> I have heard anecdotal reports of GPUs used for cryptocurrency mining having similar wear patterns.
If this were anywhere close to a common failure mode, I'm pretty sure we'd know that already, given how crypto mining GPUs were usually run to the max in makeshift settings with woefully inadequate cooling and environmental control. The overwhelming anecdotal evidence from people who have bought them is that even a "worn" crypto GPU is absolutely fine.
I can't confirm that fact, but it's important to acknowledge that consumer usage is very different from the high continuous utilization in mining and training. It is credible that the wear on cards under such extreme usage is as high as reported: consumers may use their cards at peak maybe 5% of waking hours, while these cards run near 100%, and a lifespan drop-off of only about 3x is a believable scale for that kind of endurance loss.
1-3 years is too short, but they aren't making new A100s. There are 8 in a server, and when one goes bad, what do you do? You won't be able to renew a support contract. If you want to DIY, eventually you have to start consolidating pick-and-pulls. Maybe the vendors will buy them back from people who want to upgrade and resell them. This is the issue we are seeing with A100s, and we are trying to see what our vendor will offer for support.
Margins are typically not so razor thin that you cannot operate with technology from one generation ago. 15 vs 17 mpg is going to add up over time, but for a taxi company it's probably not a lethal situation to be in.
At least with crypto mining this was the case. Hardware from 6 months ago is useless ewaste because the new generation is more power efficient. All depends on how expensive the hardware is vs the cost of power.
And yet airlines aren't running planes and engines all from 2023 or later. See the MD-11 that crashed in Louisville: nobody has made a new MD-11 in over 20 years. Planes move to less competitive routes, change carriers, and eventually might even stop carrying people and switch to cargo, but a plane doesn't drop to zero value when the new one comes out. An airline will want to replace its planes, but a new plane isn't fully amortized in a year or three; it still has value for quite a while.
If a taxi company did that every year, they'd be losing a lot of money. Of course new cars and cards are cheaper to operate than old ones, but is that difference enough to offset buying a new one every one to three years?
>If a taxi company did that every year, they'd be losing a lot of money. Of course new cars and cards are cheaper to operate than old ones, but is that difference enough to offset buying a new one every one to three years?
That's where the analogy breaks. There are massive efficiency gains from new process nodes, which new GPUs use. Efficiency improvements for cars are glacial, aside from "breakthroughs" like hybrid/EV cars.
>offset buying a new one every one to three years?
Isn't that precisely how leasing works? Also, don't companies prefer not to own hardware for tax purposes? I've worked for several places where they leased compute equipment with upgrades coming at the end of each lease.
Who wants to buy GPUs that were redlined for three years in a data center? Maybe there's a market for those, but most people already seem wary of lightly used GPUs from other consumers, let alone GPUs that were burning in a crypto farm or AI data center for years.
Who cares? That's the beauty of the lease: once it's over, the old and busted gets replaced with new and shiny. What the leasing company does with it is up to them. It becomes one of those YP-not-an-MP situations with deprecated equipment.
The leasing company cares; the lease terms depend on the answer. That is why I can lease a car for 3 years for the same payment as a 6-year loan (more or less): the lease company expects someone will want it afterwards. If there is no market for it, they will still lease it, but the cost goes up.
Depends on the price, of course. I'm wary of paying 50% of new for something run hard 3 years. Seems an NVIDIA H100 is going for $20k+ on EBay. I'm not taking that risk.
That works either because someone wants to buy the old hardware from the manufacturer/lessor, or because the hardware is EOL in 3 years and it's easier to let the lessor deal with recycling / valuable parts recovery.
You can sell the old, less efficient GPUs to folks who will be running them with markedly lower duty cycles (so, less emphasis on direct operational costs), e.g. for on-prem inference or even just typical workstation/consumer use. It ends up being a win-win trade.
Building a new data center and getting power for it takes years if you want to double your capacity. Swapping out a rack for one that is twice as fast takes very little time in comparison.
Depends on the rate of growth of the hardware. If your data center is full and fully booked, and hardware is doubling in speed every year, it's cheaper to switch it out every couple of years.
Both companies bought a set of taxis in the past. Presumably at the same time if we want this comparison to be easy to understand.
If company A still has debt from that, company B has that much debt plus more debt from buying a new set of taxis.
Refreshing your equipment more often means that you're spending more per year on equipment. If you do it too often, then even if the new equipment is better you lose money overall.
If company B wants to undercut company A, their advantage from better equipment has to overcome the cost of switching.
> They both refresh their equipment at the same rate.
I wish you'd said that upfront. Especially because the comment you replied to was talking about replacing at different rates.
So in your version, if companies A and B are refreshing at the same rate, that means six months before B's refresh, company A had the newer taxis. You implied they were charging similar amounts at that point, so company A was making bigger profits, and had been for a significant time. So when company B is able to cut prices 5%, company A can survive just fine. They don't need to rush into a premature upgrade that costs a ton of money; they can upgrade on their normal schedule.
TL;DR: six months ago company B was "no longer competitive" and they survived. The companies are taking turns having the best tech. It's fine.
(1) We simply don't know what the useful life is going to be, because of how new AI-focused GPUs used for training and inference are.
(2) Warranties and service. Most enterprise hardware has service contracts tied to purchases. I haven't seen anything publicly disclosed about what these contracts look like, but the speculation is that they are much more aggressive (3 years or less) than typical enterprise hardware contracts (Dell, HP, etc.). If it gets past those contracts the extended support contracts can typically get really pricey.
(3) Power efficiency. If new GPUs are more power efficient, this could mean huge savings on energy, which could justify upgrades.
Nvidia is moving to a 1-year release cycle for data center products, and in Jensen's words, once a new generation is released you lose money by staying on the older hardware. It no longer makes financial sense to run it.
Companies can’t buy new Nvidia GPUs because their older Nvidia GPUs are obsolete. However, the old GPUs are only obsolete if companies buy the new Nvidia GPUs.
Based on my napkin math, an H200 needs to run for 4 years straight at maximum power (10.2 kW) to consume its own price of $35k worth of energy (based on 10 cents per kWh).
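Checking that napkin math with the same assumed inputs (10.2 kW looks more like a full 8-GPU server's power budget than a single card's, but taking the numbers as given):

    # Reproducing the napkin math above; all inputs are the assumptions stated there.

    power_kw = 10.2      # assumed sustained max power draw
    price_usd = 35_000   # assumed purchase price
    usd_per_kwh = 0.10   # assumed electricity price

    hours_per_year = 24 * 365
    annual_energy_cost = power_kw * hours_per_year * usd_per_kwh  # ~$8,935/year
    years_to_match_price = price_usd / annual_energy_cost

    print(f"Energy cost per year: ${annual_energy_cost:,.0f}")
    print(f"Years at full power to equal the purchase price: {years_to_match_price:.1f}")
    # ~3.9 years, i.e. roughly the "4 years straight" figure above.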
If power is the bottleneck, it may make business sense to rotate to a GPU that better utilizes the same power if the newer generation gives you a significant advantage.
I think the story is less about the GPUs themselves, and more about the interconnects for building massive GPU clusters. Nvidia just announced a massive switch for linking GPUs inside a rack. So the next couple of generations of GPU clusters will be capable of things that were previously impossible or impractical.
This doesn't mean much for inference, but for training, it is going to be huge.
From an accounting standpoint, it probably makes sense to have their depreciation be 3 years. But yeah, my understanding is that either they have long service lives, or the customers sell them back to the distributor so they can buy the latest and greatest. (The distributor would sell them as refurbished)
> My 30k ft view is that the stock will inevitably slide as AI datacenter spending goes down.
Their stock trajectory started with one boom (cryptocurrencies) and then seamlessly progressed to another (AI). You're basically looking at a decade of "number goes up". So yeah, it will probably come down eventually (or the inflation will catch up), but it's a poor argument for betting against them right now.
Meanwhile, the investors who were "wrong" anticipating a cryptocurrency revolution and who bought NVDA have not much to complain about today.
Personally I wonder whether, even if the LLM hype dies down, we'll get a new boom in AI for robotics and the "digital twin" technology Nvidia has been hyping up to train them. That's going to need GPUs for both the ML component and 3D visualization. Robots haven't yet had their SD 1.1 or GPT-3 moment; we're still in the early days of Pythia, GPT-J, AI Dungeon, etc., in LLM terms.
That's going to tank the stock price though as that's a much smaller market than AI, though it's not going to kill the company. Hence why I'm talking about something like robotics which has a lot of opportunity to grow and make use of all those chips and datacenters they're building.
Now there's one thing with AR/VR that might need this kind of infrastructure though and that's basically AI driven games or Holodeck like stuff. Basically have the frames be generated rather than modeled and rendered traditionally.
Nvidia's not your average bear, they can walk and chew bubblegum at the same time. CUDA was developed off money made from GeForce products, and now RTX products are being subsidized by the money made on CUDA compute. If an enormous demand for efficient raster compute arises, Nvidia doesn't have to pivot much further than increasing their GPU supply.
Robotics is a bit of a "flying car" application that gets people to think outside the box. Right now, both Russia and Ukraine are using Nvidia hardware in drones and cruise missiles and C2 as well. The United States will join them if a peer conflict breaks out, and if push comes to shove then Europe will too. This is the kind of volatility that crazy people love to go long on.
That's the rub - it's clearly overvalued and will readjust... the question is when. If you can figure out when precisely then you've won the lottery, for everyone else it's a game of chicken where for "a while" money that you put into it will have a good return. Everyone would love if that lasted forever so there is a strong momentum preventing that market correction.
It was overvalued when crypto was happening too, but another boom took its place. Of course, lightning rarely strikes twice and all that, but it seems to prove that overvalued doesn't mean the price is guaranteed to go down. Predicting the future is hard.
If there was anything I was going to bet against between 2019 and now, it was Nvidia... and wow, it's wild how much it went in the opposite direction.
I do wonder what people back then would have thought the reasoning for an increase in value this large would be; they'd probably just assume it was still crypto-related.
Crypto & AI can both be linked to part of a broader trend though, that we need processors capable of running compute on massive sets of data quickly. I don't think that will ever go down, whether some new tech emerges or we just continue shoveling LLMs into everything. Imagine the compute needed to allow every person on earth to run a couple million tokens through a model like Anthropic Opus every day.
Agree on looking at the company-behind-the-numbers. Though presumably you're aware of the Efficient Market Hypothesis. Shouldn't "slowed down datacenter growth" be baked into the stock price already?
If I'm understanding your prediction correctly, you're asserting that the market thinks datacenter spending will continue at this pace indefinitely, and you yourself uniquely believe that to be not true. Right? I wonder why the market (including hedge fund analysis _much_ more sophisticated than us) should be so misinformed.
Presumably the market knows that the whole earth can't be covered in datacenters, and thus has baked that into the price, no?
I saw a $100 bill on the ground. I nearly picked it up before I stopped myself. I realised that if it was a genuine currency note, the Efficient Market would have picked it up already.
> This article goes more into the technical analysis of the stock rather than the underlying business fundamentals that would lead to a stock dump.
> My 30k ft view is that the stock will inevitably slide as AI datacenter spending goes down.
Actually "technical analysis" (TA) has a very specific meaning in trading: TA is using past prices, volume of trading and price movements to, hopefully, give probabilities about future price moves.
But TFA doesn't do that at all: it goes in detail into one pricing model formula/method for options pricing. In the typical options pricing model all you're using is current price (of the underlying, say NVDA), strike price (of the option), expiration date, current interest rate and IV (implied volatility: influenced by recent price movements but independently of any technical analysis).
Be it Black-Scholes-Merton (european-style options), Bjerksund-Stensland (american-style options), binomial as in TFA, or other open options pricing model: none of these use technical analysis.
Here's an example (for european-style options) where one can see the parameters:
You can literally compute entire options chains with these parameters.
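For anyone who wants to play with one, a minimal Cox-Ross-Rubinstein binomial pricer looks roughly like the sketch below; the inputs at the bottom are placeholders, not the article's figures or real NVDA market data:

    import math

    def binomial_call(S, K, T, r, sigma, steps=500, american=False):
        """Cox-Ross-Rubinstein binomial price of a call.

        S: spot, K: strike, T: years to expiry, r: risk-free rate, sigma: implied vol.
        """
        dt = T / steps
        u = math.exp(sigma * math.sqrt(dt))    # up factor per step
        d = 1 / u                              # down factor per step
        p = (math.exp(r * dt) - d) / (u - d)   # risk-neutral up probability
        disc = math.exp(-r * dt)

        # payoff at expiry for every terminal node
        values = [max(S * u**j * d**(steps - j) - K, 0.0) for j in range(steps + 1)]

        # roll the values back through the tree
        for i in range(steps - 1, -1, -1):
            for j in range(i + 1):
                cont = disc * (p * values[j + 1] + (1 - p) * values[j])
                if american:
                    cont = max(cont, S * u**j * d**(i - j) - K)  # early exercise check
                values[j] = cont
        return values[0]

    # Placeholder inputs, not real market data:
    print(round(binomial_call(S=180, K=200, T=0.5, r=0.04, sigma=0.55), 2))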
Now, it's known for a fact that many professional trading firms have their own options pricing methods and will arb when they think they've found incorrectly priced options. I don't know whether some use actual forms of TA that they then mix with their options pricing models or not.
> My 30k ft view is that the stock will inevitably slide as AI datacenter spending goes down.
No matter if you're right or not, I'd argue you're doing what's called fundamental analysis (but I may be wrong).
P.S.: I'm not debating the merits of TA and whether it's reading tea leaves or not. What I'm saying is that options pricing using the binomial method cannot be called "technical analysis", because TA is something else.
I'll also point out there were insane takes a few years ago before nVidia's run up based on similar technical analysis and very limited scope fundamental analysis.
Technical analysis fails completely when there's an underlying shift that moves the line. You can't look at the past and say "nvidia is clearly overvalued at $10 because it was $3 for years earlier" when they suddenly and repeatedly 10x earnings over many quarters.
I couldn't get through to the idiots on reddit.com/r/stocks about this when there was non-stop negativity on Nvidia based on technical analysis and very narrowly scoped fundamental analysis. They showed a 12x gain in quarterly earnings at the time, but the PE (which looks at past quarters only) was 260x due to this sudden change in earnings, and pretty much all of Reddit couldn't get past this.
I did well on this yet there were endless posts of "Nvidia is the easiest short ever" when it was ~$40 pre-split.
The large api/token providers, and large consumers are all investing in their own hardware. So, they are in an interesting position where the market is growing, and NVIDIA is taking the lion's share of enterprise, but is shrinking at the hyperscaler side (google is a good example as they shift more and more compute to TPU). So, they have a shrinking market share, but its not super visible.
> The large api/token providers, and large consumers are all investing in their own hardware.
Which is absolutely the right move when your latest datacenter's power bill is literally measured in gigawatts. Power-efficient training/inference hardware simply does not look like a GPU at a hardware design level (though admittedly, it looks even less like an ordinary CPU), it's more like something that should run dog slow wrt. max design frequency but then more than make up for that with extreme throughput per watt/low energy expense per elementary operation.
The whole sector of "neuromorphic" hardware design has long shown the broad feasibility of this (and TPUs are already a partial step in that direction), so it looks like this should be an obvious response to current trends in power and cooling demands for big AI workloads.
I'm no AI fanboy at all. I don't think there will be AGI anytime soon.
However, it’s beyond my comprehension how anyone would think that we will see a decline in demand growth for compute.
AI will conquer the world like software or the smartphone did. It’ll get implemented everywhere, more people will use it. We’re super early in the penetration so far.
At this point computation is in essence a commodity, and commodities have demand cycles. If other economic factors slow down or companies go out of business, they stop using compute or start fewer new products that use compute. Thus it is entirely realistic to me that demand for compute might go down, or that we are simply over-provisioning compute in the short or medium term.
I wonder, is the quality of AI answers going up over time or not? Last weekend I spent a lot of time with Perplexity trying to understand why my SeqTrack device didn't do what I wanted it to do, and it seems Perplexity had a wrong idea of how the buttons on the device are laid out, so it gave me wrong or confusing answers. I spent literally hours trying to feed it different prompts to get an answer that would solve my problem.
If it had given me the right, easy-to-understand answer right away, I would have spent 2 minutes of both MY time and ITS time. My point is that if AI improves, we will need less of it to get our questions answered. Or perhaps AI usage goes up as its answers improve?
The problem is its inability to say "I don't know". As soon as you reach the limits of the model's knowledge, it will readily start fabricating answers.
Both true. Perplexity knows a lot about SeqTrack, I assume it has read the UserGuide. But some things it gets wrong, seems especially things it should understand by looking at the pictures.
I'm just wondering if there's a clear path for it to improve and on what time-table. The fact that it does not tell you when it is "unsure" of course makes things worse for users. (It is never unsure).
Always worth trying a different model, especially if you're using a free one. I wouldn't take one data point too seriously either.
The data is very strongly showing the quality of AI answers is rapidly improving. If you want a good example, check out the sixty symbols video by Brady Haran, where they revisited getting AI to answer a quantum physics exam after trying the same thing 3 years ago. The improvement is IMMENSE and unavoidable.
With vision models (SOTA models like Gemini and ChatGPT can do this), you can take a picture/screenshot of the button layout, upload it, and have it work from that. Feeding it current documentation (eg a pdf of a user manual) helps too.
Referencing outdated documentation or straight up hallucinating answers is still an issue. It is getting better with each model release though
More so, I meant to think of oil, copper, and now silver: their prices all follow demand, and all have had varying prices at different times. Compute shouldn't really be that different.
But yes. Cisco's value dropped when there was not same amount to spend on networking gear. Nvidia's value will drop as there is not same amount of spend on their gear.
Other impacted players in actual economic downturn could be Amazon with AWS, MS with Azure. And even more so those now betting on AI computing. At least general purpose computing can run web servers.
Even suggesting that computers will replace human brains brings up a moral and ethical question. If the computer is just as smart as a person, then we need to potentially consider that the computer has rights.
As far as AI conquering the world. It needs a "killer app". I don't think we'll really see that until AR glasses that happen to include AI. If it can have context about your day, take action on your behalf, and have the same battery life as a smartphone...
I don't see this as fanaticism at all. No one could have predicted a billion people mindlessly scrolling TikTok in 2007. This is going to happen again, only 10x: faster and more addictive, with content generated on the fly to be so addictive you won't be able to look away.
What if its penetration ends up being on the same level as modern crypto? The average person doesn't seem to particularly care about meme coins or bitcoin - it is not being actively used in day-to-day settings, and there are no signs of that status improving.
That doesn't mean crypto is not being used, of course. Plenty of people do use things like USDT, gamble on bitcoin, or try to scam people with new meme coins, but this is far from what crypto enthusiasts and NFT moguls promised us in their feverish posts back in the mid-2010s.
So imagine that AI is here to stay, but the absolutely unhinged hype train slows down and we settle into some kind of equilibrium of practical use.
I still have not been able to see how folks connect AI to crypto. Crypto never connected with real use cases. There are some edge cases and people do use it, but there is no core use.
AI is different, and businesses are already using it a lot. Of course there is hype, and it's not doing all the things the talking heads said, but that doesn't mean immense value is not being generated.
It's an analogy; it doesn't have to map 1:1 to AI. The point is that the current situation around AI looks kind of similar to the situation and level of hype around crypto when it was still growing: all the "ledger" startups, promises of decentralization, NFTs in video games, and so on. We are somewhere around that point when it comes to AI.
No, it's an absolutely ridiculous comparison that people continue to make even though AI has long since surpassed the usefulness of crypto, and at an alarming speed. AI has unlocked so many projects my team would never have tackled before.
Anecdotally, many non-technical users or "regular joes", as it were, that I know who were very enthusiastic about AI a year ago are now disengaging, with the rate really picking up over the last couple of months.
Their usage has declined primarily with OpenAI and Gemini tools; no one has mentioned Anthropic-based models, but I don't think normies know they exist, honestly.
The disengagement seems to be that with enough time and real-world application, the shortcomings have become more noticeable, and the patience they once had for incorrect or unreliable output has effectively evaporated, in some cases to the point where it's starting to outweigh any gains they get.
Not all of the normies I know, to be fair, but a surprising number given the strangely quiet period in between "This is amazing!" and "eh, it's not as good as I thought it was at first."
This seems to take for granted that China and their foundries and engineering teams will never catch up. This seems foolish. I'm working under the assumption that sometime in the next ten years some Chinese company will have a breakthrough and either meet Nvidia's level or leapfrog them. Then the market will flood with great, cheap chips.
> The problem I have with this argument is that it's simply unsustainable to be spending that much every 2-3 years
Isn’t this entirely dependent on the economic value of the AI workloads? It all depends on whether AI work is more valuable than that cost. I can easily see arguments why it won’t be that valuable, but if it is, then that cost will be sustainable.
100% this. All of this spending is predicated on a stratospheric ROI on AI investments at the proposed investment levels. If that doesn't pan out, we'll see a lot of people left holding the bag, including chip fabs, designers like Nvidia, and of course anyone who ponied up for that much compute.
I’m sad about Grok going to them, because the market needs the competition. But ASIC inference seems to require a simpler design than training does, so it’s easier for multiple companies to enter. It seems inevitable that competition emerges. And eg a Chinese company will not be sold to Nvidia.
What’s wrong with this logic? Any insiders willing to weigh in?
I'm not an insider, but ASICs come with their own suite of issues and might be obsolete if a different architecture becomes popular. They'll have a much shorter lifespan than Nvidia hardware in all likelihood, and will probably struggle to find fab capacity that puts them on equal footing in performance. For example, look at the GPU shortage that hit crypto despite hundreds of ASIC designs existing.
The industry badly needs to cooperate on an actual competitor to CUDA, and unfortunately they're more hostile to each other today than they were 10 years ago.
You can build ASICs to be a lot more energy efficient than current GPUs, especially if your power budget is heavily bound by raw compute as opposed to data movement bandwidth. The tradeoff is much higher latency for any given compute throughput, but for workloads such as training or even some kinds of "deep thinking inference" you don't care much about that.
Even though I like CUDA, I think the real question is when compute centers reach the point that they can run their workloads on other vendors' hardware or custom accelerators.
I think the way to think about the AI bubble is that we're somewhere in 97-99 right now, heading toward the dotcom crash. The dotcom crash didn't kill the web, it kept growing in the decades that followed, influencing society more and more. But the era where tons of investments were uncritically thrown at anything to do with the web ended with a bang.
When the AI bubble bursts, it won't stop the development of AI as a technology. Or its impact on society. But it will end the era of uncritically throwing investments at anyone that works "AI" into their pitch deck. And so too will it end the era of Nvidia selling pickaxes to the miners and being able to reach soaring heights of profitability born on wings of pretty much all investment capital in the world at the moment.
Bubble or not, it's simply strange to me that people confidently put a timeline on it. Naming the phases of the bubble and calling when they will collapse just seems contrary to what a bubble is. Brad Gerstner was the first "influencer" I heard making these claims of a bubble timeline. It just seems downright absurd.
Fundamental analysis is great! But I have trouble answering concrete questions of probability with it.
How do you use fundamental analysis to assign a probability to Nvidia closing under $100 this year, and what probability do you assign to that outcome?
I'd love to hear your reasoning around specifics to get better at it.
GP was presenting fundamental analysis as an alternative to the article's method for answering the question, but then never answered the question.
This is a confusion I have around fundamental analysis. Some people appear to do it very well (Buffett?), but most of its proponents only use it to ramble about possibilities without making any forecasts specific enough to be verifiable.
I think the idea of fundamental analysis is that you focus on return on equity and see whether that valuation is appreciably more than the current price (as opposed to assigning a probability).
Well, not to be too egregiously reductive… but when the M2 money supply spiked in the 2020 to 2022 timespan, a lot of new money entered the middle class. That money was then funneled back into the hands of the rich through “inflation”. That left the rich with a lot of spare capital to invest in finding the next boom. Then AI came along.
Once the money dries up, a new bubble will be invented to capture middle-class income, like NFTs and crypto before it, and commission-free stocks before that, etc.
It’s not all pump-and-dump. Again, this is a pretty reductive take on market forces. I’m just saying I don’t think it’s quite as unsustainable as you might think.
Add in the fact that companies seriously invested in AI (and similar workloads typically reliant on GPUs) are also investing more in bespoke accelerators, and the math for nVidia looks particularly grim. Google's TPUs set them apart from the competition, as does Apple's NPU; it's reasonable to assume firms like Anthropic or OpenAI are also investigating or investing in similar hardware accelerators. After all, it's easier to lock in customers if your models cannot run on "standard" kit like GPUs and servers, even if it's also incredibly wasteful.
The math looks bad regardless of which way the industry goes, too. A successful AI industry has a vested interest in bespoke hardware to build better models, faster. A stalled AI industry would want custom hardware to bring down costs and reduce external reliance on competitors. A failed AI industry needs no GPUs at all, and an inference-focused industry definitely wants custom hardware, not general-purpose GPUs.
So nVidia is capitalizing on a bubble, which you could argue is the right move under such market conditions. The problem is that they're also alienating their core customer base (smaller datacenters, HPC, the gaming market) in the present, which will impact future growth. Their GPUs are scarce and overpriced relative to performance, which itself has remained a near-direct function of increased power input rather than efficiency or meaningful improvements.
Their software solutions - DLSS frame generation, ray reconstruction, etc. - are locked to their cards, but competitors can and have made equivalent-performing solutions of their own with varying degrees of success. This means it's no longer necessary to have an nVidia GPU to, say, crunch scientific workloads or render UHD game experiences, which in turn means we can utilize cheaper hardware for similar results. Rubbing salt in the wound, they're making cards even more expensive by unbundling memory and clamping down on AIB designs.
Their competition - Intel and AMD primarily - are happily enjoying the scarcity of nVidia cards and reaping the fiscal rewards, however meager they are compared to AI at present. AMD in particular is sitting pretty, powering four of the five present-gen consoles, the Steam Deck (and copycats), and the Steam Machine, not to mention outfits like Framework; if you need a smol but capable boxen on the (relative) cheap, what used to be nVidia + ARM is now just AMD (and soon, Intel, if they can stick the landing with their new iGPUs).
The business fundamentals paint a picture of cannibalizing one's evergreen customers in favor of repeated fads (crypto and AI), and years of doing so have left those customer markets devastated and bitter at nVidia's antics. Short of a new series of GPUs with immense performance gains at lower price and power points, with availability to meet demand, my personal read is that this is merely Jensen Huang's explosive send-off before handing the bag over to some new sap (and shareholders) once the party inevitably ends, one way or another.
> My 30k ft view is that the stock will inevitably slide as AI datacenter spending goes down. Right now Nvidia is flying high because datacenters are breaking ground everywhere but eventually that will come to an end as the supply of compute goes up.
Exactly, it is currently priced as though infinite GPUs are required indefinitely. Eventually most of the data centres and the gamers will have their GPUs, and demand will certainly decrease.
Before that, though, the data centres will likely fail to be built in full. Investors will eventually figure out that LLMs are still not profitable, no matter how many data centres you build. People are only interested in the derivative products at prices lower than they cost to run. The math ain't mathin'.
The longer it takes to get them all built, the more exposed they all are. Even if it turns out to be profitable, taking three years to build a data centre rather than one year is significant, as profit for these high-tech components falls off over time. And how many AI data centres do we really need?
I would go further and say that these long and complex supply chains are quite brittle. In 2019, a 13-minute power cut caused the loss of 10 weeks of memory stock [1]. Normally, shops and warehouses act as a capacitor and can absorb small supply-chain ripples. But now that these components are being piped straight to data centres, they are far more sensitive to blips. What about a small issue in the silicon that means you damage large amounts of your stock trying to run it at full power, through something like electromigration [2]? Or a random war...?
> The counterargument to this is that the "economic lifespan" of an Nvidia GPU is 1-3 years depending on where it's used so there's a case to be made that Nvidia will always have customers coming back for the latest and greatest chips. The problem I have with this argument is that it's simply unsustainable to be spending that much every 2-3 years and we're already seeing this as Google and others are extending their depreciation of GPU's to something like 5-7 years.
Yep. Nothing about this adds up. Existing data centres with proper infrastructure are being forced to extend the use of previously uneconomical hardware because new data centres currently building infrastructure have run prices up so high. If Google really thought this new hardware was going to be so profitable, they would have bought it all up.
I wouldn't call Omarchy "mainstream". Yes, it's very popular among developers, but that's about it, and under the hood it uses some pretty non-mainstream components like the Hyprland WM.
I would argue the OS closest to "mainstream Linux" is Ubuntu or Fedora with the GNOME DE. GNOME has many, many faults, but it's probably the closest DE you're going to get to what Windows and MacOS have.
I'll give one of the more mainstream ones a try when I have a free afternoon. The frustrating thing was that it wasn't underpowered at all; this was with an RTX 3090, so investing in that is very concerning. I perhaps wrongly assumed Wayland etc. would have a feel similar to Mac Quartz Composer fluidity by now.
For all of GNOME's faults, it's provided me a much better experience than other DEs. XFCE and others don't handle fractional scaling nearly as gracefully as GNOME does. KDE is probably the closest, but you still have the issue of running GTK/Qt apps that all look very different and jarring on the desktop.
The article makes it seem worse than it really is. All they seem to be doing is moving that functionality from being the default to an option that you enable.
Personally I heavily rely on middle-click to paste, especially with my docker workflows. Rather than having to press CTRL+SHIFT+C and then CTRL+SHIFT+V every time, I just know whatever is highlighted will get pasted when I hit the middle mouse button. It's a subtle difference that saves maybe 1-2 seconds, but combine that over the course of months and all of a sudden I've saved myself an hour with more efficient copy/paste.
Well, the source article is from El Reg, where objectivity is something to eventually strive for, but if it gets in the way of clicks, facts are definitely not a friend.
And, somehow, that strategy seems to keep working decade after decade. Yeah, I don't get it either...
After I’d been in the firing line for a couple of Reg articles I started realising that yes, they don’t let much stand in the way of a good story. They still write a good story though, it’s just slightly more tenuously tethered to reality than I’d originally imagined.
At least you know what you're getting with El Reg, unlike Very Serious Publications written for Very Serious People. The average article in CIO is also densely-packed bullshit, just polished up more.
Not a "normal" option, though. They plan to hide it away inside `gsettings`, so only power users who already know about middle-click paste will be able to find it and enable it. This completely destroys discoverability.
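For reference, the toggle in question appears to be the existing gtk-enable-primary-paste key under org.gnome.desktop.interface; that's my assumption about which key survives the change, not something confirmed anywhere. A minimal sketch of flipping it from a script, assuming a GNOME session with gsettings on PATH:

    # Sketch: re-enable middle-click (primary selection) paste on GNOME,
    # assuming the org.gnome.desktop.interface schema still exposes
    # the gtk-enable-primary-paste key.
    import subprocess

    def set_primary_paste(enabled: bool) -> None:
        # gsettings expects the literal strings "true"/"false" for booleans
        subprocess.run(
            ["gsettings", "set", "org.gnome.desktop.interface",
             "gtk-enable-primary-paste", "true" if enabled else "false"],
            check=True,
        )

    if __name__ == "__main__":
        set_primary_paste(True)

Which rather proves the discoverability point: nobody who doesn't already know about the feature is ever going to stumble onto that.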
I use both because they use two different registers to store the information. This allows the CTRL+C content to be different from the middle-mouse highlight.
I cannot stand the Windows middle mouse user experience and always prefer the middle highlight and paste method.
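To make the "two registers" concrete: on X11 they are the PRIMARY selection (whatever you last highlighted) and the CLIPBOARD selection (whatever you last copied with Ctrl+C). A minimal sketch that reads both, assuming xclip is installed (on Wayland, wl-paste and wl-paste --primary behave similarly):

    # Sketch: print both X11 selections to show they are independent buffers.
    import subprocess

    def read_selection(which: str) -> str:
        # which is "primary" (mouse highlight) or "clipboard" (Ctrl+C)
        result = subprocess.run(
            ["xclip", "-o", "-selection", which],
            capture_output=True, text=True,
        )
        return result.stdout

    print("PRIMARY  :", read_selection("primary"))
    print("CLIPBOARD:", read_selection("clipboard"))

Highlight one piece of text, Ctrl+C another, and the two lines print different things.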
I find having two clipboards at the same time to be super handy, and I literally use it all the time. Yes, KDE also has a clipboard manager that allows me to do Meta+V and paste from history, but I use the two clipboards way more frequently, and it is easier/faster anyway.
(Formally, it makes handwavy sense: having a clipboard with a history is basically a pushdown automaton, but having two of these in one box is not a PDA any more - it's something categorically more powerful, equivalent to a Turing machine, IIRC.)
Gnome options have a habit of disappearing. I've followed the project from its conception to the current iteration, used v1 and v2 interchangeably with KDE, and eventually moved to Xmonad with whatever applications I need. Gnome 1 was hackish and geeky; Gnome 2 polished off the hackishness and turned an ugly but promising duckling into a fully-functional duck. Then came v3, and with that the opinionated paring-down of options started for real. It became almost obligatory to install one or more 'gnome tweaks' tools to make things work as they used to. Strangely enough, this quest for 'simplicity' has forced many Gnome users to (re)turn to hackish tools like gnome-tweaks to make their computers work like they want, as opposed to the way the Gnome team insists they should work.
Sure. But it's a deprecation, and there are numerous similar settings that are only available by tweaking settings manually or using gnome-tweaks. Right now nearly every Linux app supports select with the left button and paste with the middle. It's fast, useful, doesn't require a keyboard, etc. Amusingly, I've seen various logins block control-v, but middle click works. God forbid you use a password safe with your bank login.
When you use gnome-tweaks there's a ton of "WARNING you may break things" and of course anything off the default path is likely to receive zero testing.
Personally I find middle-click to paste one of the differentiators between MacOS, Windows, and Linux. I'm pretty surprised it's not more common. I was amused that iTerm2 added copy-on-select, so you don't even have to type control-c.
Are you sure? Have you actually timed this, or are you just going by your subjective impression of time?
In human factors engineering we have known for decades that some things that seem faster are actually slower when you time them. We are taught early never to trust what someone says about time, and always to find an objective way to measure it.
That is why I (right-handed) was taught by my first boss at my first job in the 2000s to use my left hand for the mouse: the secondary hand for the secondary task (I'm not a designer, artist, or pro gamer, so the keyboard is my primary tool).
Now I have a big problem with this: there are no good left-handed mice on the market anymore, and symmetric mice have right-handed buttons (and no thumb buttons, like forward/backward, on the left side). Buttons can be swapped in the OS, but that messes up remote access like VNC or RDP to systems without swapped buttons... So the buttons must be swapped physically. No luck.
Most of the useful keyboard shortcuts are chordable from the left hand. Left mouse is inconvenient for that. I'm lefty and stuck using left mouse periodically due to injuries and I don't love it but it's tolerable. For the mouse situation I just stick to symmetric 3-button mice and never swap buttons so I can change hands or have a coworker use the mouse uninterrupted.
I also mouse left-handed, but it never occurred to me to swap the buttons from the right-handed configuration. It's always been a practical thing. The only mice I'm likely to have within reach at any point are probably right-handed, so I just had to learn that way. Left click with middle finger, right click with index.
I would kill for a true ambi five-button mouse to replace my old Microsoft Intellimouse, but I've run into the same problem, they just don't seem to exist anymore. All five button mice on the market either have both buttons 4 and 5 on the left side for righties, or have a grotesquely unbalanced design in some other way.
> I would kill for a true ambi five-button mouse to replace my old Microsoft Intellimouse, but I've run into the same problem, they just don't seem to exist anymore.
I was going to say Steelseries Sensei but it looks like those have been discontinued.
It looks like the SteelSeries Sensei Ten is ambidextrous and symmetrical, with two additional buttons on each side. But it's not on the cheap side, if you can even find one. It is still listed on their site, but I cannot find anywhere that sells it.
It has one problem: the buttons are not swapped physically. The leftmost button is the primary one (first), and the rightmost is the secondary (third).
I have this one and use it with software swapping, but each time I log in to a remote computer via RDP I need to un-swap it in the settings and then swap it back :-(
It is striking that Logitech has forgotten how to make a proper left-handed mouse. Their older models (long since discontinued) were perfectly OK!
Also, it's very small for my hand. But better than nothing.
Fwiw this is how cars work when you change to a country that drives on the other side of the road. It seems like mirroring the car would make sense. But really everything is shifted to the opposite side as a translation without reflection. It's easier to manufacture, but as many of you will know and is apparent to all rental agencies, adapting doesn't take long for the average driver, even on manual transmission.
My trouble is I really do want an ambi mouse, not a lefty mouse, since I like to switch back and forth (and I always game right-handed). Maybe I should just get one of each...
Well, I usually use the mouse to select text. And then I usually use the mouse to put the cursor precisely where I need to paste. So even in a Ctrl-C, Ctrl-V workflow, I'm using the mouse as much.
If you're using something like vim or emacs, then yeah, I would agree with you, but for something like docker commands there's just no easy way to copy a specific container ID without using the mouse (if there is, let me know lol).
My logic is if your hand is already on the mouse, it's going to be faster to paste with a mouse than your keyboard.
> for something like docker commands, there's just no easy way to copy a specific container ID without using the mouse
Some terminals have a mode where you can move the cursor around the history, and allow searching / copying / pasting. Alacritty and tmux come to mind, others may also implement something similar.
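And on the "no easy way to copy a container ID without the mouse" point upthread: docker's own filters get you most of the way there. A minimal sketch, where "web" is just a hypothetical container name and xclip is assumed to be available for putting the result on the clipboard:

    # Sketch: grab the ID of a running container by name and copy it, no mouse needed.
    import subprocess

    def container_id(name_fragment: str) -> str:
        # `docker ps -q --filter name=...` prints matching container IDs, one per line
        out = subprocess.run(
            ["docker", "ps", "-q", "--filter", f"name={name_fragment}"],
            capture_output=True, text=True, check=True,
        )
        ids = out.stdout.split()
        if not ids:
            raise SystemExit(f"no running container matching {name_fragment!r}")
        return ids[0]

    cid = container_id("web")  # "web" stands in for whatever container you're after
    # Put it on the CLIPBOARD selection so a plain CTRL+SHIFT+V paste works
    subprocess.run(["xclip", "-selection", "clipboard"], input=cid, text=True, check=True)
    print(cid)

Plain `docker ps -q --filter name=web` in the shell does the same job if you'd rather pipe it straight into the next command.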
Haven't tried this yet, but I literally just loaded the OG PC version on my Steam Deck.
The originals are amazing, but I have to say, for all their faults, the Definitive Editions figured out the camera. For anyone who played the OG versions, you were stuck with the "follow cam" unless you had a PC + mouse.