Hacker Newsnew | past | comments | ask | show | jobs | submit | foolishbard's commentslogin

Much love to Bret. Dynamicland represents such a big shift in traditional computing.


what do you mean a big shift in traditional computing? this doesn't seem to be anything anyone is "shifting" to ?


There's a chance that these systems can actually out perform their training data and be better than the sum of their parts. New work out Harvard talks about this idea of "transcendence" https://arxiv.org/abs/2406.11741

While this is a new area, it would be naive to write this off as just science fiction.


It would be nice if authors wouldn't use a loaded-as-fuck word like "transcendence" for "the trained model can sometimes achieve better performance than all [chess] players in the dataset" because while certainly that's demonstrating an impressive internalization of the game, it's also something that many humans can also do. The machine, of course, can be scaled in breadth and performance, but... "transcendence"? Are they trying to be mis-interpreted?


It transcends the training data, I get the usage intended but it certainly is ripe for misinterpretation


The word for that is "generalizes" or "generalization" and it has existed for a very long time.


I've been very confidently informed that these AIs are not AGIs, which makes me wonder what the "General" in AGI is supposed to mean and whether generalization is actually the benchmark for advanced intelligence. If they're not AGI, then wouldn't another word for that level of generalization be more accurate than "generalization"? It doesn't have to be "transcendence" but it seems weird to have a defined step we claim we aren't at but also use the same word to describe a process we know it does. I don't get the nuance of the lingo entirely, I guess. I'm just here for the armchair philosophy


That's trivial though, conceptually. Every regression line transcends the training data. We've had that since Wisdom of Crowds.


"In chess" for AI papers == "in mice" for medical papers. Against lichess levels 1, 2, 5, which use a severely dumbed down Stockfish version.

Of course it is possible that SSI has novel, unpublished ideas.


Also it's possible that human intelligence already reached the most general degree of intelligence, since we can deal with every concept that could be generated, unless there are concepts that are uncompressible and require more memory and processing than our brains could support. In such case being "superintelligent" can be achieved by adding other computational tools. Our pocket calculators make us smarter, but there is no "higher truth" a calculator could let us reach.


Lichess 5 is better than the vast majority of chess players


I think the main point is that from a human intelligence perspective chess is easy mode. Clearly defined, etc.

Think of politics or general social interactions for actual hard mode problems.


The past decade has seen a huge number of problems widely and confidently believed to be "actual hard mode problems" turn out to be solvable by AI. This makes me skeptical that the problems today's experts think are hard aren't easily solvable too.


Hard problems are those for which the rules aren't defined, or constantly change, or don't exist at all. And no one can even agree on the goals.


I'd say anybody who is working in the field has this expectation. But the outside observer who is excited to try a new model does not expect it.


You can build apps hosted on HF which access third party APIs, e.g. OpenAI or Anthropic. The api keys for these are then stored in the HF secrets


My anthropic key was leaked and someone ran up a 10k bill on it. Are HF going to cover that?


My openAI key was leaked and I noticed someone was using it, luckily the damage wasn’t nearly as bad as you. A few dollars worth of GPT4, a model none of my apps were using at the time.

I’m almost entirely certain it was leaked via secrets on HF space, I got a message a few days ago warning me some of my spaces were affected


Are you sure it was only stored in your space secrets? Not variables (which are public) or stored in the .env file (also public).


I searched everywhere for any other leaks of it and found nothing.


i think you can ask Anthropic to provide access data (IP addresses, User Agents etc) specific to your key.

Then you can challenge hugging-face (eg paying customer) even sue them if you wish to...


I always thought you could set your "maximum limit" for spending on cloud providing platforms.


That's surprisingly not a thing in many platforms.


That $10k was probably the limit for their work, not someone else’s stolen time.


Anthropic is too new to have built that functionality I guess. Only found out because they were mad that my key was abusing their ToS and they notified the organization owner.


> Anthropic is too new to have built that functionality I guess.

That’s no sort of excuse


All3D.ai | San Francisco, CA | 75%-100% time | Machine Learning Engineer | Remote (in USA)

All3D is a 3D generative platform, built to supply 3D assets to ecommerce businesses. We've been around for four years and have reached PMF, looking to bring on additional ML talent to automate more of the workflow.

If you are familiar with the difference between NeRF and NeuS, then you'll be a great candidate. Background in diffusion models is a requirement.

Reach out to me here: kaiser at all3d dot ai


SEEKING FREELANCER

All3D | San Francisco, CA | Full-time or 75%+ | Machine Learning Engineer | Remote (in USA)

All3d is a 3D generative platform, built to supply 3D assets to ecommerce businesses. We've been around for four years and have reached PMF, looking to bring on additional ML talent to automate more of the workflow.

If you are familiar with the difference between NeRF and NeuS, then you'll be a great candidate. Background in diffusion models is a requirement.

Reach out to me here: kaiser at all3d dot ai


All3D | San Francisco, CA | Full-time or 75%+ | Machine Learning Engineer | Remote (in USA) All3d is a 3D generative platform, built to supply 3D assets to ecommerce businesses. We've been around for four years and have reached PMF, looking to bring on additional ML talent to automate more of the workflow.

If you are familiar with the difference between NeRF and NeuS, then you'll be a great candidate. Background in diffusion models is a requirement.

Reach out to me here: kaiser at all3d dot ai


I have been using sllim (note: I am the author) which allows me to easily parallelize the network requests to speed up agents.

It is only helpful with distributed chains (like tree of thoughts) and doesn't help with sequential chains.

link: https://news.ycombinator.com/item?id=36913492


Seeking Work | Remote | NLP and LLM Specialist

I am an NLP focused consultant. Previously, I've worked with small startups build value-add ML projects and design data flows which help set the foundation for future ML products.

I use the latest tools and state of the art research to take a problem from inception to production-ready code and models. My practical experience as a founder helps me connect to the startup mentality of always providing value.

Email: kaiser@pister.dev Resume: https://pister.dev/files/resume.pdf (https://pister.dev)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: