- Post in [email protected] attacks the entire concept of AI safety as a made-up boogeyman
- I disagree and am attacked from all sides for "posting like an evangelist"
- I give citations for things I thought would be obvious, such as the fact that AI technology in general has improved in capability compared to several years ago
- Instance ban, "promptfondling evangelist"
This one I'm not as aggrieved about; it's just weird. It's reminiscent of the lemmy.ml type of echo chamber where everyone's convinced it's one way because, in a self-fulfilling prophecy, anyone who isn't convinced gets yelled at and banned.
Full context: https://ponder.cat/post/1030285 (Some of my replies were after the ban, because I didn't PT Barnum carefully enough and so didn't realize.)
Incorrect. I definitely think we are dealing with bad faith actors. I talk about that at the end of my very first message. I actually agree that the study they looked at, based on asking a chatbot things and then inferring judgements from the answers, is more or less useless. I'm just saying that doesn't imply that the entire field of AI safety is made of bad actors.
No. I said, "AI chat bots that do bizarre and pointless things, but are clearly capable of some kind of sophistication, are exactly the warning sign that as it gains new capabilities this is a danger we need to be aware of." That's a brief explanation of my argument. People deploying AI systems which then do unexpected or unwanted things, but can get some types of tasks done effectively, and then the companies not worrying about it, is exactly the problem. I just cited someone talking at more length about it, that's all.
Yes. Because they're two different things. There is real AI safety, and then there is AI safety grift. I was talking about the former, so it makes sense that it wouldn't overlap at all with the grift.
Sure. Say you train a capable AI system to accomplish a goal. Take "maximize profit for my company" as an example. Then, years from now when the technology is more powerful than it is now, it might be able to pursue that goal so effectively that it's going to destroy the earth. It might decide that enslaving all of humanity, and causing them to work full-time in the mines and donate all their income to the company's balance sheet, is the way to get that done. If you try to disable it, it might prevent you, because if it's disabled, then some other process might come in that won't maximize the profit.
It's hard to realize how serious a threat that is when I explain it briefly like that, partly because the current AI systems are so wimpy that they could never accomplish it. But, if they keep moving forward, they will at some point become capable of doing that kind of thing and fighting us effectively if we try to make them stop, and once that bridge is crossed there's no going back. We need to have AI safety firmly in mind as we devote such incredible resources and effort to making these things more powerful, and currently, we are not.
I think it's highly unlikely that whatever that system will be, will be an LLM. The absolutely constant confusion of "AI" with "LLM" in the people who are trying to dunk on me is probably the clearest sign, to me, that they're just babbling in the wilderness instead of trying to even bother to understand what I'm saying and why AI safety might be a real thing.
The only relevance the paper has is that I was challenged to show that LLMs are gaining capabilities over time. That's obviously true, but also, sure, it's been studied objectively. They set out a series of tasks, things like adding numbers together or basic reasoning tasks, and then measured the performance of various iterations of LLM technology over time on those tasks. Lo and behold, the newer ones can do things the old ones can't do.
The paper isn't itself directly relevant to the broader question, just the detail of "is AI technology getting any better." I do think, as I said, that the current type of LLM technology has gone about as far as it's going to go, and it will take some new type of breakthrough similar to the original LLM breakthroughs like "attention" for the overall technology to move forward. That kind of thing happens sometimes, though.
I originally stated that I did not find your arguments convincing. I wasn't talking about AI safety as a general concept, but the overall discussion related to the article titled "Anthropic, Apollo astounded to find a chatbot will lie to you if you tell it to lie to you".
I didn't find your initial post (or any of your posts in that thread) to be explicit in recognizing the potential for bad faith actions from the likes of Anthropic and Apollo. On the contrary, you largely deny the concept of "criti-hype". One can, in good faith, interpret this as de facto corporate PR promotion (whether that was the intention or not).
You didn't mention the hypothetical profit maximization example in the thread, and your phrasing implied a current tool/service/framework, not a hypothetical.
I don't see how the YT video or the article summary (I did not read the paper) is honestly relevant to what was being discussed.
I am honestly trying not to take sides (but perhaps I am failing in this?), more like suggesting that how people interpret "groupthink" can take many forms and that "counter-contrarian" arguments in and of themselves are not some magical silver bullet.
Okay, cool. I was. That was my whole point: even if some of it is grift, AI safety itself is real and important, and that's worth keeping in mind.
I think I've explained myself enough at this point. If you don't know that the paperclips reference from the linked article is indicative of the exact profit maximization situation that I explained in more detail for you when you asked, or you can't see how the paper I linked might be a reasonable response if someone complains that I haven't given proof that AI technology has ever gained abilities over time, then I think I'll leave you with those conclusions, if those are the conclusions you've reached.