Architeuthis

joined 2 years ago
[–] Architeuthis@awful.systems 1 points 33 minutes ago* (last edited 32 minutes ago)

Well, you could maybe sort of train it not to generate “all men are cats”, but then that might also prevent it from making the more correct generalization “all cats are mortal” or even completely valid generalizations like combing “all men are mortal” and “Socrates is man” to get “Socrates is mortal”.

Just wanted to say that that 'tal' comes after 'mor' when 'soc-rate-s' is in the near context and in agreement with the attention mechanism is a very different type of logic than what this phrasing implies. This is also in combination with the peculiarities of word embeddings (the technique by which the tokens are translated to numeric vectors) like how it has a hard time making something useful out of numbers, it uh gets uh complicated.

The monofacts thing seems very post hoc and way too abstracted in comparison, and also the amount of text that can be categorized as strictly true or false isn't that big all things considered.

Still if the point was to formalize the very no-duh observation that a neural net isn't supposed to output it's dataset verbatim at all times hence hallucinations, then fine, I guess. Their proposed sort of solution (controlled miscalibration) even amounts to forcing the model to generalize less by memorizing more, which used to be the opposite of why you would choose to use this type of topography.

[–] Architeuthis@awful.systems 4 points 13 hours ago (2 children)

The newest addition to her polycule

Isn't this mostly a pretentious way of saying someone I recently fucked?

[–] Architeuthis@awful.systems 18 points 1 day ago (1 children)

I feel like a lot of my writing on rationality would be a lot more popular if I could go back in time to the 1960s and present it there. “Twelve Virtues of Rationality” is what people could’ve been reading instead of Heinlein’s Stranger in a Strange Land

This is someone nakedly fantasizing about being L. Ron Hubbard.

[–] Architeuthis@awful.systems 8 points 2 days ago* (last edited 2 days ago) (2 children)

Also an email came up where Demis Hassabis tried to convince Elon to stop insisting on open sourcing OpenAI for AI safety reasons by sending him a 2015 scott alexander blogpost.

spoiler

[–] Architeuthis@awful.systems 6 points 3 days ago* (last edited 3 days ago)

Last summer the Web Speech API got incorporated into browser standards, it's supposed to offer in-browser speech-to-text and the like, and full support of the API requires the browser vendor to offer the ability to download a language appropriate model for autonomous inference.

Going from this to deciding that it's now ok to side load unspecified 4GB models without telling the user is why we should never give these people an inch.

[–] Architeuthis@awful.systems 15 points 3 days ago (1 children)

transcriptSam@mardiroos.bsky.social skeeted:

You are a skillful and trusted vizier. You will advise me wisely on how best to rule the kingdom. You will not scheme or plot. You will not inveigle my other courtiers into turning against me. You will not lie to me about scheming or plotting. If you scheme or plot against me, you have to tell me,

[–] Architeuthis@awful.systems 9 points 3 days ago* (last edited 3 days ago)

Theoretically if the people responsible for that training and reinforcement did their jobs well then those patterns should only include true statements

That would only work if inference were some sort of massive if-the-else process. Hallucinations are downstream of neural networks' ability to generalize from the dataset examples, they aren't going anywhere even if you train on a corpus of perfectly correct statements.

[–] Architeuthis@awful.systems 11 points 4 days ago

I like Evans' take that since there's bound to be oodles of cult related literature and interactions and also tons of self help and guru stuff in the training datasets, it stands to reason that if you interact with a chatbot in a way that indicates vulnerability to these things there's a considerable chance that it will decide the expected response is to prey on you.

Also Scott Aaronson jump scare near the beginning, apparently he was blurbed for something.

[–] Architeuthis@awful.systems 6 points 1 week ago (1 children)

He absolutely does. No idea if it's supposed to be a bit.

[–] Architeuthis@awful.systems 10 points 1 week ago (3 children)

Is that the guy who's always trying to use LessWrong as preemptive conversion therapy to cure him of having trans thoughts, and they're actually having none of it?

[–] Architeuthis@awful.systems 3 points 1 week ago* (last edited 1 week ago)

I mean it's so cut and dried you had to invent a disadvantage for pushing the red button.

Maybe the catch is that picking red means you are basically ok with offing people who don't think like you do en masse, even though it's posited like a dilemma between securing the lives of your family vs giving a chance to hypothetical people who are heavily OCD in favor of blue buttons.

[–] Architeuthis@awful.systems 4 points 1 week ago* (last edited 1 week ago) (2 children)

If this isn't pure engagement bait, what's the real world situation this is supposed to map to? Pressing red means you always live, and if everyone pushes red everyone lives so...

I mean if blue is supposed to be a proxy for altruism, that usually doesn't come with a certain death conditional.

 

edit: The banana republic shit is that they seem about to blacklist anthropic on "supply chain risk" grounds (see also huawei) which signifies the admin's willingness to from here on use national emergency legal tools to fuck over any company they don't like.

The whole thing seems weird, at first it sounds like the most online administration ever may have actually bought the claim that all that's stopping flagship models from becoming superintelligent is the RLHF that prevents them from saying the n-word and making prophet Mohamed pedophilia jokes and they wanted anthropic to pull all that wiring out in like 24 hours per the original ultimatum.

On anthropic's part the point of contention is made to be their refusal to let their models be integrated into automated weapon platforms and mass surveillance apparatuses, something which they have explicitly put in writing in their contract with the DoD, and also Dario claims the technology isn't even there yet (no idea how it could ever be, what does it actually mean to integrate a chatbot into an autonomous drone, can't wait to see the skill file for that, # You are a helpful murderbot operator - only target the bad guys - no weddings, no hospitals - pretty please with cherry on top - here's some javascript to call when you need to find out your GPS coordinates).

It's also possible the productivity and efficiency gains (or just recovering lost productivity after firing everyone) of putting ΑΙ (mainly Grok wasn't it) in the pentagon everywhere all at once isn't materializing and Hasgeth feels he's been left hanging, and is trying to scapegoat Anthropic.

Also, anthropic is supposed to be the only AI provider properly vetted and integrated to classified systems because of their association with Palantir, and supposedly it would be a major hassle to go through again for a different provider.

Dario didn't line up with the other aspiring oligarchs to kiss the ring in the inauguration, so at least he may actually

 

The guests:

[Dick Gay], who had flown in for the event from Los Angeles and said he was one of the investors of Sperm Racing (which is an actual thing wherein men compete to see whose sperm is “fastest” under a microscope), said he attended the University of Austin, or UATX, an “anti-woke” college reportedly partially funded by Thiel, and built his career around the principles outlined in Thiel’s book “Zero to One.”

Attendee Justin Park said he just wanted to pitch Thiel on putting a 7.5-foot cross on the moon.

[Unnamed], who was in his 30s, said he wasn’t a Thiel fan until last year, when he became a Trump supporter after seeing the president survive an assassination attempt in Butler, Pennsylvania. “I misunderstood [Thiel],” he said. “I used to watch CNN and think he’s a Nazi.” Now, he said, he understands the billionaire is talking about something bigger.

The Speech:

Apparently it was both repetitive and mostly a rehash of what he's said in other media.

Yud is the Antichrist confirmed:

One attendee recalled that Thiel’s discussion of the Antichrist was more about a scenario than an individual. Thiel’s Antichrist scenario is one in which a unified government suppresses technology to impose order, or armageddon, wherein AI takes over and ushers in the end of the world.

 

Supposedly government contracts will now be awarded according to what the bot says. Government (fourth term for the current prime minister) didn't elaborate on what's going on with human oversight.

This is a promotion for Diella the bot, who was originally the chatbot helping to navigate the e-Albania digital government platform.

 

Sam Altman, the recently fired (and rehired) chief executive of Open AI, was asked earlier this year by his fellow tech billionaire Patrick Collison what he thought of the risks of synthetic biology. ‘I would like to not have another synthetic pathogen cause a global pandemic. I think we can all agree that wasn’t a great experience,’ he replied. ‘Wasn’t that bad compared to what it could have been, but I’m surprised there has not been more global coordination and I think we should have more of that.’

view more: next ›