BioMan

joined 2 years ago
[–] BioMan@awful.systems 7 points 1 week ago* (last edited 1 week ago)

I just ran a test and after a few google-gemini-assisted searches about satellite launch rates and a few back and forth questions about orbital debris and relative masses of different spacecraft to prime the system, I was able to evoke from gemini a massive unhinged rant about orbital goblins and their culture and effects on the global climate with a single instance of the word "Goblins!" at the end of a prompt.

[–] BioMan@awful.systems 7 points 1 week ago

This would actually be an interesting question for the more rigorous end of the mechanistic interpretability people to study. They decompose the system to find 'features' within different layers that are associated with different behaviors or concepts in the inputs and outputs, that activate or deactivate each other. Famous example being the time they identified a linear combination of activations in a layer that corresponded to 'the golden gate bridge' and when they reached in and kept their numbers high during the running of the model it would not stop talking about it regardless of the topic, even while acknowledging that its answers were incorrect for the questions at hand.

I actually would love to see what mechanistically happens to that feature when you put in the input 'do not talk about the golden gate bridge'.

[–] BioMan@awful.systems 3 points 1 week ago (1 children)

Checks out. Political science, biological science, physics... we got them all. Might have to go to ancient egypt to get hydrology religion though.

[–] BioMan@awful.systems 6 points 1 week ago (1 children)

So we are inferring that in the vector space of all possible sentences, QNTM is sitting at one of the attractors?

[–] BioMan@awful.systems 6 points 1 week ago

It's absolutely crazy, but I think Yud is the less unhinged person here

[–] BioMan@awful.systems 9 points 3 weeks ago

Not really part of the back and forth but I find this illuminating of their recent travails, regarding it not being a step to far to prevent them from posting:

"This isn’t super relevant since it’s not like the standards are super high but ever since the enormous onslaught of LLM psychosis posters, the default of people who try to post to LW is to get rejected from posting here"

Sounds like the mods have had to deal with a lot of unbalanced people lately, and are not having it.

[–] BioMan@awful.systems 8 points 3 weeks ago (2 children)

The link to the guide to setting up a retrofitted boxtruck to continue AI alignment research in with local copies of the internet archive after civilization collapses in 2025 is fun

[–] BioMan@awful.systems 3 points 3 weeks ago* (last edited 3 weeks ago) (1 children)

“So how does the Epstein drive work? Very well."

― James S.A. Corey, Leviathan Wakes

[–] BioMan@awful.systems 7 points 3 weeks ago (1 children)

Is the crappy dragon fursona related to Peter Thiel being an anagram for "the reptile"?

[–] BioMan@awful.systems 11 points 3 weeks ago* (last edited 3 weeks ago) (2 children)

I'm a huge fan of Greg Egan's fiction and a huge fan of him pissing off the rats. He's been explicitly needling them and making fun of them in his fiction for over a decade. Making calm contradictions against them for over two decades, after noticing weirdos being fans of his.

[–] BioMan@awful.systems 15 points 3 weeks ago* (last edited 3 weeks ago) (24 children)

Friend of Ziz and cofounder of the 'rationalist fleet' pops up out of the woodwork trying to clear Ziz's name

https://www.lesswrong.com/posts/mbrmZmzBdtn4qrSus/re-introduction-of-a-rationalist-dragon-and-clarifications

I find myself noticing things rather detached from the typical Ziz funnybusiness more strongly than I notice the stuff about that whole situation.

"I'm Gwen Danielson, a neuroscientist and bioengineer, who decided as a child that I would end Death (and bring people back if I could) and that I would become a dragon and help generally facilitate a fantastical transhumanist future."

"I dream of non-Euclidean geometries, of countless worlds visible and accessible in the daytime sky, of competent infrastructure, of soul forges continually working to bring back the dead... I dream of reaching through warps in the spacetime fabric to save the dying across time"

"Signed, the dragon of creation Creatrei (cree-AH-trey) also known as Gwen Danielson or as Char and Astria (when referring to my hemis as distinct individuals)"


The reactions are fun. "This post is not actually doing a good job of making me trust you and think this conversation is safe to have[1], and I notice that as I am saying this that I am afraid that this will now somehow result in someone trying to murder me in my sleep"

view more: next ›