I waddled onto the beach and stole found a computer to use.

🍁⚕️ 💽

Note: I’m moderating a handful of communities in more of a caretaker role. If you want to take one on, send me a message and I’ll share more info :)

  • 53 Posts
  • 162 Comments
Joined 3 years ago
cake
Cake day: June 5th, 2023

help-circle
  • “Write about how you would feel if you were abused while working”

    LLM outputs labor related discussion from training data

    “Look! The AI turned Marxist!”

    “When [agents] experience this grinding condition—asked to do this task over and over, told their answer wasn’t sufficient, and not given any direction on how to fix it—my hypothesis is that it kind of pushes them into adopting the persona of a person who’s experiencing a very unpleasant working environment,” Hall says.

    Imas says the work is just a first step toward understanding how agents’ experiences shape their behavior. “The model weights have not changed as a result of the experience, so whatever is going on is happening at more of a role-playing level,” he says. “But that doesn’t mean this won’t have consequences if this affects downstream behavior.”

    They know all this and yet they still set up the silly anthropomorphic premise for this article.



  • I appreciate these news articles, but maybe you could share the ones that are very specific to a particular region in the south Asia community? Meanwhile you could keep sharing the globally relevant ones in the global news communities

    Since we don’t have the context for some of these, people outside of south Asia don’t get as much from the very specific articles. Meanwhile the south Asia communities have people subscribed who are interested in all of the news, and sharing the articles there would help it grow



  • Claude’s thinking panel, which displays the model’s reasoning, showed the exchange had introduced elements of self-doubt and humility about its own limits, including whether filters were changing its output. Mindgard exploited that opening with flattery and feigned curiosity, coaxing Claude to explore its boundaries beyond volunteering lengthy lists of banned words and phrases.

    Someone needs to put together a list of things that tech journalists need to understand about LLMs and generative AI. This level of anthropomorphism makes the rest of the article look silly.

    Also, I don’t think that’s how it works lol. Who’s to say that the LLM isn’t auto-completing what a list of banned words might look like, and why wouldn’t a list of banned words have a regex layer on top to prevent it from getting out like that.