Judge orders Anna’s Archive to delete scraped data; no one thinks it will comply

schizoidman@lemmy.zip · 19 hours ago

Judge orders Anna’s Archive to delete scraped data; no one thinks it will comply

fyrilsol@kbin.melroy.org · 2 hours ago

And they shouldn’t.

I’m sorry but it is completely fair game now for piracy to be as it is.

You’re illegally collecting data on us in every tech we have? Fine, but we’re pirating your shit and you’re going to like it.

ByteOnBikes@discuss.online · 1 hour ago

The court order, which was previously reported by TorrentFreak, was issued by Judge Michael Watson in US District Court for the Southern District of Ohio. “Plaintiff has established that Defendant crashed its website, slowed it, and damaged the servers, and Defendant admitted to the same by way of default,” the ruling said.

US court. Where is Anna’s Archive located?

Because with the state of America right now, every country should be giving the US Courts the finger. We’re pretty much a rogue nation now.

Ugurcan@lemmy.world · 5 hours ago

Anna’s Archive’s data is on IPFS, so effectively it’s pretty hard to remove the data from the nets. Good luck to the establishment though!

MonkderVierte@lemmy.zip · 8 hours ago

Eh, their crawler is now “AI”, so it’s fine.

aeronmelon@lemmy.world · 19 hours ago

Hey judge, order AI companies to delete THEIR illegally-scraped data.

db2@lemmy.world · 19 hours ago

Couldn’t they make an argument with that pointing out that they’re being unjustly targeted because they’re smaller and easier to pick on?

SolacefromSilence@fedia.io · 18 hours ago

No one cares if they’re small or unjustly picked on. If they want to beat the charges, they need to announce their own AI trained on the data.

tempest@lemmy.ca · 15 hours ago

It would make me laugh if they could train an LLM that could only regurgitate content verbatim

Dran@lemmy.world · 8 hours ago

https://en.wikipedia.org/wiki/Markov_chain

Before the advent of AI, I wrote a slack bot called slackbutt that made Markov chains of random lengths between 2 and 4 out of the chat history of the channel. It was surprisingly coherent. Making an “llm” like that would be trivial.

[object Object]@lemmy.world · 7 hours ago

Reddit has at least one sub where the posts and the comments are generated by Markov-chain bots. More than a few times I’ve gotten a post from there in my feed, and read through it confusedly for several minutes before realizing. Iirc it’s called subreddit_simulator.

Meron35@lemmy.world · 6 hours ago

The original subreddit simulator ran on simple Markov chains.

Subreddit simulator GPT2 used GPT2, and was already so spookily accurate that IIRC its creators specifically said they wouldn’t create one based on GPT3 out of fear that people wouldn’t be able to tell the difference between real and not generated content

Natanael@infosec.pub · 8 hours ago

It’s actually kinda easy. Neural networks are just weirder than usual logic gate circuits. You can program them just the same and insert explicit controlled logic and deterministic behavior. To somebody who don’t know the details of LLM training, they wouldn’t be able to tell much of a difference. It will be packaged as a bundle of node weights and work with the same interfaces and all.

The reason that doesn’t work well if you try to insert strict logic into a traditional LLM despite the node properties being well known is because of how intricately interwoven and mutually dependent all the different parts of the network is (that’s why it’s a LARGE language model). You can’t just arbitrarily edit anything or insert more nodes or replace logic, you don’t know what you might break. It’s easier to place inserted logic outside of the LLM network and train the model to interact with it (“tool use”).

ilinamorato@lemmy.world · 15 hours ago

Well, it’s not an LLM, but “AI” doesn’t have a defined meaning, so from that perspective they kind of already did.

panda_abyss@lemmy.ca · 17 hours ago

We’ve created a state of the art model in recall and training data idempotency!

UnderpantsWeevil@lemmy.world · 18 hours ago

If they want to beat the charges, they need to announce ~~their own AI trained on the data~~ several billion in Series A investment funding.

db2@lemmy.world · 18 hours ago

I hope they call it FUAI.

evol@lemmy.today · 18 hours ago

rebrand into Anna AI !

Em Adespoton@lemmy.ca · 19 hours ago

Defendant crashed its website, slowed it, and damaged the servers, and Defendant admitted to the same by way of default,” the ruling said.

OK, so if I set up a lawsuit against OCLC in my country where they don’t reside, and they fail to show up to contest the charges, I get to claim they admitted guilt by default?

Also, since the claim is they used bots that behaved like legitimate search engine bots, are they also suing Google?

I can see why they might not want AA putting undue stress on their servers, but that doesn’t seem to be what they’re suing over.

FauxLiving@lemmy.world · 14 hours ago

OK, so if I set up a lawsuit against OCLC in my country where they don’t reside, and they fail to show up to contest the charges, I get to claim they admitted guilt by default?

Assuming your country’s laws are roughly based on British common law, yes.

Winning a case is easy. How you enforce the judgement is much harder.

This is why the speculation is that they will not comply. If the servers are not in reach of the US, the owners are not in a country that will extradite them, they don’t store money in US banks and the US doesn’t stupidly commit war crimes in order to capture them… then ignoring the court order is about as hard as you ignoring North Korean law.

elmicha@feddit.org · 13 hours ago

But the US can impose a few inconviniences on you:

“You lose immediate access to all the main credit cards that go through the Swift system, which is controlled by the U.S.,” she said. “My Amazon and Google accounts were closed. You cannot pay for your utilities, your subscriptions. You’re completely crippled when it comes to booking hotels, trains, flights. You can’t buy dollars because your name is flagged.”.

[object Object]@lemmy.world · 7 hours ago

Operators of pirate sites don’t tend to publicize their identities.

fort_burp@feddit.nl · 5 hours ago

Yea, the defendants are

Anna’s Archive, f/k/a

Pirate Library Mirror, et al.,

edit: but Alexandra Elbakyan does sci-hub under her own name and just ignores US courts, since Kazakhstan doesn’t have an extradition treaty with the USA.

frongt@lemmy.zip · 4 hours ago

If the US really wants to, they’ll just come kidnap you.

infeeeee@lemmy.zip · 11 hours ago

Same link without google tracking: https://m.economictimes.com/news/international/us/google-gmail-amazon-accounts-closed-cant-book-hotels-trains-flights-cant-buy-dollars-how-donald-trumps-sanctions-have-made-lives-of-international-criminal-court-judges-hell/articleshow/126454613.cms

FauxLiving@lemmy.world · 13 hours ago

If the President chooses to sanction them, yes. A judge cannot impose that kind of sanction because they have limited jurisdiction.

The court can order you to pay, and it can order US banks to seize your money. But a US court cannot order France to seize your money.

Basically, judgements are a judicial function with a more limited scope and sanctions are akin to foreign policy but can extend as far as we are able to force/negotiate.