Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

ForgottenFlux@lemmy.world · 4 months ago

Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

someguy3@lemmy.world · edit-2 4 months ago

I kinda figured. It was annoying to do one, but then they wanted you to do two or three and that’s absurd. Whenever it comes up now, I usually just close out.

dinckel@lemmy.world · 4 months ago

At a certain point I did like 10 of them, and then ended up closing the page, cause it never let me in, all because I was on a vpn

Bezier@suppo.fi · 4 months ago

they wanted you to do two or three and that’s absurd

Yea how about 20

sramder@lemmy.world · 4 months ago

I tried to order some components on Digikey a few months ago and I’m still mentally scarred. Probably did a few hundred of those things over the course of 2 weeks.

Dudewitbow@lemmy.zip · 4 months ago

if you have to do that many, you either have some privacy setting on or on a flagged ip given from a VPN

snooggums@midwest.social · 4 months ago

Or google knows you will out up with it and want the most interaction it can get from you.

crank0271@lemmy.world · 4 months ago

Google’s just lonely 🥺👉👈

iiGxC@slrpnk.net · 4 months ago

Yeah exactly

Landsharkgun@midwest.social · 4 months ago

Well yah of course I do. Why the hell is that ‘abnormal’?

catloaf@lemm.ee · 4 months ago

Most people don’t, most bots do. You look more like a bot, so you get extra challenges.

Dudewitbow@lemmy.zip · 4 months ago

its abnormal to them because vpns are often also used by bad actors. your use is not abnormal but its a there are other people misusing it making it worse for everyone else.

Landsharkgun@midwest.social · 4 months ago

Wow, way to blame individuals who take basic precautions instead of the corporations who are blantly invading your privacy. Good job making the world a better place, bud.

LucidNightmare@lemm.ee · 4 months ago

VPN? Google will just go in a loop with these things, so I just stopped using Google completely.

I Cast Fist@programming.dev · 4 months ago

Whenever I’m on a private window the captchas just keep on coming. Trying to reset your Steam password via the program will also trigger an infinite loop of captchas, you HAVE to use a browser.

Bezier@suppo.fi · edit-2 4 months ago

No. But it’s also not like I get 20 constantly, it was just the worst I’ve seen. Usually it’s 2 to 5, I think.

I assume they’re just collecting data on how many are users willing to do.

LucidNightmare@lemm.ee · 4 months ago

One time I did five in a row, because I use VPNs for everything, and realized after the 5th time that it would have been easier to just use bing so I do that first now. Google has turned into my last last resort, which is quite funny, because that’s where Bing used to be. Lmao

IphtashuFitz@lemmy.world · 4 months ago

Stop using Tor…

Kusimulkku@lemm.ee · 4 months ago

STOP BEING SNEAKY MICHAEL

Ms. ArmoredThirteen@lemmy.ml · 4 months ago

Cries in battlenet sign up process

yum@lemmy.eco.br · 4 months ago

The one reason I tried to create an account and never came back

radivojevic@discuss.online · 4 months ago

That’s because you’re shady.

SpaceMan9000@lemmy.world · 4 months ago

Had this when at uni, mostly due to the amount of requests coming from a single IP

Bezier@suppo.fi · 4 months ago

They knew I was committing crimes with my adblocker.

radivojevic@discuss.online · 4 months ago

Elon musk wants to know what the government is going to do about you not viewing ads on Xitter

kingthrillgore@lemmy.ml · 4 months ago

Not going to his shithole website.

msage@programming.dev · 4 months ago

The worst kind - crimes against profit!

CosmoNova@lemmy.world · 4 months ago

Funny thing is they stop asking if you do them really slowly. Almost as if to tell you, you‘re too inefficient to even be an unpaid intern or something. Anyway, if they annoy you, take your time.

Fisch@discuss.tchncs.de · 4 months ago

Some captchas have also just gotten obvious AI training. “Click on the living being in this image”, “Select every image of the same object as in this example image”. And the images you have to select look obviously AI generated.

cm0002@lemmy.world · 4 months ago

Heh, I got one just the other day “Select the images containing structures built by people” lmao

SkaveRat@discuss.tchncs.de · 4 months ago

“click on all people not helping with the robot uprising”

WildPalmTree@lemmy.world · 4 months ago

Alas, I have but one up-vote. :~(

unexposedhazard@discuss.tchncs.de · 4 months ago

Im surprised that this is in the news right now. This has been acknowledged as fact for a decade or so.

GhostTheToast@lemmy.world · 4 months ago

Relevant 1053

Petter1@lemm.ee · 4 months ago

I still don’t get this one even after being linked to it so many times 😌🤣

Tja@programming.dev · 4 months ago

Someday you will, and you’ll be one of the lucky 10.000 that day.

Petter1@lemm.ee · 4 months ago

😆👌🏻

Croquette@sh.itjust.works · 4 months ago

Things that are common knowledge for you is not common knowledge for everyone and vice versa.

Instead of making fun of people for not knowing things, you should take the opportunity to teach so that you can get these fun moments of discovery and learning.

Petter1@lemm.ee · 4 months ago

😮l made fun of people that did not know something?

Croquette@sh.itjust.works · 4 months ago

No, I explained what the comic is trying to convey.

Just answering your question.

Petter1@lemm.ee · 4 months ago

❤️

serenissi@lemmy.world · 4 months ago

The objective of reCAPTCHA (or any captcha) isn’t to detect bots. It is more of stopping automated requests and rate limiting. The captcha is ‘defeated’ if the time complexity to solve it, whether human or bot, is less than what expected. Now humans are very slow, hence they can’t beat them anyway.

nickwitha_k (he/him)@lemmy.sdf.org · 4 months ago

There are much better ways of rate limiting that don’t steal labor from people.

serenissi@lemmy.world · 4 months ago

hCaptcha, Microsoft CAPTCHA all do the same. Can you give example of some that can’t easily be overcome just by better compute hardware?

nickwitha_k (he/him)@lemmy.sdf.org · 4 months ago

The problem is the unethical use of software that does not do what it claims and instead uses end users for free labor. The solution is not to use it. For rate limiting a proxy/load-balancer like HAProxy will accomplish the task easily. Ex:

serenissi@lemmy.world · 4 months ago

And what will you do if a person in a CGNAT is DoSing/scraping your site while you want others to access? IP based limiting isn’t very useful, both ways.

tb_@lemmy.world · 4 months ago

I thought captcha’s worked in a way where they provided some known good examples, some known bad examples, and a few examples which aren’t certain yet. Then the model is trained depending on whether the user selects the uncertain examples.

Also it’s very evident what’s being trained. First it was obscured words for OCR, then Google Maps screenshots for detecting things, now you see them with clearly machine-generated images.

smb@lemmy.ml · 4 months ago

[…] reCAPTCHA […] isn’t to detect bots. It is more of stopping automated requests […]

which is bots. bots do automated requests and every automated request doer can also be called a bot (i.e. web crawlers are called bots too and -if kind- also respect robots.txt which has “bots” in its name for this very reason and bots is the shortcut for robots) use of different words does not change reality behind it, but may add a fact of someone trying something on the other.

serenissi@lemmy.world · 4 months ago

There isn’t a good way to classify human users with scripts without adding too much friction to normal use. Also bots are sometimes welcome amd useful, it’s a problem when someone tries to mine data in large volume or effectively DoS the server.

Forget bots, there exist centers in India and other countries where you can employ humans to do ‘automated things’ (youtube like count, watch hour for example) at the same expense of bots. There are similar CAPTCHA services too. Good luck with those :)

Only rate limiting is the effective option.

smb@lemmy.ml · 3 months ago

Only rate limiting is the effective option.

i doubt that. you could maybe ratelimit per IP and the abusers will change their IP whenever needed. if you ratelimit the whole service over all users in the world, then your service dies as quickly into uselessness as effective your ratelimiter is. if you ratelimit actions of logged in users, then your ratelimiting is limited by your ability to identify fake or duplicate accounts, where captchas are not helpful at all.

at the same expense of bots. they might be cheap, but i doubt that anyway, bots don’t need sleep.

i was answering about that wording (that captchas were “not” about bots but about “stopping automated requests”) and that automated requests “are” bots instead.

call centers are neither bots nor automated requests (the opposite IS their advantage) and thus have no relation to what i was specifically saying in reply to that post that suggested automated requests and bots would be different things in this context.

i wasn’t talking about effectiveness of captchas either or if bots should be banned or not, only about bots beeing automated requests (and vice versa) from the perspective of the platform stopping bots. and that trying to use different words for things, (claiming like “X isn’t X, it is really U!”* or automated requests aren’t bots) does not change the reality of the thing itself.

*) unrelated to any (a-)social media platform

serenissi@lemmy.world · 3 months ago

stopping automated requests

yeah my bad. I meant too many automated requests. Both humans and bot generate spams and the issue is high influx of it. Legitimate users also use bots and by no means it’s harmful. That way you do not encounter captcha everytime you visit any google page, nor a couple of scraping scripts gets a problem. Recaptcha (or hcaptcha, say) triggers when there is high volume of request coming from same ip. Instead of blocking everyone out to protect their servers, they might allow slower requests so legitimate users face mininimal hindrance.

Most google services nowadays require accounts with stronger (like cell phone) verification so automated spam isn’t a big deal.

smb@lemmy.ml · 3 months ago

since bots are better at solving captchas and humanoid services exist that solve them, the only ones negatively affected by captchas are regular legitimate users. the bad guys use bots or services and are done. regular users have to endure while no security is added, and for the influx i guess it is much more like with the better lock on the front door: if your lock is a bit better than that of your neigbhour, theirs might be force-opened more likely than yours. it might help you, but its not a real but only relative and also very subjective feeling of 'security".

beeing slower than the wolves also isn’t as bad as long as you are not the slowest in your group (some people say)… so doing a bit more than others always is a good choice (just better don’t put that bar too low like using crowdsnakeoil for anything)

serenissi@lemmy.world · 3 months ago

the bad guys use bots or services and are done. regular users have to endure while no security is added

put in other words, common users can’t easily become ‘bad guy’ ie cost of attack is higher hence lower number of script kiddies and automated attacks. You want to reduce number. These protections are nothing for bitnet owners or other high profile bad actors.

ps: recaptcha (or captcha in general) isn’t a security feature. At most it can be a safety feature.

Mubelotix@jlai.lu · 4 months ago

I bypassed 35000 google recaptcha v2 using bots. Don’t ever rely on this for security

theherk@lemmy.world · 4 months ago

It is neither intended nor even stated to be intended for security.

Gizmokid2005@lemmy.world · 4 months ago

Except, that’s most of its ad copy on Google’s own website?

reCAPTCHA uses an advanced risk analysis engine and adaptive challenges to keep malicious software from engaging in abusive activities on your website. Meanwhile, legitimate users will be able to login, make purchases, view pages, or create accounts and fake users will be blocked.

It’s literally billed as a security measure for a website.

https://www.google.com/recaptcha/about/

theherk@lemmy.world · 4 months ago

I see your perspective, but I don’t consider that security in the context of software, which may also explain why they don’t use that word, though I readily admit that it is technically security of a sort. The term usually implies authentication, authorization, and isolation.

Gizmokid2005@lemmy.world · 4 months ago

I mean, except they do. Just because their simple ad copy omits it, doesn’t mean that’s not what they’re implying. It’s literally listed as one of their security products and also uses the term to talk about demos

Security live demo

https://cloud.google.com/security/products/recaptcha

theherk@lemmy.world · 4 months ago

I’m sorry I wasn’t more agreeable. You’re absolutely correct. I take it back.

Caboose12000@lemmy.world · 4 months ago

Where can I learn this power?

Mubelotix@jlai.lu · 4 months ago

I just spent 3$ worth of bitcoin on NoCaptchaAI. I used their web extension on a server which had a browser opened and controlled by a custom webextension I made so that a solved challenge would be returned to a swarm of clients upon request

Gregor@gregtech.eu · 3 months ago

Your extension is archived, I’d rather not use it.

Mubelotix@jlai.lu · 3 months ago

It’s a custom extension solving my very specific problem on a specific internal website. It was never meant for you to use it, it’s just there to serve as inspiration to others

interdimensionalmeme@lemmy.ml · 4 months ago

When they slow fade in the picture, I add one more software engineer to my kill list.

Appoxo@lemmy.dbzer0.com · 4 months ago

In case you didnt know: This is alrrady a thing with pictures slowly fading in for selecting stuff like traffic cones or busses.

snooggums@midwest.social · 4 months ago

The conclusion can be extended that the true purpose of reCAPTCHA v2 is a free image-labeling labor and tracking cookie farm for advertising and data profit masquerading as a security service,” the paper declares.

I thought this was known since it came out. It seemed even more obvious when the images leaned in heavily to traffic related pictures like stoplights.

polonius-rex@kbin.run · 4 months ago

Google should bear the cost of detecting bots, rather than shifting it to users

how?

IphtashuFitz@lemmy.world · 4 months ago

Don’t know why you’re being downvoted… My employer sees a lot of bot activity on our sites, which are hosted in AWS and protected by Akamai. It’s Akamai that informs us when a bot visits our site, and Akamai that lets us block it. Google never sees this traffic.

radivojevic@discuss.online · 4 months ago

Yeah. Written by someone who doesn’t really understand the internet.

siph@lemmy.world · 4 months ago

Considering the article states that reCAPTCHA v2 and v3 can be broken/bypassed by bots 70-100% of the time, they are obviously not the solution.

conciselyverbose@sh.itjust.works · 4 months ago

At what cost?

100% success rate isn’t even moderately useful if it costs $5 per pass. The discussion is completely pointless without a concrete, documented analysis of the actual hardware and energy costs involved.

radivojevic@discuss.online · 4 months ago

“Google should bear the cost”

Google should shut it down and make sites roll their own verification. Give everyone a month to implement a new solution on millions of websites.

AeroLemming@lemm.ee · edit-2 2 months ago

deleted by creator

radivojevic@discuss.online · 4 months ago

I’m actually 100% for rolling your own… almost everything.

20 years ago I made an e-commerce website for a client. Looking at the code now I’m embarrassed how insecure it is. However, because it was totally custom no one ever found the bugs and it has never been cracked. (Knock on wood) that’s the benefit of not using a prebuilt solution that isn’t a target for mass exploits.

polonius-rex@kbin.run · 4 months ago

how do you get the metric of 70-100% of the time?

the best bots doing it 70-100% of the time is very different to the kind of bot your average spammer will have access to

siph@lemmy.world · 4 months ago

Did you read the article or the TL:DR in the post body?

The paper, released in November 2023, notes that even back in 2016 researchers were able to defeat reCAPTCHA v2 image challenges 70 percent of the time. The reCAPTCHA v2 checkbox challenge is even more vulnerable – the researchers claim it can be defeated 100 percent of the time.

reCAPTCHA v3 has fared no better. In 2019, researchers devised a reinforcement learning attack that breaks reCAPTCHAv3’s behavior-based challenges 97 percent of the time.

So yeah, while these are research numbers, it wouldn’t be surprising if many larger bots have access to ways around that - especially since those numbers are from 2016 and 2019 respectively. Surely it is even easier nowadays.

polonius-rex@kbin.run · 4 months ago

researchers were able to defeat reCAPTCHA v2 image challenges 70 percent of the time

that doesn’t answer the question?

researchers devised a reinforcement learning attack that breaks reCAPTCHAv3’s behavior-based challenges 97 percent of the time

i’d argue “bespoke system, deployed in a very limited context, built by researchers at the top of their field” is kind of out of reach for most people? and any bot network scaled up automatically becomes easier to detect the further you scale it

the cost of just paying humans to break these already at or below pennies per challenge

Chozo@fedia.io · 4 months ago

Then what is?

siph@lemmy.world · 4 months ago

Maybe a billion dollar company has the budget to come up with something?

Looking at the numbers in this post, reCAPTCHA exists to make Google money, not to keep bots out.

I’d rather have no reCAPTCHA than the current state.

OsrsNeedsF2P@lemmy.ml · edit-2 4 months ago

Hi it’s me. I work for a billion dollar company with a budget. We have no ethical ideas on how to stop bots. Thanks for coming to my tech talk.

siph@lemmy.world · 4 months ago

Yeah, that’s about the way I’d expect it to go.

“Traffic resulting from reCAPTCHA consumed 134 petabytes of bandwidth, which translates into about 7.5 million kWhs of energy, corresponding to 7.5 million pounds of CO2. In addition, Google has potentially profited $888 billion from cookies [created by reCAPTCHA sessions] and $8.75–32.3 billion per each sale of their total labeled data set.”

There might be a tiny chance they’re not interested in changing things.

Anti_Iridium@lemmy.world · 4 months ago

Something something free market?

wreckedcarzz@lemmy.world · 4 months ago

I thought this was old news 20 years ago?

radivojevic@discuss.online · 4 months ago

This is bullshit. Author is literally insane.

kingthrillgore@lemmy.ml · edit-2 4 months ago

Remember the good old days when it was just malformed text you have to solve? I miss those days. AI was complete garbage and they had to use farms of eyeballs to solve them for bots, making it a costly operation. We’ve now totally gotten away from all of that.

0laura@lemmy.world · 4 months ago

that was also to train ai.

dan@upvote.au · edit-2 4 months ago

No it wasn’t… It was human-assisted OCR to help digitize books. Initially for Project Gutenberg, but then for Google Books once Google acquired it in 2009.

gentooer@programming.dev · 4 months ago

OCR is a form of AI.

HiramFromTheChi@lemmy.world · 4 months ago

There’s nothing that can express my disdain for Google’s reCaptcha.

😒 We’re training its AI models 😒 It’s free labor for Google 😒 Sometimes it wants the corner of an object, sometimes it doesn’t 😒 Wildly inconsistent 😒 Always blurry and hard to see 😒 Seemingly endless 😒 It’s the robot asking us humans if we’re the robots

Petter1@lemm.ee · 4 months ago

Why is that no news to me? How did so many people not know that? Should I have spread the word more, even if all people I told that where likr “yea, yea, of course, but, what can I do? 🤷🏻‍♀️”?

MonkderVierte@lemmy.ml · 4 months ago

Does this work?

https://addons.mozilla.org/de/firefox/addon/noptcha/

I Cast Fist@programming.dev · 4 months ago

Judging from the reviews, it doesn’t

MonkderVierte@lemmy.ml · 4 months ago

Ah, right, there are reviews too.

ohmyiv@lemmy.world · 4 months ago

I tried it before. It worked for me on one small game website for account creation. After that it was more or less useless on any other site. It has a weird focus thing where it’ll try to solve the captcha before you can enter in login details so if by chance the extension works, you’ll fail the login anyways.

It still needs work. I think if the dev can work out those issues it could be great. Until then, it’s pretty much worthless.

lud@lemm.ee · 4 months ago

Alright, I don’t use google.com

Rin@lemm.ee · 4 months ago

But you might still be using their captcha

reddit_sux@lemmy.world · 4 months ago

Sites you visit use Google, their recaptcha, their analytics, their ads.

Etterra@lemmy.world · 4 months ago

We already knew that, but it’s nice re to have data.

cley_faye@lemmy.world · 4 months ago

reCAPTCHA v2 visual challenge images are all pre-labeled and user input plays no role in image labeling

That’s funny, because when I’m faced with this, I keep adding/removing one of the image randomly and it keeps accepting them as ok.

Pulptastic@midwest.social · 4 months ago

I like this strategy.

Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

Google's reCAPTCHA v2 just labor exploitation, boffins say