"Big data" they call it.
Technology
Which posts fit here?
Anything that is at least tangentially connected to the technology, social media platforms, informational technologies and tech policy.
Rules
1. English only
Title and associated content has to be in English.
2. Use original link
Post URL should be the original link to the article (even if paywalled) and archived copies left in the body. It allows avoiding duplicate posts when cross-posting.
3. Respectful communication
All communication has to be respectful of differing opinions, viewpoints, and experiences.
4. Inclusivity
Everyone is welcome here regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, education, socio-economic status, nationality, personal appearance, race, caste, color, religion, or sexual identity and orientation.
5. Ad hominem attacks
Any kind of personal attacks are expressly forbidden. If you can't argue your position without attacking a person's character, you already lost the argument.
6. Off-topic tangents
Stay on topic. Keep it relevant.
7. Instance rules may apply
If something is not covered by community rules, but are against lemmy.zip instance rules, they will be enforced.
Companion communities
!globalnews@lemmy.zip
!interestingshare@lemmy.zip
Icon attribution | Banner attribution
If someone is interested in moderating this community, message @brikox@lemmy.zip.
It forgot the server logs that will never be read with no proper retention strategy.
The Cloud made the crap data problem infinitely worse. The Cloud is what happens when the cost of storing data is less than the cost of figuring out what to do with the crap.
Yeah, cheaper to hold it just in case is actually a best case scenario for audit trails and the occasional look back. If 99.9999% is useless down thw road but one file answers some obscure question and it would have been more expensive to sort through it, then the cost savings and benefit was worth it financially.
And nobody in management cares because it’s so ‘cheap’ to store data. And this is what AI is being trained on. And we wonder why AI gets stuff wrong so often? Crap data in. Crap data out. And nobody cares.
Hold up. No, you don't get to blame cheap data retention for AI being shit. AI is shit becsuse they train it on this shitty data instead of curating better quality data. AI gets shit wrong because they are training it on reddit data without taking into account humor subreddits instead of educationally verified content. Libraries curate their content,AI just jams whatever they can find into their AI model.
People and companies are not responsible for AI using their shitty content and presenting it as a reliable source of information.
So are the photos on my phone. I started getting old and now I can see and can't remember so everything gets a picture! Went from big butts and boobs to small text and team viewer logins.
99.9999%