this post was submitted on 26 Aug 2025
I know someone who was a translator between two (less widely spoken) languages, and some specifics I recall from our conversations about work:
None of those would be addressed with LLMs. A small training set for the language (and the language being similar to a few others) is an issue. Anything technical or non-existing would be prone to hallucinations. And tone is difficult enough to convey through text to begin with, let alone with LLM translation.
An LLM gets 95% of the translation done, but the remaining 5% is likely very important, and it takes longer to confirm it's correct than to do it from scratch anyway.
How good is LLM training data for a language spoken by fewer than 10 million people? Keep in mind that most of those people are probably multilingual (i.e. categorizing which language is which by person is harder), and the language itself is similar to its neighbors. And then, again, there are the terms.
Unfortunately, it doesn’t have to be better than the worker. We all know this sucks at most of the things it’s being touted as great at.
It just has to convince management who make decisions that it’ll save money (or that they can spin it that way) for the next quarter. That alone is enough to destroy people’s lives.
I wonder what % of all translations are things like patents, legal papers and movies, and what % are simple localizations. Even in the more complex cases you can pass the entire text through AI first and then just proofread it and correct the errors.
That proofreading is as hard as with code. Defeats the purpose.