Technology


A tech news sub for communists

founded 2 years ago
submitted 3 weeks ago* (last edited 3 weeks ago) by yogthos@lemmygrad.ml to c/technology@lemmygrad.ml
 
 

Traditional autoregressive language models generate text sequentially, one token at a time, leading to slower outputs with limited coherence and quality.

Diffusion models are an alternative approach. Instead of direct prediction, they iteratively refine noise, enabling faster generation, dynamic error correction, and greater control. This makes them particularly effective for editing tasks, including in math and code.

https://github.com/ML-GSAI/LLaDA
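As a loose illustration of the iterative-refinement idea (not LLaDA's actual algorithm), here is a toy masked-diffusion decoding loop: start from an all-masked sequence, let a "model" predict every masked position with a confidence, then re-mask the least confident predictions so later steps can revise them. The vocabulary, the stand-in model, and the remasking schedule are all invented for this sketch.

```python
import random

MASK = "<mask>"
VOCAB = ["the", "cat", "sat", "on", "mat"]

def toy_model(tokens):
    """Stand-in denoiser: returns a (token, confidence) pair per position.

    A real diffusion LM would predict a distribution over the vocabulary;
    here masked positions just get a fixed token and a random confidence.
    """
    rng = random.Random(0)
    out = []
    for i, t in enumerate(tokens):
        if t == MASK:
            out.append((VOCAB[i % len(VOCAB)], rng.random()))
        else:
            out.append((t, 1.0))  # already-decided tokens are kept as-is
    return out

def diffusion_generate(length=5, steps=4):
    tokens = [MASK] * length
    for step in range(steps, 0, -1):
        preds = toy_model(tokens)
        # Keep the most confident predictions and re-mask the rest;
        # fewer positions stay masked as the step budget runs out.
        n_remask = (length * (step - 1)) // steps
        order = sorted(range(length), key=lambda i: preds[i][1])
        remask = set(order[:n_remask])
        tokens = [MASK if i in remask else preds[i][0] for i in range(length)]
    return tokens

print(diffusion_generate())
```

Because every position can be revisited until the final step, low-confidence early guesses get corrected instead of being locked in, which is the "dynamic error correction" the post refers to.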


Here are the PRs created entirely by AI:

Devs are talking to the AI, trying to get it to fix its mistakes. It's so cringe.


The goal is to have an agent that can:

  • Understand a complex problem description.
  • Generate initial algorithmic solutions.
  • Rigorously test its own code.
  • Learn from failures and successes.
  • Evolve increasingly sophisticated and efficient algorithms over time.

https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/AlphaEvolve.pdf


MiMo-7B is a series of reasoning-focused language models trained from scratch, demonstrating that small models can achieve exceptional mathematical and code reasoning capabilities, even outperforming larger 32B models. Key innovations include:

  • Pre-training optimizations: Enhanced data pipelines, multi-dimensional filtering, and a three-stage data mixture (25T tokens) with Multiple-Token Prediction for improved reasoning.
  • Post-training techniques: Curated 130K math/code problems with rule-based rewards, a difficulty-driven code reward for sparse tasks, and data re-sampling to stabilize RL training.
  • RL infrastructure: A Seamless Rollout Engine accelerates training/validation by 2.29×/1.96×, paired with robust inference support.

MiMo-7B-RL matches OpenAI’s o1-mini on reasoning tasks, with all models (base, SFT, RL) open-sourced to advance the community’s development of powerful reasoning LLMs.
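As a loose illustration of the "difficulty-driven code reward" bullet: instead of a sparse pass/fail signal, one can grant partial credit per test and weight tests that fewer candidate solutions pass more heavily. The weighting formula and function names below are assumptions for the sketch, not MiMo's published recipe.

```python
def difficulty_weights(pass_counts, n_candidates):
    """Harder tests (passed by fewer earlier candidates) get larger weights."""
    return [1.0 - c / n_candidates for c in pass_counts]

def reward(results, weights):
    """results[i] is True if the solution passed test i; returns credit in [0, 1]."""
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(w for r, w in zip(results, weights) if r) / total

# Example: 3 tests; across 10 earlier candidates they were passed 9, 5, 1 times.
w = difficulty_weights([9, 5, 1], 10)  # ≈ [0.1, 0.5, 0.9]
print(reward([True, True, False], w))  # partial credit for the two easier tests
```

A dense signal like this gives RL training a gradient on hard problems where an all-or-nothing reward would almost always be zero.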

An in-depth discussion of MiMo-7B: https://www.youtube.com/watch?v=y6mSdLgJYQY
