New

Reasoning Models Fabricate 75% of Their Explanations (ArXiv:2505.05410)

by Aedelon · 2 minutes ago

1|ai.gopubby.com|0 comments

Shifting from Deterministic to Probabilistic Software – Are We Uncomfortable?

by ggonweb · 2 minutes ago

1|medium.com|0 comments

Bad Grades

by avocadosword · 3 minutes ago

1|localthunk.com|0 comments

Gamepad Controls Elden Ring Streamer IRL Using a Wild Balance Hack

by PaulHoule · 4 minutes ago

1|kotaku.com|0 comments

Show HN: Nexus – A social platform where your GitHub profile is your identity

by tita-n · 4 minutes ago

Hey HN,

I built Nexus because I kept asking why developers share their work on Twitter when GitHub already has everything that matters — contributions, repos, streaks, stack.

Nexus uses GitHub OAuth so your profile is built automatically. No bios to write, no follower games. Features so far: project showcases with repo previews, syntax-highlighted code snippets in the feed, threaded discussions, and a trending algorithm.

Just shipped the social feed (Phase 3). Very early, very few users. Looking for honest feedback from people who actually build things.

What would make you use this over just tweeting about your projects?

1|nexus-fqt4.onrender.com|0 comments

Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU

by xaskasdf · 5 minutes ago

Hi everyone, I'm kinda involved in some retrogaming, and during some experiments I ran into the following question: "Would it be possible to run transformer models while bypassing the CPU/RAM, connecting the GPU directly to the NVMe?"

This is the result of that question and some weekend vibecoding (the readme links the library repository as well). It seems to work, even on consumer GPUs, though it should work better on professional ones.

2|github.com|0 comments

Anki-CLI

by ubermenchh · 6 minutes ago

1|github.com|0 comments

We estimate that Claude Opus 4.6 has a 50%-time-horizon of around 14.5 hours

by doener · 7 minutes ago

1|twitter.com|0 comments

Online Pebble Development

by teekert · 12 minutes ago

2|cloudpebble.repebble.com|0 comments

Laser writing in glass for dense, fast and efficient archival data storage

by macleginn · 13 minutes ago

1|www.nature.com|0 comments

I got annoyed by Claude Code's history, so I built a search CLI

by madzarm · 13 minutes ago

2|github.com|2 comments

India Open Network for Digital Commerce

by kaladin-jasnah · 14 minutes ago

1|www.ondc.org|0 comments

Grove: Detect worktree conflicts at write time to scale parallel agents

by nathan9999 · 17 minutes ago

1|github.com|1 comment

Why optimizing AUC may be insufficient for clinical deterioration systems

by ameenfayed · 18 minutes ago

Medical ML research and competitions often optimize ROC-AUC as the primary performance metric.

However, in real hospital environments, the central question is not classification accuracy — it is escalation timing.

In deterioration detection systems:

• A noisy alert creates alarm fatigue.
• A late alert costs lives.
• A static classifier may fail to reflect dynamic physiology.

I’ve been exploring a framework that introduces:

• Dual-threshold activation (high/low)
• Temporal stability validation
• False-alarm suppression logic
• Governed escalation timing

The aim is to shift from probability scoring toward structured decision triggering.
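One way to picture the move from scoring to triggering is a small hysteresis trigger. This is only a sketch of the general idea, not the framework described in the post: the class name, the helper `escalation_indices`, and the threshold values (0.8, 0.5, three consecutive readings) are all hypothetical and chosen for illustration. An alert fires only after the risk score has stayed at or above the high threshold for several readings, and no new alert can fire until the score has dropped below the low threshold.

```python
from dataclasses import dataclass


@dataclass
class EscalationTrigger:
    """Illustrative hysteresis trigger; all defaults are hypothetical values."""
    high: float = 0.8          # assumed activation threshold
    low: float = 0.5           # assumed release threshold
    min_consecutive: int = 3   # readings that must stay >= high before alerting
    active: bool = False       # True while an escalation is in effect
    _streak: int = 0           # current run of readings at or above `high`

    def update(self, risk: float) -> bool:
        """Feed one risk score; return True only on the reading that escalates."""
        if self.active:
            # Suppress repeat alerts until risk falls below the low threshold.
            if risk < self.low:
                self.active = False
                self._streak = 0
            return False
        # Temporal stability check: count consecutive high readings.
        self._streak = self._streak + 1 if risk >= self.high else 0
        if self._streak >= self.min_consecutive:
            self.active = True
            return True
        return False


def escalation_indices(scores, **kwargs):
    """Return the positions in `scores` at which an escalation would fire."""
    trigger = EscalationTrigger(**kwargs)
    return [i for i, s in enumerate(scores) if trigger.update(s)]


if __name__ == "__main__":
    scores = [0.9, 0.4, 0.85, 0.9, 0.95, 0.9, 0.6, 0.9, 0.3, 0.9, 0.9, 0.9]
    print(escalation_indices(scores))
```

In this toy series, the isolated spike at the start does not escalate, and the sustained high readings after the first alert stay silent until the score has dipped below the low threshold. A design like this trades some latency (the persistence window) for fewer spurious alerts, so any real choice of window and thresholds would need clinical validation.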

I’m curious how others here would approach modeling escalation timing in a clinically responsible way.

Would love perspectives from ML engineers and clinicians.

1||0 comments

Fury over Discord's age checks explodes after shady Persona test in UK

by duxup · 18 minutes ago

4|arstechnica.com|0 comments

Visualize Your Turborepo Workspace

by kuzeykose · 20 minutes ago

1|turbograph.dev|0 comments

Shai-Hulud-Style NPM Worm Hijacks CI Workflows and Poisons AI Toolchains

by jicea · 21 minutes ago

2|socket.dev|0 comments

Show HN: Formally Verified a Millennium Prize Problem in Coq: Yang-Mills Mass Gap

by shariq81 · 22 minutes ago

Hi HN, I'm an independent researcher. Over the last several months, I worked alongside a neuro-symbolic AI daemon to formally verify the Clay Millennium Prize "Yang-Mills Mass Gap" problem directly in the Coq theorem prover.

We mapped the finite lattice topology entirely to the ℝ⁴ continuum by reconstructing the 5 Osterwalder-Schrader axioms, isolating the Millennium formulation into exactly 657 sequential Qed proofs.

We aggressively removed every single heuristic Admitted gap from the main topology. The entire framework now rests on exactly 4 standard textbook axioms (e.g., finite-dimensional Perron-Frobenius theorem, standard statistical mechanics).

The repository contains the raw coqc logic. The formally timestamped preprint is on Zenodo (DOI: 10.5281/zenodo.18726858).

I decided to open-source the kernel execution rather than fight arXiv gatekeepers. Happy to answer any questions about theorem proving, the physics, or the AI methodology.

1|github.com|0 comments

Trump says he'll raise tariffs to 15 percent after Supreme Court ruling

by _rend · 24 minutes ago

4|apnews.com|3 comments

Apple's Ferret AI model is a step towards Siri seeing and controlling apps

by geox · 24 minutes ago

2|appleinsider.com|0 comments