It’s also rolling out a filter on sexual content aimed at blocking material related to gender and trans issues, optional but turned on by default across all plans.
The trouble is, many websites don’t fit neatly into one category. That leaves its maverick founder with broad, subjective control over what is allowed or banned. Read the full story.
—James O'Donnell
This startup’s new mechanistic interpretability tool lets you debug LLMs
The San Francisco–based startup Goodfire has released a new tool, Silico, that lets researchers peer inside an AI model and adjust its parameters during training. It could give users more control over how this technology is built than was once thought possible.
The goal is to make building AI models less like alchemy and more like a science. Using a technique called mechanistic interpretability, Silico maps the neurons and pathways inside a model and lets developers tweak them to reduce unwanted behaviors or steer outputs.
By exposing the “knobs and dials,” Goodfire hopes to bring AI training closer to traditional software engineering. Read the full story.
—Will Douglas Heaven
With mass firing, Trump deals a fresh blow to American science
This past week delivered another gut punch for science in the US. This time, the target was the National Science Foundation—a federal agency that funds major research projects to the tune of around $9 billion. On Friday, the 22 scientists overseeing those efforts were all fired.