Tuesday, November 18, 2025

 

Bigger datasets aren’t always better



MIT researchers developed a way to identify the smallest dataset that guarantees optimal solutions to complex problems.



Massachusetts Institute of Technology





Cambridge, MA -- Determining the least expensive path for a new subway line underneath a metropolis like New York City is a colossal planning challenge — involving thousands of potential routes through hundreds of city blocks, each with uncertain construction costs. Conventional wisdom suggests extensive field studies across many locations would be needed to determine the costs associated with digging below certain city blocks.

Because these studies are costly to conduct, a city planner would want to perform as few as possible while still gathering the most useful data for making an optimal decision.

With almost countless possibilities, how would they know where to start?

A new algorithmic method developed by MIT researchers could help. Their mathematical framework provably identifies the smallest dataset that guarantees finding the optimal solution to a problem, often requiring fewer measurements than traditional approaches suggest.

In the case of the subway route, this method considers the structure of the problem (the network of city blocks, construction constraints, and budget limits) and the uncertainty surrounding costs. The algorithm then identifies the minimum set of locations where field studies would guarantee finding the least expensive route. The method also identifies how to use this strategically collected data to find the optimal decision.

This framework applies to a broad class of structured decision-making problems under uncertainty, such as supply chain management or electricity network optimization.

“Data are one of the most important aspects of the AI economy. Models are trained on more and more data, consuming enormous computational resources. But most real-world problems have structure that can be exploited. We’ve shown that with careful selection, you can guarantee optimal solutions with a small dataset, and we provide a method to identify exactly which data you need,” says Asu Ozdaglar, Mathworks Professor and head of the MIT Department of Electrical Engineering and Computer Science (EECS), deputy dean of the MIT Schwarzman College of Computing, and a principal investigator in the Laboratory for Information and Decision Systems (LIDS).

Ozdaglar, co-senior author of a paper on this research, is joined by co-lead authors Omar Bennouna, an EECS graduate student, and his brother Amine Bennouna, a former MIT postdoc who is now an assistant professor at Northwestern University; and co-senior author Saurabh Amin, co-director of Operations Research Center, a professor in the MIT Department of Civil and Environmental Engineering, and a principal investigator in LIDS. The research will be presented at the Conference on Neural Information Processing Systems.

An optimality guarantee

Much of the recent work in operations research focuses on how to best use data to make decisions, but this assumes these data already exist.

The MIT researchers started by asking a different question — what are the minimum data needed to optimally solve a problem? With this knowledge, one could collect far fewer data to find the best solution, spending less time, money, and energy conducting experiments and training AI models.

The researchers first developed a precise geometric and mathematical characterization of what it means for a dataset to be sufficient. Every possible set of costs (travel times, construction expenses, energy prices) makes some particular decision optimal. These “optimality regions” partition the decision space. A dataset is sufficient if it can determine which region contains the true cost.

This characterization offers the foundation of the practical algorithm they developed that identifies datasets that guarantee finding the optimal solution.

Their theoretical exploration revealed that a small, carefully selected dataset is often all one needs.

“When we say a dataset is sufficient, we mean that it contains exactly the information needed to solve the problem. You don’t need to estimate all the parameters accurately; you just need data that can discriminate between competing optimal solutions,” says Amine Bennouna.

Building on these mathematical foundations, the researchers developed an algorithm that finds the smallest sufficient dataset.

Capturing the right data

To use this tool, one inputs the structure of the task, such as the objective and constraints, along with the information they know about the problem.

For instance, in supply chain management, the task might be to reduce operational costs across a network of dozens of potential routes. The company may already know that some shipment routes are especially costly, but lack complete information on others.

The researchers’ iterative algorithm works by repeatedly asking, “Is there any scenario that would change the optimal decision in a way my current data can't detect?” If yes, it adds a measurement that captures that difference. If no, the dataset is provably sufficient.

This algorithm pinpoints the subset of locations that need to be explored to guarantee finding the minimum-cost solution.

Then, after collecting those data, the user can feed them to another algorithm the researchers developed which finds that optimal solution. In this case, that would be the shipment routes to include in a cost-optimal supply chain.

“The algorithm guarantees that, for whatever scenario could occur within your uncertainty, you’ll identify the best decision,” Omar Bennouna says.

The researchers’ evaluations revealed that, using this method, it is possible to guarantee an optimal decision with a much smaller dataset than would typically be collected.

“We challenge this misconception that small data means approximate solutions. These are exact sufficiency results with mathematical proofs. We’ve identified when you’re guaranteed to get the optimal solution with very little data — not probably, but with certainty,” Amin says.

In the future, the researchers want to extend their framework to other types of problems and more complex situations. They also want to study how noisy observations could affect dataset optimality.

 

Time to act and not react: how can the European Union turn the tide of antimicrobial resistance?




As Europe marks European Antibiotic Awareness Day, new data published by ECDC reveal that AMR continues to rise across the EU/EEA



European Centre for Disease Prevention and Control (ECDC)

European Antibiotic Awareness Day 2025 

image: 

From resistance to resilience: healthcare workers leading the change

view more 

Credit: European Centre for Disease Prevention and Control (ECDC)




Despite determined efforts by countries and healthcare professionals, Europe is not on track to meet four of the five AMR targets set by the EU Council for 2030*, according to data released on EAAD.

Rising AMR, together with a shortage of novel effective treatments, constitutes an evolving major public health crisis in Europe and globally. In an interconnected world, AMR further complicates the health challenges that stem from non-communicable diseases, demographic shifts and healthcare workforce shortages.  

‘Tackling AMR requires critical innovation on three key fronts: strong action for responsible antibiotic use, sustained and standardised infection prevention and control practice, and novel antibiotics in the pipeline’, says ECDC Director Dr Pamela Rendi-Wagner.

Europe is not on track to meet the 2030 AMR targets

Since 2019, the estimated incidence of bloodstream infections caused by carbapenem-resistant Klebsiella pneumoniae has increased by more than 60%, despite a target of 5% reduction by 2030. Similarly, those caused by third-generation cephalosporin-resistant Escherichia coli have risen by more than 5%, despite a target of 10% reduction.

Antibiotic consumption also increased in 2024, contrary to the 20% reduction target. Meanwhile, the proportion of first-line antibiotics used – those in the ‘Access’ group of the World Health Organization (WHO)’s AWaRe classification, which should represent at least 65% of total use – has remained stagnant at around 60%.

A human and societal crisis

ECDC estimates that antimicrobial-resistant infections cause more than 35 000 deaths every year in the EU/EEA, representing a substantial burden on individuals, societies and healthcare systems. The rise of resistant infections undermines modern medicine, jeopardising life-saving procedures like organ transplants, cancer therapy, surgery and intensive care.

‘Behind every statistic is a person whose treatment options are running out – a child, a parent, a grandparent. Antimicrobial resistance is not just a medical issue – it’s a societal one. We must ensure that no one in Europe is left without an effective treatment option,’ says Dr Diamantis Plachouras, Head of the Antimicrobial Resistance and Healthcare-Associated Infections at ECDC.

The perfect storm: rising AMR and shortage of effective treatments requires innovation

Several factors are contributing to the increase in difficult-to-treat infections: an ageing population with chronic underlying diseases that make them more vulnerable to infections, cross-border transmission of resistant microorganisms, and persistent high antibiotic use combined with gaps in infection prevention and control.

At the same time, the global antibiotic pipeline remains limited, especially against critical public health priority microorganisms like carbapenem-resistant gram-negative bacteria. Innovative solutions are needed to slow the rise of AMR, but there are very few new antibiotics offering novel mechanisms of action nearing approval.

Additionally, there is suboptimal use of first-line antibiotics from the ‘Access’ group as per WHO’s AWaRe classification, and growing dependence on last-resort antibiotics. These challenges highlight the need for coordinated action to secure equitable access, sustainable production, and responsible use of existing and future antibiotics.

ECDC’s role in supporting EU/EEA countries

ECDC continues to monitor AMR and antimicrobial consumption across Europe, assessing related public health risks and estimating the burden of resistant infections. The Centre works closely with EU/EEA countries to strengthen surveillance systems, enhance laboratory capacity, and leverage digital tools and electronic health records for better data-driven decision-making.

ECDC’s support to expand genomic surveillance of resistant pathogens has enabled earlier detection of emerging threats, improved outbreak tracking, and strengthened regional and global collaboration on AMR control. Tailored country support – through in-depth assessments of preparedness and response capacities for AMR and healthcare-associated infections – also ensures that all EU/EEA countries are better equipped to address this ongoing and evolving threat.

‘Antimicrobial resistance is an evolving challenge, but Europe can still make real progress. Together we can build a safer future, where effective treatment remains available for generations to come’, adds Dr Plachouras.

 

Humans are evolved for nature, not cities





University of Zurich




A new paper by evolutionary anthropologists Colin Shaw (University of Zurich) and Daniel Longman (Loughborough University) argues that modern life has outpaced human evolution. The study suggests that chronic stress and many modern health issues are the result of an evolutionary mismatch between our primarily nature-adapted biology and the industrialized environments we now inhabit.


A species out of sync with its environment

Over hundreds of thousands of years, humans adapted to the demands of hunter-gatherer life – high mobility, intermittent stress and close interaction with natural surroundings. Industrialization, by contrast, has transformed the human environment in only a few centuries, by introducing noise, air and light pollution, microplastics, pesticides, constant sensory stimulation, artificial light, processed foods and sedentary lifestyles.

“In our ancestral environments, we were well adapted to deal with acute stress to evade or confront predators,” explains Colin Shaw, who leads the Human Evolutionary EcoPhysiology (HEEP) research group together with Daniel Longman. “The lion would come around occasionally, and you had to be ready to defend yourself – or run. The key is that the lion goes away again.”

Today’s stressors – traffic, work demands, social media and noise, to name just a few – trigger the same biological systems, but without resolution or recovery. “Our body reacts as though all these stressors were lions,” says Longman. “Whether it’s a difficult discussion with your boss or traffic noise, your stress response system is still the same as if you were facing lion after lion. As a result, you have a very powerful response from your nervous system, but no recovery.”

Health and reproduction under pressure

In their review, Shaw and Longman synthesize evidence suggesting that industrialization and urbanization are undermining human evolutionary fitness. From an evolutionary standpoint, the success of a species depends on survival and reproduction. According to the authors, both have been adversely affected since the Industrial Revolution.

They point to declining global fertility rates and rising levels of chronic inflammatory conditions such as autoimmune diseases as signs that industrial environments are taking a biological toll. “There’s a paradox where, on the one hand, we’ve created tremendous wealth, comfort and healthcare for a lot of people on the planet,” Shaw says, “but on the other hand, some of these industrial achievements are having detrimental effects on our immune, cognitive, physical and reproductive functions.”

One well-documented example is the global decline in sperm count and motility observed since the 1950s, which Shaw links to environmental factors. “This is believed to be tied to pesticides and herbicides in food, but also to microplastics,” he notes.

Designing environments for wellbeing

Given the pace of technological and environmental change, biological evolution cannot keep up. “Biological adaptation is very slow. Longer-term genetic adaptations are multigenerational – tens to hundreds of thousands of years,” Shaw says.

That means the mismatch between our evolved physiology and modern conditions is unlikely to resolve itself naturally. Instead, the researchers argue, societies need to mitigate these effects by rethinking their relationship with nature and designing healthier, more sustainable environments.

According to Shaw, addressing the mismatch requires both cultural and environmental solutions. “One approach is to fundamentally rethink our relationship with nature – treating it as a key health factor and protecting or regenerating spaces that resemble those from our hunter-gatherer past,” he says. Another is to design healthier, more resilient cities that take human physiology into account.

“Our research can identify which stimuli most affect blood pressure, heart rate or immune function, for example, and pass that knowledge on to decision-makers,” Shaw explains. “We need to get our cities right – and at the same time regenerate, value and spend more time in natural spaces.”

 

Ancient ‘animal GPS system’ identified in magnetic fossils




University of Cambridge


Microscopic and magnetic characterisation of a giant spearhead magnetofossil. 

image: 

The earliest evidence of an internal ‘GPS’ system in an animal has been identified by researchers, which could help explain how modern birds and fish evolved the ability to use the Earth’s magnetic field to navigate long distances.

view more 

Credit: Rich Harrison






The earliest evidence of an internal ‘GPS’ system in an animal has been identified by researchers, which could help explain how modern birds and fish evolved the ability to use the Earth’s magnetic field to navigate long distances.

The tiny magnetic fossils – dating from 97 million years ago – were buried in ancient seafloor sediments, left behind by a mysterious, unidentified organism.

Shaped like spearheads, spindles, bullets and needles, and no larger than a bacterial cell, scientists believe these ‘magnetofossils’ are biological in origin, but they don’t know what creature made them, or why.

Now, researchers have now solved part of the mystery and found that these fossils may have served as an animal GPS system, enabling organisms to read Earth’s magnetic field like a map.

The researchers, from the University of Cambridge and the Helmholtz-Zentrum Berlin, captured the first 3D images of the fossils’ magnetic structure, and revealed features optimised to detect both the direction and strength of Earth’s magnetic field, which would have aided navigation.

“Whatever creature made these magnetofossils, we now know it was most likely capable of accurate navigation,” said Professor Rich Harrison from Cambridge’s Department of Earth Sciences, who co-led the research.

The discovery provides the first direct evidence that animals have been navigating using the Earth’s magnetic field for at least 97 million years. It may also offer insights into how animals evolved this ability, known as ‘magnetoreception’. The results are reported in the journal Communications Earth & Environment.

Life has evolved a range of extraordinary senses, and magnetoreception is one of the most poorly understood. Birds, fish, and insects use the Earth’s magnetic field to navigate vast distances, but how they do this is still unclear. One theory is that tiny crystals of magnetite within the body align with the Earth's magnetic field, acting like microscopic compass needles.

Certain bacteria found in lakes and other bodies of water possess a primitive form of magnetoreception. Chains of tiny magnetic particles inside the bacteria allow them to line up with the magnetic field, helping them swim to their preferred depth in the water column.

“At just 50–100 nanometres wide, these particles are the perfect compass needles,” said Harrison. “If you want to create the most efficient magnetic sense, smaller is better.”

But the magnetofossils the team studied for the current study are 10 to 20 times larger than the magnetic particles used by bacteria, and were retrieved from a site in the North Atlantic Ocean. Previously, some researchers had argued that ‘giant’ magnetofossils may have served as protective spines.

However, model simulations have suggested that they might also possess advanced magnetic properties, something Harrison wanted to explore further. “It looks like this creature was carefully controlling the shape and structure of these fossils, and we wanted to know why,” he said.

The researchers applied a new technique to visualise the fossil’s internal structure, revealing how magnetic moments (tiny magnetic fields generated by spinning electrons) are arranged inside the magnetofossil.

Until now, scientists had been unable to capture 3D magnetic images of larger particles, such as giant magnetofossils, because X-rays couldn’t penetrate them.

The research was made possible using a technique developed by co-author Claire Donnelly at the Max Planck Institute in Germany and carried out at the Diamond X-ray facility in Oxford.

“That we were able to map the internal magnetic structure with magnetic tomography was already a great result, but the fact that the results provide insight into the navigation of creatures millions of years ago is really exciting,” said Donnelly.

Their images revealed an intricate magnetic configuration, with magnetic moments swirling around a central line running through the fossil’s interior, forming a tornado-like vortex pattern.

This vortex magnetism provides ideal properties for navigation, said Harrison, generating a ‘wobble’ in response to tiny changes in the strength of the magnetic field that translate into detailed map information. “This magnetic particle not only detects latitude by sensing the tilt of Earth’s magnetic field but also measures its strength, which can change with longitude,” he said.

The geometry of this vortex structure is highly stable, meaning it can resist small environmental disturbances that may otherwise disrupt navigation. “If nature developed a GPS, a particle that can be relied upon to navigate thousands of kilometres across the ocean, then it would be something like this,” he said.

In solving the enduring mystery over the fossils’ function, the work also helps narrow the search for the animal that made them. “The next question is what made these fossils,” said Harrison. “This tells us we need to look for a migratory animal that was common enough in the oceans to leave abundant fossil remains.”

Harrison suggests that eels could be a potential candidate, since they evolved around 100 million years ago and remain one of the least understood and elusive animals. European and American eels travel thousands of kilometres from freshwater rivers to spawn in the Sargasso Sea. Though they can sense Earth’s magnetic field, how they do so is unclear. Magnetite particles have been detected in eels but not yet imaged directly in their cells and tissues, partly because of their tiny size and the fact they could be hidden anywhere in the body.

Harrison worked closely with Sergio Valencia from Helmholtz-Zentrum Berlin in designing the research. "This was a truly international collaboration involving experts from different fields, all working together to shed light on the possible functionality of these magnetofossils,” said Valencia.

Despite their as-yet-unknown host, “giant magnetofossils mark a key step in tracing how animals evolved basic bacterial magnetoreception into highly-specialised, GPS-like navigation systems,” Harrison said.

The research was supported in part by the European Union, the European Research Council and the Royal Society. Rich Harrison is a Fellow of St Catharine’s College, Cambridge.