Monday, December 01, 2025

 

Natural language is more complex than it strictly needs to be – and for good reason





Saarland University

Michael Hahn 

image: 

Michael Hahn, Chair for Language, Computation, and Cognition at Saarland University

view more 

Credit: Saarland University/Thorsten Mohr




Human languages are complex phenomena. Around 7,000 languages are spoken worldwide, some with only a handful of remaining speakers while others, such as Chinese, English, Spanish and Hindi, are spoken by billions. Despite their profound differences, they all share a common function: they convey information by combining individual words into phrases – groups of related words – which are then assembled into sentences. Each of these units has its own meaning, which in combination ultimately form a comprehensible whole.

'This is actually a very complex structure. Since the natural world tends towards maximizing efficiency and conserving resources, it's perfectly reasonable to ask why the brain encodes linguistic information in such an apparently complicated way instead of digitally, like a computer,' explains Michael Hahn. Hahn, Professor of Computational Linguistics at Saarland University, has been examining this question together with his colleague Richard Futrell from the University of California, Irvine. Encoding information in a classical binary sequence of ones and zeros would, in theory at least, be far more efficient because it compresses information much more tightly than natural languages. So why don't we all communicate – metaphorically speaking – like R2-D2 from Star Wars, but instead speak the way we do? Hahn and Futrell have now found an answer to this conundrum.

'Human language is shaped by the realities of life around us,' says Michael Hahn. 'If, for instance, I was to talk about half a cat paired with half a dog and I referred to this using the abstract term "gol", nobody would know what I meant, as it's pretty certain that no one has seen a gol – it simply does not reflect anyone's lived experience. Equally, it makes no sense to blend the words "cat" and "dog" into a string of characters that uses the same letters but is impossible to interpret,' he continues. We simply wouldn't be able to process a string like 'gadcot', even if it technically contains the letters of both words. In contrast, the phrase "cat and dog" does form a meaningful linguistic unit because the two words "cat" and "dog" refer to animals that virtually everyone will be familiar with.

Hahn summarizes the main findings of the study as follow: 'Put simply, it's easier for our brain to take what might seem to be the more complicated route.' Although the information is not in its most compressed form, the computational load for the brain is much lower because the human brain processes language in constant interaction with the familiar natural environment. Coding the information in a purely binary digital form might seem more efficient, as the information can be transmitted in a shorter time, but such a code would be detached from our real-world experience. Michael Hahn says the daily drive to work provides a good analogy: 'On our usual commute, the route is so familiar to us that the drive is almost like on autopilot. Our brain knows exactly what to expect, so the effort it needs to make is much lower. Taking a shorter but less familiar route feels much more tiring, as the new route demands that we be far more attentive during the drive.' Mathematically speaking: 'The number of bits the brain needs to process is far smaller when we speak in familiar, natural ways.'

Encoding and decoding information digitally would therefore require significantly more cognitive effort for both speaker and listener. Instead, the human brain continuously calculates the probabilities of words and phrases occurring in sequence, and because we use our native language daily for tens of thousands of days across a lifetime, these sequence patterns become deeply ingrained, reducing the computational load even further.

Hahn offers another example: 'When I say the German phrase "Die fünf grünen Autos" (Engl.: "the five green cars"), the phrase will almost certainly make sense to another German speaker, whereas "Grünen fünf die Autos" (Engl.: "green five the cars") won't,' he says.

Consider what happens when a speaker utters the phrase 'Die fünf grünen Autos'. It begins with the German definite article 'Die'. At that point, a German-speaking listener will already know that the word 'Die' is likely to signal a feminine singular noun or a plural noun of any gender. This allows the brain to rule out masculine or neuter singular nouns immediately. The next word, 'fünf', is highly likely to refer to something countable, which rules out non-enumerable concepts like 'love' or 'thirst'. The next word in the sequence 'grünen' tells the listener that the as-yet-unknown noun will be in the plural form and is green in colour. It could be cars, but could just as well be bananas or frogs. Only when the final word in the sequence 'Autos' is uttered does the brain resolve the remaining ambiguity. As the phrase unfolds, the number of interpretative possibilities narrows until (in most cases) only one final interpretation is left.

However, in the phrase 'Grünen fünf die Autos' (Engl.: 'green five the cars'), this logical chain of predictions and correlations breaks down. Our brain cannot construct meaning from the utterance because the expected sequence of cues is disrupted.

Michael Hahn and his US colleague Richard Futrell have now demonstrated these relationships mathematically. The significance of their study is underscored by its publication in the high-impact journal Nature Human Behaviour. Their insights could prove valuable, for example, in the further development of the large language models (LLMs) that underpin generative AI systems such as ChatGPT or Microsoft's Copilot.

 

How reactive oxygen species target viruses differently: new clues for safer water disinfection




Nanjing Institute of Environmental Sciences, MEE
Heterogeneity in biological mechanisms of different structural viruses inactivation by various reactive oxygen species (ROS). 

image: 

Heterogeneity in biological mechanisms of different structural viruses inactivation by various ROS. Schematic illustration of how structurally distinct viruses respond to ROS including hydroxyl radicals (•OH), singlet oxygen (1O2), and superoxide radicals (•O2⁻). Enveloped RNA virus (phi6) exhibits extensive lipid and protein oxidation, while non-enveloped ssRNA (MS2), ssDNA (phix174), and dsDNA (T4) viruses show varying levels of capsid and genome damage. Comparative bars on the right summarize the relative inactivation efficiency and damage intensity of ROS types, revealing that •OH causes the strongest overall inactivation, 1O2 preferentially oxidizes proteins, and •O2⁻ mainly targets RNA.

view more 

Credit: Eco-Environment & Health





Viruses in water pose major public health threats, yet their structural diversity makes them unequally susceptibility to disinfection. This study systematically explored how reactive oxygen species (ROS)—including hydroxyl radicals (•OH), singlet oxygen (1O2), and superoxide radicals (•O2⁻)—inactivate viruses with distinct structures. Using visible-light photocatalysis, the team quantified second-order rate constants and mapped biological damage to viral proteins, genomes, and lipids. The results revealed clear heterogeneity: enveloped RNA viruses were most susceptible to oxidation, while double-stranded DNA viruses showed strong resistance. These findings uncover the kinetic and mechanistic basis of viral susceptibility to ROS and provide theoretical guidance for advanced oxidation technologies in safe water treatment.

Waterborne viruses such as MS2 and T4 can survive conventional disinfection, posing challenges to public health systems. Advanced oxidation processes (AOPs) generate reactive oxygen species (ROS) that can destroy viral structures, offering promising disinfection solutions. However, the susceptibility of viruses with different genomes and envelopes to specific ROS remains poorly understood. Previous research has shown that single-stranded RNA viruses are more easily oxidized than DNA viruses, but the kinetics and mechanisms behind these variations are unclear. The complex interactions between viral components—proteins, lipids, and nucleic acids—and ROS still lack systematic characterization. Based on these challenges, it is necessary to conduct in-depth research on the heterogeneous susceptibility of structurally distinct viruses to various ROS.

Researchers from Jilin University and Zhejiang University have uncovered how viruses with distinct structural and genomic features respond differently to oxidative stress. The study (DOI: 10.1016/j.eehl.2025.100178), published on August 20, 2025, in Eco-Environment & Health, demonstrates the kinetic and biological mechanisms underlying virus inactivation by ROS generated through visible-light photocatalysis. Using four bacteriophage models—MS2, phi6, phix174, and T4—the team quantified their susceptibility to hydroxyl radicals, singlet oxygen, and superoxide radicals, revealing key structural determinants that govern oxidative resistance and susceptibility in viruses.

The study employed visible-light catalytic systems using g-C3N4, TiO2, and C60 nanomaterials to generate dominant ROS species (•O2⁻, •OH, and 1O2). Quantitative kinetic modeling showed significant variation in second-order inactivation rate constants, ranging from 105 to 1010 M⁻1 s⁻1. The viruses exhibited a consistent susceptibility ranking of phi6 > MS2 > phix174 > T4, reflecting their distinct envelopes and genome types. Hydroxyl radicals displayed broad-spectrum oxidative power, while singlet oxygen selectively oxidized capsid proteins, and superoxide radicals preferentially damaged RNA. Transmission electron microscopy revealed that ROS exposure caused capsid distortion, head-tail separation, and envelope collapse, depending on the viral structure. Protein assays, nucleic acid degradation measurements, and lipid peroxidation analyses confirmed that the structural complexity of viral proteins and the double-stranded nature of DNA confer greater resistance. Furthermore, tests in natural water matrices showed that dissolved organic matter and pH significantly reduced inactivation efficiency, with 1O2 proving the most stable and environmentally compatible oxidant.

“Understanding how different ROS interact with viral structures allows us to design more targeted and efficient disinfection systems,” said Professor Cong Lyu, the study’s corresponding author. “Our results highlight that viral resistance is not random—it’s rooted in molecular architecture. Enveloped and single-stranded RNA viruses are inherently more susceptible to oxidative attack, while complex double-stranded DNA viruses exhibit remarkable resistance. This knowledge provides a scientific foundation for improving AOPs in real-world water treatment, ensuring both safety and sustainability.”

This research offers a mechanistic framework for optimizing water disinfection technologies based on virus type and environmental conditions. By linking viral structure to ROS reactivity, it establishes predictive principles for designing selective and energy-efficient oxidation systems. The findings suggest that singlet oxygen–dominated photocatalysis, owing to its stability and selectivity, is particularly suitable for complex water environments. Integrating these insights into advanced oxidation technologies could enhance the safety of municipal and wastewater treatment, support emergency epidemic control, and reduce chemical disinfectant usage—advancing sustainable and resilient public health protection strategies.

###

References

DOI

10.1016/j.eehl.2025.100178

Original Source URL

https://doi.org/10.1016/j.eehl.2025.100178

Funding Information

The present work was funded by the Science and Technology Development Program of Jilin Province, China (No. 20220101214JC).

About Eco-Environment & Health (EEH) 

Eco-Environment & Health (EEH) is an international and multidisciplinary peer-reviewed journal designed for publications on the frontiers of the ecology, environment and health as well as their related disciplines. EEH focuses on the concept of “One Health” to promote green and sustainable development, dealing with the interactions among ecology, environment and health, and the underlying mechanisms and interventions. Our mission is to be one of the most important flagship journals in the field of environmental health.

Why does a faucet drip?





Universiteit van Amsterdam





Some phenomena in our daily lives are so commonplace that we don't realize there could be some very interesting physics behind them. Take a dripping faucet: why does the continuous stream of water from a faucet eventually break up into individual droplets? A team of physicists studied this question and reached surprising conclusions.

The breakthrough in understanding how a water jet breaks up into droplets was made by a team consisting of Stefan Kooij, Daniel T. A. Jordan, Cees J. M. van Rijn, and Daniel Bonn from the University of Amsterdam (Van der Waals-Zeeman Institute / Institute of Physics), along with Neil M. Ribe from the Université Paris-Saclay. The study was published in the journal Physical Review Letters.

Small waves, big consequences

In their study, “What Determines the Breakup Length of a Jet?”, the authors convincingly demonstrate that the initial disturbances that lead to the breakup of so-called laminar jets of liquids into droplets, are not primarily caused by external noise, turbulence, or imperfections in the nozzle, as is often assumed. Instead, their extensive experiments, using a wide range of fluids, nozzles, and flow conditions, reveal that the decisive disturbances are caused by intrinsic thermal capillary waves—thermal fluctuations on the scale of angstroms, tenths of a millionth of a millimeter.

The underlying process is similar to Brownian motion, where random molecular movements cause microscopically visible particles to “dance”. The researchers discovered that in the case of liquid jets, similar thermal oscillations on the surface of a water jet are sufficient to eventually cause the jet to break up into droplets. These tiny undulations, just a few angstroms in size, are then amplified by the so-called Rayleigh-Plateau instability, until the jet breaks.

From nanojets to macrojets

The team was able to confirm this hypothesis by image analysis of beam modulations and systematic variation of the parameters, and found a surprisingly good agreement between the experimentally measured length of the pieces in which the beam breaks up (the so-called breakup length) and a model based on thermal noise, valid over seven orders of magnitude — from so-called nanojets up to macroscopically large jets such as those from a faucet.

The work challenges a nearly 200-year-old notion about the importance of external noise in droplet formation, demonstrating that even in carefully isolated setups, the breakup length is ultimately determined by a fundamental thermal mechanism. These insights yield new fundamental knowledge for diverse application areas involving droplet formation, such as inkjet printing, food technology, and aerosol drug delivery.

 

Comprehensive review reveals how cities can learn from each other to build smarter, more sustainable urban systems




Tsinghua University Press
Steps of CCTL 

image: 

The systematic CCTL framework consists of four essential steps: data fetching and preprocessing, cross-domain linking and feature alignment, model refinement and generalization, and deployment with continuous learning.

view more 

Credit: Communications in Transportation Research




As urbanization accelerates globally and cities face mounting pressure to become both smarter and more sustainable, a fundamental question emerges: How can cities effectively learn from each other’s experiences to overcome data limitations and accelerate urban innovation?

To answer this question, researchers at Shanghai University, Tsinghua University and Northeastern University have conducted the first comprehensive review of Cross-City Transfer Learning (CCTL) in urban computing, systematically analyzing how knowledge transfer between cities can address critical challenges in smart city development and sustainable transportation.

 

They published their study on 3 September 2025, in Communications in Transportation Research.

 

“Despite the growing importance of CCTL, no dedicated review had systematically examined its applications and challenges. Our work fills this crucial gap by providing a holistic overview that bridges urban computing and transfer learning,” explains Ying Yang, a professor at the School of Management at Shanghai University.

 

Mapping the landscape of cross-city knowledge transfer

The review systematically examines CCTL applications across three major urban computing domains: prediction tasks (traffic flow forecasting, air quality monitoring), detection tasks (crime prediction, traffic accident detection), and deployment tasks (facility location planning, infrastructure optimization).

Through extensive literature analysis, the researchers identified common patterns and challenges across different applications. “We found that while cities share similar urban management objectives, significant variations in infrastructure, policy environments, and socioeconomic characteristics create substantial domain gaps that current methods struggle to bridge effectively,” Jiahao Zhan, a Ph.D. majoring in management and science engineering, says.

 

Policy transfer emerges as critical application

Beyond technical applications, the review highlights the growing importance of cross-city policy transfer for sustainable urban development. Cities can learn from successful low-carbon transportation policies, public transit initiatives, and shared mobility programs implemented elsewhere, significantly reducing implementation risks and accelerating the transition to climate-neutral transport systems.

 

Identifying future research directions

The comprehensive analysis reveals five major challenges: data heterogeneity and integration difficulties, limited model adaptability across diverse urban contexts, environmental and infrastructure differences between cities, computational resource constraints, and privacy concerns in cross-city data sharing.

“Our review establishes CCTL as a critical research domain and provides researchers with practical guidance on datasets, methods, and evaluation metrics. As cities worldwide pursue climate neutrality goals, the ability to systematically learn from each other’s experiences will become increasingly vital,” explains Yang Liu, an Associate Research Professor at the School of Vehicle and Mobility, Tsinghua University.

 

The above research is published in Communications in Transportation Research (COMMTR), which is a fully open access journal co-published by Tsinghua University Press and Elsevier. COMMTR publishes peer-reviewed high-quality research representing important advances of significance to emerging transport systems. COMMTR is also among the first transportation journals to make the Replication Package mandatory to facilitate researchers, practitioners, and the general public in understanding and advancing existing knowledge. At its discretion, Tsinghua University Press will pay the open access fee for all published papers in 2025.

 

About Communications in Transportation Research

Communications in Transportation Research was launched in 2021, with academic support provided by Tsinghua University and China Intelligent Transportation Systems Association. The Editors-in-Chief are Professor Xiaobo Qu, a member of the Academia Europaea from Tsinghua University and Professor Shuai’an Wang from Hong Kong Polytechnic University. The journal mainly publishes high-quality, original research and review articles that are of significant importance to emerging transportation systems, aiming to serve as an international platform for showcasing and exchanging innovative achievements in transportation and related fields, fostering academic exchange and development between China and the global community.

It has been indexed in SCIE, SSCI, Ei Compendex, Scopus, CSTPCD, CSCD, OAJ, DOAJ, TRID and other databases. It was selected as Q1 Top Journal in the Engineering and Technology category of the Chinese Academy of Sciences (CAS) Journal Ranking List. In 2022, it was selected as a High-Starting-Point new journal project of the “China Science and Technology Journal Excellence Action Plan”. In 2024, it was selected as the Support the Development Project of “High-Level International Scientific and Technological Journals”. The same year, it was also chosen as an English Journal Tier Project of the “China Science and Technology Journal Excellence Action Plan Phase Ⅱ”. In 2024, it received the first impact factor (2023 IF) of 12.5, ranking Top1 (1/58, Q1) among all journals in “TRANSPORTATION” category. In 2025, its 2024 IF was announced as 14.5, maintaining the Top 1 position (1/61, Q1) in the same category. Tsinghua University Press will cover the open access fee for all published papers in 2025.