Archive for AI scare

Nature tidbits [15 Jan 2026]

Posted in Books, pictures, Travel, University life with tags , , , , , , , , , , , , , , , on March 13, 2026 by xi'an

The cover of the 15 Jan issue of Nature is a blurry reconstitution of snapshots from the James Webb Space Telescope, which contributed to uncover these new astronomical objects, deemed to be young supermassive black holes. A notion I only came across recently, during a pre-defence lunch near the Paris Observatory.

Apart from that astronomical advance, Nature puts its focus on the 25th anniversary of Wikipedia. With a tribune bemoaning the insufficient investment of academics in the platform (with mentions of hypocrisy and betrayal). Which sounds rather unfair, since it requires an additional levy on research time, even though I did contribute to a few entries. And recognise the worth of most scientifics pages, as well as the parasitism by LLMs. And an interview of of Jimmy Wales’ about his Wikipedia memoir, Seven Rules of Trust. Kudos to his vision! He sounds rather optimistic about the chances of Wikipedia surviving the tsunAImi, but only if the users keep resorting (and indeed contributing) to the platform rather than accepting the LLM production at fa(r)ce value!

Other entries on

  • LLMs suffering (!) from “anxiety, trauma, shame and post-traumatic stress disorder”, although the arXiv reporting the experiment is criticized by others for anthropomorphising the machines. The danger is more in them inducing real trauma in vulnerable (human) users!
  • LLMs exhibiting aggressive behaviour (if trained accordingly)
  • the oldest evidence on human controlled fires using pyrite (in SE England), 400,000 years ago
  • the rise of academics being harassed (and not only in the US) and six recommendations for protecting our digital security (mentioning organisations such as Scholars at Risk Europe, Expert Voices Together, and Faculty First Responders)
  • a “Where I work” picture of a food scientist at the National Autonomous University of Mexico flipping a tortilla, with fermented ingredients to improve health and combat malnutrition in poor communities

punk rock [cover]

Posted in Books, pictures with tags , , , , , , , , , on March 19, 2025 by xi'an

watermarking for privacy (with no durian)

Posted in Books, Statistics, Travel, University life with tags , , , , , , , , , , on February 14, 2025 by xi'an

In Scalable watermarking for identifying large language model outputs, published by Dathathri, See, Ghaisas, et (many) al. in Nature of 23 October 2024, the authors propose an algorithm to (voluntarily) watermark synthetic texts to identify them as such, through a statistical test. Here are a few quotes to relate to the authors’ solution.

“LLMs generate text based on preceding context (…) given a sequence of input text x<t = x1, …, xt−1 consisting of t − 1 tokens from a vocabulary V, the LLM computes the probability distribution pLM(x<t) of the next token xt given the preceding text x<t. To generate the full response, xt is sampled from pLM(x<t), and the process repeats until either a maximum length is reached or an end-token is generated.

In a watermarking scheme, a sampling algorithm is an algorithm that takes as input a probability distribution p ∈ ΔV and a random seed and returns a token.

Tournament sampling selects a token from the LLM distribution that is likely to score higher under the random watermarking functions (…) Given the selection of tokens xt based on higher g-values, we expect watermarked text generally to score higher under this score than unwatermarked text (…) [It] requires g-values to decide which tokens win each match in the tournament. Intuitively, we want a function that takes a token xV, a random seed and the layer number  {1, …, m}, and outputs a g-value g(x, r) that is a pseudorandom sample from some probability distribution fg (the g-value distribution).”

At the Ocean privacy workshop, someone came with the question of trusting (or not) synthetic data and I remembered this article. Suggesting watermarking for said synthetic data by having providers or agents (privately) running a disclosed or registered code that delivers a watermark which provides a strong support to the synthetic data being produced likewise. I remain uncertain this is at realistic.

[not so] hostile takeover on my Nature issue

Posted in Books, pictures, Travel, University life with tags , , , , , , , , , , , , , , , , , , , , , , , , , , , , on November 29, 2024 by xi'an

When flying to Montréal last week, I took with me this 16 September issue of Nature but forgot it in the plane. But had time to read a tribune warning on the dangers of governments being involved in running the Internet. And thus disrupting its decentralised structure… And a short highlight on the (minimal) impact of atom-sized, asteroid-mass black holes crossing the Solar System, awakening me to the very existence of such stellar objects! And remained skeptical at the proposal to downsize pint glasses to lower beer consumption in the UK. (Although I now find myself unable to drink a whole pint of beer, outside eating a fiery curry! Or an alcohol free version.) And read how NobAIl Prize winning Alphafold2 unfolded a structure common to a family of flavivirus that make them relative with a common ancestor. And a book review pondering whether or not AI can feel distress, pondering at their sentience. (The book is freely available on OUP webpage.) Unlikely to be re-reviewed for CHANCE! And a fairly interesting long comment on how some Asian nations are massively investing into alternative approaches to meat production. (In particular since mentioning Singapore, where the company employing one of my nephews is soon to introduce in-vitro foie gras.) But missed the research paper on the origin and evolution of the bread wheat D genome, which I would have most certainly devoured, despite my favouring rye and buckwheat breads! (No bread was hurt in the study, since the 60 wheat landraces analysed therein came from the CIMMYT and ICARDA gene banks.) The origin is thus most likely on the “southern shores of the Caspian Sea 8,000–11,000 years ago”. Which sounds quite recent, all things considered!

Nature snapshots [10 Aug 2023]

Posted in Statistics with tags , , , , , , , , , , , , , on November 12, 2023 by xi'an

Many gems in the 10 Aug issue I found home back from Western Canada. Starting with the cover theme related to the fascinating paper Solid-body trajectoids shaped to roll along desired pathways by Sobolev et al. which shows that there exist 3D objects corresponding to any infinitely periodic trajectory and furthermore that such objects can be printed to this effect (watch the great videos!). Although they are more likely to be of a period larger than 1, i.e. producing several copies of the trajectory basis before returning to their original contact point. Amazingly, self-intersecting trajectories may also be produced.

Plenty of AI entries, like Artificial-intelligence search engines wrangle academic literature (as an advanced Google Scholar) and Publishing companies (like Elsevier) developing ChatGPT-like search engines. (While trying to stay clear from fake references!). Plus a long enquiry on Rules to keep AI in check: nations carve different paths for tech regulation about the different approaches envisioned by the EU, the USA, and China, with little practical advance so far for the first two. (While the UK plans its own AI safety summit in Bletchley Park this Fall.)

And an article on how Chinese students are starting to feel less inclined to study abroad. And not only in connection with the COVID pandemic. Or the US growing restrictions on student visas. But mainly because of the quickly growing attractivity of mainland China and Hong Kong universities. (The article states that “For every 1,000 PhDs that have been denied [visa] by the US government, US institutions will lose nearly US$1 billion in tuition fees”, but this does not sound highly relevant since most PhD tuitions are paid from supervisors’ grants, while students from other country may substitute for those denied visa.)