BioPharmaTrend
Latest News
All Topics
  • Artificial Intelligence
  • NeuroTech
  • Premium Content
  • Knowledge Center
Interviews
Companies
  • Company Directory
  • Sponsored Case Studies
  • Create Company Profile
More
  • About Us
  • Our Team
  • Advisory Board
  • Citations and Press Coverage
  • Partner Events Calendar
  • Advertise with Us
  • Write for Us
Subscribe
Login/Join

  Latest News

Google’s AI Auto-Writes Reproducible Research Software for Genomics and COVID Forecasting

by Illia Terpylo   •   Sept. 15, 2025  

Disclaimer: All opinions expressed by Contributors are their own and do not represent those of their employers, or BiopharmaTrend.com.
Contributors are fully responsible for assuring they own any required copyright for any content they submit to BiopharmaTrend.com. This website and its owners shall not be liable for neither information and content submitted for publication by Contributors, nor its accuracy.

# Tech Giants   
Share:   Share in LinkedIn  Share in Bluesky  Share in Reddit  Share in Hacker News  Share in X  Share in Facebook  Send by email

Google Research introduced a Gemini-based system that writes and iteratively improves “empirical software,” with a preprint and an interactive site showing full solution trees. The system treats well-defined scientific problems as scorable tasks and targets a single objective metric, reporting expert-level results across genomics, public health, geospatial analysis, neuroscience, numerical analysis, and time-series forecasting.

#advertisement
AI in Drug Discovery Report 2025

Empirical software here means code optimized to maximize a specified quality score. If a task has a clear metric and data, it qualifies. Users provide a task description, metric, and datasets; the system proposes ideas, implements executable code, evaluates results, then explores thousands of variants via tree search inspired by AlphaZero (Google’s chess engine based on reinforcement learning). It rewrites code using LLM and Tree Search (TS) to return verifiable, reproducible programs. Below is the illustration of the principle - the algorithm is comprised by a code mutation system, where the prompt is enhanced with research ideas, sourced through the search algorithm or a primary literature.  

The system was designed to perform highly at the set of 16 Kaggle curated tasks. The scheme below displays the average public leaderboard percentile performance. Methods based on Google’s methods are listed in bold. TS - tree search; BDT - boosted decision tree:

The team put the system through its paces on six very different benchmarks:

  • Genomics (scRNA-seq batch integration): Discovered 40 new methods; one combined ComBat and BBKNN to outperform the best published method by ~14%.
  • Public health (COVID-19 hospitalizations): Produced models that beat the CDC’s CovidHub ensemble benchmark.
  • Geospatial analysis (remote sensing segmentation): Reached >0.80 mIoU with variants of UNet++ and SegFormer paired with extensive test-time augmentation.
  • Neuroscience (Zebrafish neural activity prediction): Developed a novel forecasting model that outperformed existing baselines, including computationally intensive 3D video-based models.
  • Mathematics (numerical integrals): Correctly evaluated 17 of 19 held-out integrals where standard methods failed.
  • General time-series forecasting: Built a general-purpose forecasting library from scratch across 28 datasets (GIFT-Eval benchmark).

The authors frame this as an LLM-driven tree search that rewrites code while incorporating research ideas from the literature, converting software creation into a scored search over executable candidates; because each candidate is evaluated and only score-improving programs are kept, the system can exhaustively traverse alternatives and, in their tests, surpass expert baselines across multiple fields. They report that this compresses exploration from weeks or months to hours or days and argue that in domains with machine-scorable objectives, such empirical software can materially accelerate the rate at which verifiable, reproducible solutions are found.

Paper: https://arxiv.org/abs/2509.06503

Website with visualizations: https://google-research.github.io/score/

Topics: Tech Giants   

Share:   Share in LinkedIn  Share in Bluesky  Share in Reddit  Share in Hacker News  Share in X  Share in Facebook  Send by email
#advertisement
ThermoFisher Scientific: Integrated genetic technologies for cell therapy development
#advertisement
Webinar: AI in Clinical Trials

BiopharmaTrend.com

Where Tech Meets Bio
mail  Newsletter
in  LinkedIn
x  X
gnews  Google News
rss  RSS Feed

About


  • What we do
  • Citations and Press Coverage
  • Terms of Use
  • Privacy Policy
  • Disclaimer

We Offer


  • Premium Content
  • BioTech Scout
  • Interviews
  • Partner Events
  • Case Studies

Opportunities


  • Membership
  • Advertise
  • Submit Company
  • Write for Us
  • Contact Us

© BPT Analytics LTD 2025
We use cookies to personalise content and to analyse our traffic. You consent to our cookies if you continue to use our website. Read more details in our cookies policy.