Tauska Wang

Tauska Wang

Affiliated to Research
Visiting address: Solnavägen 9, Biomedicum B5, 17165 Solna
Postal address: C3 Fysiologi och farmakologi, C3 FyFa Reproduktiv endokrinologi och metabolism, 171 77 Stockholm

About me

  • I’m a machine learning engineer focused on post-training large language models and high-integrity data pipelines—especially where scientific rigor and real-world impact intersect. 

    My work spans agentic systems, reward modeling (RLAIF/RLHF), and evaluation at scale, with a track record of building reproducible tooling and shipping measurable improvements.

    Recent focus areas:

    LLM post-training & evaluation: Scaled rubric-based reward datasets to the million-pair range; trained summary/CoT reward models; delivered a ~5% lift on a health benchmark.

    Agentic systems & tooling: Built Pantheon-CLI and an LLM query-router; delivered end-to-end single-cell analysis agents; added cross-provider API compatibility; surpassed a general SWE-agent CLI on biomedical tasks.

    Scientific/health AI & data engineering: Parallelized BKMR on k8s; vectorized 30 years of NHANES; developed TCGA/GEO pipelines; created a mixture-of-experts framework for COPD metal-risk assessment; integrated wet-lab and dry-lab workflows.

    Quality & evidence: Author on peer-reviewed studies; reviewer for leading journals and ICLR; co-inventor on patents spanning transformer-based multimodal medical AI and biotechnology.

    Tooling & stack: Python, PyTorch, PyG, SQL; GCP/Azure; open-source contributions (PantheonOS/Agentic Data Science, OmicVerse, RAG Web UI, AstrBot). Currently pursuing advanced studies bridging medical science and computer science, and I thrive on projects where careful ablations, clear metrics, and tough benchmarks drive decisions.

    If you’re tackling LLM post-training, agentic workflows, or scientific/health data at production scale, I’m interested in collaborating on systems that move from promising results to verifiable outcomes.

Research

  • My research integrates reproductive endocrinology, metabolism, and artificial intelligence to uncover mechanisms of endocrine–metabolic disorders such as polycystic ovary syndrome (PCOS). I develop AI-driven frameworks for multi-omics integration and single-cell data analysis to reveal cross-organ communication among reproductive and metabolic tissues. I also explore large language models (LLMs) for biomedical text mining, literature synthesis, and clinical decision support. By combining systems biology with machine learning, my goal is to advance precision medicine and digital health approaches that bridge molecular discoveries with real-world clinical data, ultimately improving diagnosis, prevention, and treatment of reproductive–metabolic diseases.

Articles

All other publications

Grants

  • Thinking Machine Labs
    10 November 2025 - 10 November 2026
    Polycystic ovary syndrome (PCOS) is a heterogeneous endocrine and metabolic disorder in women, characterized by complex dysregulation across multiple tissues, including the ovary, endometrium, adipose tissue, skeletal muscle, and circulating immune and metabolic pathways. This complexity has led to highly fragmented bioinformatics workflows and substantial analyst-to-analyst variability, which limits reproducibility and slows translation of omics findings into clinically relevant insights. Supported by the Tinker research grant (USD 5,000 in compute credits over 12 months), this project aims to develop and evaluate a domain-adapted large language model (LLM)–driven “PCOS data analysis agent” for multi-omics analysis in women’s health. We will curate public bulk and single-cell transcriptomic and proteomic datasets related to PCOS and female metabolic health, along with high-quality method descriptions, analysis scripts, and reporting templates. Using Tinker’s infrastructure, we will fine-tune an LLM on this corpus and integrate it with our existing OmicVerse/OVAgent ecosystem to perform end-to-end tasks, such as pipeline design, parameter selection, quality control reasoning, cell-type annotation, and generation of structured analysis reports. The agent’s performance will be benchmarked against general-purpose LLMs and human expert baselines on accuracy, robustness, and run-to-run reproducibility of analysis outputs. By systematically testing whether a domain-adapted LLM can provide more consistent, transparent, and auditable analyses than ad-hoc expert workflows, this project seeks to establish a blueprint for trustworthy AI assistants in women’s health omics and to release reusable tools, prompts, and evaluation protocols to the research community.

News from KI

Events from KI