Philip Palmer Green (born July 5, 1950) is an American theoretical and computational biologist whose software for DNA base calling, sequence assembly and genetic-linkage analysis became foundational to the Human Genome Project and modern next-generation sequencing workflows.[2][3]
He is a professor of Genome Sciences and an investigator of the Howard Hughes Medical Institute at the University of Washington.[4]
Early life and education
Green grew up in Durham, North Carolina, and matriculated at Harvard College in 1968, earning an A.B. in mathematics (magna cum laude) in 1972.[1]
He completed his Ph.D. at the University of California, Berkeley, in operator-algebra theory under Marc Rieffel but soon transitioned to computational genetics during post-doctoral work at the California Institute of Technology.[1]
Career
After a faculty appointment in mathematics at Caltech, Green moved to the University of Washington in 1987 and co-founded its Genome Center, later the Department of Genome Sciences.[4]
He has held joint appointments in Computer Science & Engineering and Bioengineering, mentoring more than 40 graduate students and post-docs.[5]
Research contributions
Phred – Green and Brent Ewing developed the first base-calling algorithm with statistically calibrated quality scores, reducing sequencing error rates by 40–50 %.[6]
Phrap & Cross_match – His assembly software introduced quality-weighted contig building, critical for shotgun strategies used at the Human Genome Project and Celera.[7]
Lander–Green algorithm – A likelihood-based method for multilocus linkage analysis that enabled dense human genetic maps.[8]
EST analysis – Showed that expressed-sequence tags implied ≈35,000 human genes, a benchmark pre-genome publication.[9]
Ancient conserved regions – Demonstrated deep evolutionary conservation in vertebrate genomes, foreshadowing comparative genomics.[10]
^Ewing, B.; Green, P. (2000). "Analysis of expressed sequence tags indicates 35,000 human genes". Nature Genetics. 25 (2): 232–234. doi:10.1038/76126. PMID10835646.
^Green, P. (1993). "Ancient conserved regions in new gene sequences and the protein databases". Science. 259 (5102): 1711–1716. doi:10.1126/science.8456302. PMID8456302.
This article needs additional or more specific categories. Please help out by adding categories to it so that it can be listed with similar articles.(October 2025)