<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[This Genomic Life]]></title><description><![CDATA[Stories about functional genomics: Multi-omics, machine learning, imaging, and their impact on science and human health.]]></description><link>https://www.thisgenomiclife.org</link><image><url>https://substackcdn.com/image/fetch/$s_!EMoE!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19cdd040-4b52-4fda-897f-37eb9c464b1b_720x720.png</url><title>This Genomic Life</title><link>https://www.thisgenomiclife.org</link></image><generator>Substack</generator><lastBuildDate>Fri, 03 Apr 2026 20:11:25 GMT</lastBuildDate><atom:link href="https://www.thisgenomiclife.org/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Michael White and The McDonnell Genome Institute]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[thisgenomiclife@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[thisgenomiclife@substack.com]]></itunes:email><itunes:name><![CDATA[Mike White]]></itunes:name></itunes:owner><itunes:author><![CDATA[Mike White]]></itunes:author><googleplay:owner><![CDATA[thisgenomiclife@substack.com]]></googleplay:owner><googleplay:email><![CDATA[thisgenomiclife@substack.com]]></googleplay:email><googleplay:author><![CDATA[Mike White]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[How to Make the Case for Science in Polarized Times]]></title><description><![CDATA[We have a proven blueprint and most Americans of all political views still believe science is critical.]]></description><link>https://www.thisgenomiclife.org/p/how-to-make-the-case-for-science</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/how-to-make-the-case-for-science</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Wed, 04 Feb 2026 22:38:58 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!IvVZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IvVZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IvVZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic 424w, https://substackcdn.com/image/fetch/$s_!IvVZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic 848w, https://substackcdn.com/image/fetch/$s_!IvVZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic 1272w, https://substackcdn.com/image/fetch/$s_!IvVZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IvVZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic" width="498" height="654.9922480620155" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1018,&quot;width&quot;:774,&quot;resizeWidth&quot;:498,&quot;bytes&quot;:97340,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/186916042?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IvVZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic 424w, https://substackcdn.com/image/fetch/$s_!IvVZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic 848w, https://substackcdn.com/image/fetch/$s_!IvVZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic 1272w, https://substackcdn.com/image/fetch/$s_!IvVZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26013f9b-47ce-4bc4-b1d6-ca35c9d0614a_774x1018.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Vannevar Bush</figcaption></figure></div><p>It&#8217;s no secret that it&#8217;s been a rough year for scientific institutions in the U.S. The National Science Foundation and the National Institutes of Health were vandalized by DOGE and they are still struggling to get back on their feet. Those of us awaiting funding decisions are hearing that a layer of political review seems to have been added on top of the normal review process. Last year, several major universities fell into the crosshairs of the Trump administration, which attempted to coerce their behavior by using the nuclear option of freezing all of these institution&#8217;s federal funding&#8211;primarily funds for biomedical research. Some universities were also invited by the administration to sign a &#8220;compact&#8221; that would allow an unprecedented intrusion of the federal government into those institutions&#8217; hiring and curricular decisions.</p><p>I don&#8217;t want to underestimate the damage that&#8217;s been done to research universities in the past year and or minimize the suffering of those whose careers have been affected, but I am optimistic about the future of America&#8217;s scientific institutions. In the budget bills passed this week, Congress completely rejected the Trump administration&#8217;s proposed draconian cuts to the NIH and NSF. Universities are proving to be resilient, and America&#8217;s strong scientific enterprise isn&#8217;t something that can be destroyed overnight or even over a year.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><p>But in addition to the more immediate challenges, the scientific community needs to get to work on the longer term problem of rebuilding trust with the broader public. Here I want to suggest that we can rebuild that trust by committing to a successful vision of how the government and universities work together to serve the public.</p><p>The vision that we should commit to is not a new one: it is the vision that the U.S. implemented after WWII, leading to our spectacular success in science for over three generations. Most readers have likely heard of Vannevar Bush&#8217;s famous report, <em><a href="https://press.stripe.com/pieces-of-the-action">Science: The Endless Frontier</a></em>, written for the FDR Administration in early 1945. I imagine that fewer readers have actually read it or know in any detail what its arguments are. Well, I recently reread it and can say that it holds up.</p><p>We&#8217;re at a moment when the relationship between universities and the federal government seems to be in crisis, and alterante arrangements are being proposed and considered. I&#8217;m going to make the case that we don&#8217;t need alternatives; we just need to recommit to the principles that Bush laid out 80 years ago.</p><p>Before we get to Bush&#8217;s specific principles for government support of science, there is one essential prerequisite that Bush, writing as the US was winning a major war, could take for granted, but which we cannot: Government support of scientific institutions must rest on a foundation of <em>broad</em> <em>public legitimacy</em>. Scientific institutions need to maintain the respect and trust of the public that supports them, otherwise they will be seen by many as simply another elite interest group looking for taxpayer dollars. When that happens, Vannevar Bush&#8217;s vision for science collapses. I&#8217;ll say more on this below, but let&#8217;s first dig into the specifics of Bush&#8217;s report.</p><h2>Why should the government fund scientific research?</h2><p><em>Science: The Endless Frontier</em> is an answer to the question, why should the government provide stable, long-term funding for basic science? Bush gave four important answers..</p><h3>1. Basic science is the &#8220;seed corn&#8221;  of technological progress</h3><p>Whatever you might think of state of the economy today, one of the most important facts about human life in the 21st century is that we enjoy a prosperity that would be envied by nearly everyone alive 100 years ago. The reason for this is long term economic growth, driven by technological progress. Bush recognized that basic research, whose purpose is to generate new scientific knowledge, is the source of scientific capital from which technological progress draws. He called basic research the &#8220;seed corn&#8221; and &#8220;pacemaker&#8221; of technological progress. He wrote that</p><blockquote><p>Advances in science, when put to practical use, mean more jobs, higher wages, shorter hours, more abundant crops, more leisure for recreation, for study, for learning how to live without the deadening drudgery that has been the burden of the common man for ages past. Advances in science will also bring higher standards of living, lead to the prevention or cure of diseases, promote conservation of our limited national resources, and assure means of defense against aggression. But to achieve these objectives&#8212;to secure a high level of employment, to maintain a position of world leadership&#8212;the flow of new scientific knowledge must be both continuous and substantial.</p></blockquote><p>While scientific knowledge and technological progress can benefit the entire world, Bush correctly noted that nations which rely on other societies for scientific knowledge are at a disadvantage. By having scientific institutions here at home, we gain advantages in intellectual property, in human capital, and in the ability for our businesses to act first on new discoveries.</p><p>Importantly, it is the role of government to support scientific progress: &#8220;Since health, well-being, and security are proper concerns of the government, scientific progress is, and must be, of vital interest to the government.&#8221;</p><h3>2. Scientific progress is critical for the nation&#8217;s health, its economic prosperity, and its national security.</h3><p>Bush noted that major gains in lifespan were due to growth in biomedical knowledge and its application. Not only is a healthier populace a good thing on its own, but a healthier society is also a more economically prosperous one.</p><p>The success of our economy also depends in other ways on scientific knowledge:</p><blockquote><p>How will we find ways to make better products at lower cost? The answer is clear. There must be a stream of new scientific knowledge to turn the wheels of private and public enterprise. There must be plenty of men and women trained in science and technology, for upon them depend both the creation of new knowledge and its application to practical purposes.</p></blockquote><p>And finally, Bush noted that one of the most important public goods provided by the government is national defense. Writing in the waning moths of WWII, he said that:</p><blockquote><p>This war emphasizes three facts of supreme importance to national security: (1) Powerful new tactics of defense and offense are developed around new weapons created by scientific and engineering research; (2) the competitive time element in developing those weapons and tactics may be decisive; (3) war is increasingly total war, in which the armed services must be supplemented by the active participation of every element of civilian population.</p></blockquote><p>Today, war is not always total war, but as the war in Ukraine shows, technological innovation is still essential for defending your nation.</p><h3>3. Progress requires freedom of inquiry</h3><p>You can&#8217;t reliably predict in advance where breakthroughs will happen or what knowledge will prove most valuable:</p><blockquote><p>Many of the most important discoveries have come as a result of experiments undertaken with very different purposes in mind. Statistically it is certain that important and highly useful discoveries will result from some fraction of the undertakings in basic science, but the results of any one particular investigation cannot be predicted with accuracy.</p></blockquote><p>Thus, Bush argued, institutions of scientific research must be committed to freedom of inquiry. Neither politicians nor university administrators should dictate the course of research. Bush thought that universities and research institutes would be optimal sites for basic research because they were committed to open inquiry:</p><blockquote><p>It is chiefly in these institutions that scientists may work in an atmosphere that is relatively free from the adverse pressures of convention, prejudice, or commercial necessity. At their best they provide the scientific worker with a strong sense of solidarity and security, as well as a substantial degree of personal intellectual freedom. All of these factors are of great importance in the development of new knowledge, since much of this new knowledge is certain to arouse opposition because of its tendency to challenge current beliefs or practices.</p></blockquote><p>If universities fail to be places where researchers are free to challenge current beliefs, then they are failing to live up to their part of the bargain that Bush is proposing.</p><h3>4. Need for a scientific workforce</h3><p>A trained, talented scientific workforce is the single most important ingredient for scientific progress, and for converting scientific knowledge into technologies that benefit society:</p><blockquote><p>We shall have rapid or slow advance on any scientific frontier depending on the number of highly qualified and trained scientists exploring it.</p></blockquote><p>Critically, Bush argued that the nation needs to draw on talent from anywhere it is found. It is in the government&#8217;s interest to ensure that social or economic barriers don&#8217;t stand in the way of bringing talented people into science:</p><blockquote><p>Here is a tremendous waste of the greatest resource of a nation&#8212;the intelligence of its citizens. If ability, and not the circumstance of family fortune, is made to determine who shall receive higher education in science, then we shall be assured of constantly improving quality at every level of scientific activity.</p></blockquote><h2><strong>How to recommit to Bush&#8217;s Vision</strong></h2><p>Bush&#8217;s arguments for why the federal government should provide substantial, long-term funding for science are still sound today. To promote the national welfare, the government should support the production of scientific knowledge, at institutions that are committed to free inquiry and to drawing in talent from wherever it is found.</p><p>But the obligation doesn&#8217;t just go one way, from the government to universities. What Bush is proposing is that universities hold an important public trust. In exchange for public support, universities will work in the broad public interest, and they should be accountable to the public for holding up their end of the bargain. We need to be proactive about communicating to the public that we value this trust, and that we take our obligations seriously. Here are some principles scientists should follow, especially university leaders and those who choose to engage with the public:</p><ol><li><p><strong>Don&#8217;t link science with partisan identities.</strong> Don&#8217;t act like science and universities are the province of political liberals. Science is for everyone, and we should welcome colleagues and trainees regardless of their political affiliation. One way that universities have failed at this is to rely too heavily on language and ideas that come across as partisan. For example, DEI efforts, in principle, align with Bush&#8217;s charge to ensure that social and economic barriers don&#8217;t keep talented people out of science. In practice, these efforts often rely on tendentious left-wing arguments that have the counterproductive effect of convincing a big chunk of the population that the scientific community doesn&#8217;t care about them or share their values.</p></li><li><p><strong>Hold high standards and aggressively police fraud.</strong> The public needs to know that when we spend taxpayer funds, we are doing do with care, and that we take misconduct seriously.</p></li><li><p><strong>Defend academic freedom from all threats.</strong> Elected officials right now represent the most serious threat to academic freedom, but the last decade has seen unacceptable levels of censorship in academia. Too often the problem has been dismissed as minor, affecting only a few people. I do agree that the overwhelming majority of academics have not been affected by censorship, but the problem is real and a dismissive attitude sends the wrong signal to the public. The public needs to know that universities are living up to Bush&#8217;s claim that they are committed to free inquiry against all threats.</p></li><li><p><strong>Be willing to listen to public concerns.</strong> Amander Clark, director of the UCLA Center for Reproductive Science, recently wrote in a <em><a href="https://www.nature.com/articles/d41586-026-00090-1">Nature</a></em><a href="https://www.nature.com/articles/d41586-026-00090-1"> commentary</a> that</p></li></ol><blockquote><p>Those of us working in the sector should reflect on the purpose, mission and vision of universities. It&#8217;s crucial that educators and researchers listen to the public on whether university visions align with public needs, values and the jobs that will be needed in the future workforce.</p></blockquote><p>&#9;National Academy of Science President Marcia McNutt <a href="https://www.science.org/doi/10.1126/science.adu4907">wrote</a> in <em>Science</em> that</p><blockquote><p>The scientific community must also better recognize that it may not be helpful to emphasize consensus in policy reports&#8217; recommendations when the underlying values are not universally shared.</p></blockquote><p>Good doctors learn to communicate effectively with patients by cultivating a bedside manner, which more or less means letting the patient know that you are hearing their concerns. We need to cultivate a similar approach in how we present ourselves to the public&#8211;and these days, basically anything we say or do online is public.</p><h3>Americans understand the importance of science</h3><p>The good news is that Americans across the political spectrum do genuinely understand the importance of science, as <a href="https://www.pewresearch.org/science/2026/01/15/do-americans-think-the-country-is-losing-or-gaining-ground-in-science/">recent results</a> from Pew Research show (see below). They don&#8217;t need to be convinced of that. Instead, they need to be convinced that scientists and scientific institutions are committed to working in the interests of everyone.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!G6YK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!G6YK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png 424w, https://substackcdn.com/image/fetch/$s_!G6YK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png 848w, https://substackcdn.com/image/fetch/$s_!G6YK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png 1272w, https://substackcdn.com/image/fetch/$s_!G6YK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!G6YK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png" width="444" height="460.589010989011" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:944,&quot;width&quot;:910,&quot;resizeWidth&quot;:444,&quot;bytes&quot;:431180,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/186916042?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!G6YK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png 424w, https://substackcdn.com/image/fetch/$s_!G6YK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png 848w, https://substackcdn.com/image/fetch/$s_!G6YK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png 1272w, https://substackcdn.com/image/fetch/$s_!G6YK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe20a0c11-0246-4fe6-8e74-294992b8912f_910x944.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!e_W0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!e_W0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png 424w, https://substackcdn.com/image/fetch/$s_!e_W0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png 848w, https://substackcdn.com/image/fetch/$s_!e_W0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png 1272w, https://substackcdn.com/image/fetch/$s_!e_W0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!e_W0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png" width="450" height="451.2857142857143" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:702,&quot;width&quot;:700,&quot;resizeWidth&quot;:450,&quot;bytes&quot;:239564,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/186916042?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!e_W0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png 424w, https://substackcdn.com/image/fetch/$s_!e_W0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png 848w, https://substackcdn.com/image/fetch/$s_!e_W0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png 1272w, https://substackcdn.com/image/fetch/$s_!e_W0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4d3e188-e4d0-43d7-bc4c-a91ec750ea61_700x702.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[What if the Human Genome Project was paywalled?]]></title><description><![CDATA[In 1999 a Perkin-Elmer CEO envisioned a pay-for-access genome scheme that would have smothered two decades innovation in biotechnology and genomic medicine.]]></description><link>https://www.thisgenomiclife.org/p/what-if-the-human-genome-project</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/what-if-the-human-genome-project</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 25 Apr 2025 18:43:02 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Y4fr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Y4fr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Y4fr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic 424w, https://substackcdn.com/image/fetch/$s_!Y4fr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic 848w, https://substackcdn.com/image/fetch/$s_!Y4fr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic 1272w, https://substackcdn.com/image/fetch/$s_!Y4fr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Y4fr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic" width="594" height="512.4065934065934" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1256,&quot;width&quot;:1456,&quot;resizeWidth&quot;:594,&quot;bytes&quot;:297504,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/162141193?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Y4fr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic 424w, https://substackcdn.com/image/fetch/$s_!Y4fr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic 848w, https://substackcdn.com/image/fetch/$s_!Y4fr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic 1272w, https://substackcdn.com/image/fetch/$s_!Y4fr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeafcc72-a84f-4c88-ae92-6df5969ddf10_1488x1284.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Bob Waterston, then chair of the Genetics Department at Washington University in St. Louis, at a 2001 press conference announcing the draft sequence of the human genome. Credit: <a href="https://www.genome.gov/about-genomics/educational-resources/fact-sheets/human-genome-project">NHGRI Photo Archive</a>. </figcaption></figure></div><p>Today (April 25) is <a href="https://www.genome.gov/dna-day">DNA Day</a>, in recognition of Watson and Crick&#8217;s transformative 1953 paper describing the double helical structure of DNA. DNA Day also recognizes the completion of the <a href="https://www.genome.gov/about-genomics/educational-resources/fact-sheets/human-genome-project">first Human Genome Project</a> fifty years later. And yesterday the Genetics Department at Washington University in St. Louis celebrated its <a href="https://genetics.wustl.edu/department-of-genetics-50th-anniversary-symposium/">50th anniversary</a>. One of the keynote speakers was Bob Waterston, our former department head who played a critical role in making the Human Genome Project happen. In his talk, Waterston recounted a chilling anecdote that illustrates what might have happened if the Human Genome Project turned out differently. Here&#8217;s a version of the anecdote from his <a href="https://www.annualreviews.org/content/journals/10.1146/annurev-genom-101822-012945">recent memoir</a>:</p><blockquote><p>At one meeting in December 1999, called to explore possible collaboration [between the publicly-funded Human Genome Project and the private genomics company Celera], Tony White, the CEO of Perkin-Elmer, insisted that any combined genome-sequence product would only be available through the Celera website for years to come, and Celera would be the only source for any analyses, including annotations, that might come from joint efforts.</p></blockquote><p>Many readers will remember that Craig Venter&#8217;s Celera was competing with the publicly-funded, international project to sequence the human genome. If you don&#8217;t work in biology, you may not appreciate just how disastrous it would have been if the CEO of Perkin-Elmer had gotten his way. Just about every aspect of biology has been transformed by genome sequencing&#8211;an analogy might be the impact that quantum mechanics has had on physics. Virtually no part of the broader field is untouched by the transformation. A key part of the genomic transformation of biology has to do with new tools that depend on access to reference genomes. <a href="https://www.thisgenomiclife.org/p/the-first-crispr-gene-therapy-is">Gene editing</a> to cure disease, mRNA vaccines, ancient DNA studies, gut microbiome analysis, <a href="https://pubmed.ncbi.nlm.nih.gov/31097668/">brain changes</a> in autism, ecological studies&#8211;all of these depend on technologies that would either not exist or have been delayed for years if a private company had owned the human genome sequence. </p><p>Why? Because the technologies and computational methods that are so central to medical and biological research today depend heavily on aligning new DNA sequencing data to a reference genome sequence. These technologies aren&#8217;t only about &#8216;reading&#8217; a DNA sequence&#8211;many of them use sequencing to measure critical molecular events, quantify the state of a cell, and to design new sequences that can be used in lab research and therapies. The Human Genome Project generated one of the first and most critical reference sequences. If every lab around the world had to pay Celera for access to that reference genome, much of the innovation that drives biology today would never have gotten off the ground. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><p>This isn&#8217;t just speculation. We can make the direct comparison. Parts of the genome that were sequenced first by Celera locked up as its IP until the public project resequenced a freely available version. Comparing the Celera-first versus public-first parts of the genome, <a href="https://pubmed.ncbi.nlm.nih.gov/24639594/">2010 analysis</a> found that &#8220;Celera's IP led to reductions in subsequent scientific research and product development on the order of 20 to 30 percent.&#8221;</p><p>Yesterday at our department&#8217;s 50th anniversary symposium, Bob Waterston told of how he and others pushed hard for open genome access at a now-famous <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC5634517/">1996 meeting</a> held at the Princess Hotel in Bermuda. At the time, the genome sequence of the nematode <em>C. elegans</em> was nearing completion and large-scale sequencing of the human genome was about to begin. What emerged are known as the Bermuda Principles, a central part of which was that new sequencing data would be shared <em>immediately</em>. Sequencing centers would upload their new sequencing data to an open database every day, thereby putting this data in the public domain. As <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC7307446/">one history</a> of the Bermuda Principles puts it, </p><blockquote><p>the Bermuda Principles addressed concerns about gene patents impeding scientific advancement, and were aspirational and flexible in implementation and justification. They endured as an archetype for how rapid data sharing could be realized and rationalized, and permitted adaptation to the needs of various scientific communities.</p></blockquote><p>Biomedical science and the biotechnology industry would not be what they are today if those principles had not prevailed. Because they did prevail, patents on human genes and gene variants were already <a href="https://psmag.com/environment/someone-else-owns-your-genes-57544/">on their way to a natural death</a> by the time the U.S. Supreme Court killed them for good in 2013.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/what-if-the-human-genome-project?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/what-if-the-human-genome-project?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p>Scientific knowledge is one of the most critical of what economists call public goods. Public goods are &#8220;non-rivalrous&#8221; and &#8220;non-excludable&#8221;. Non-rivalrous means one person&#8217;s use of a public good doesn&#8217;t diminish anyone&#8217;s ability to use it, and while non-excludable means it&#8217;s difficult to keep others from using it. Because of those features, private incentives to produce scientific knowledge are low, and economies then under-produce them, to everyone&#8217;s detriment. The Human Genome Project is a clear example of scientific knowledge as a public good. Public domain reference genomes can be freely used and reused by academic and industry labs to advance knowledge and invent new technologies and therapies. A <a href="https://doe-humangenomeproject.ornl.gov/economic-impact/">2013 report</a> found that every $1 invested by the US government in the Human Genome Project led to a $141 addition to the US economy.  Had Celera succeeded in paywalling the reference genome, it would have increased research costs and stifled innovation. But it also would have been difficult and time-consuming for Celera to enforce its monopoly, since bootleg sequence data could easily be shared and labs could (inefficiently and redundantly) resequence parts of the genome they cared about. </p><p>Publicly-funded work like this is the foundation on which nearly all of today&#8217;s technological innovation is built. It&#8217;s why we can <a href="https://www.thisgenomiclife.org/p/treating-disease-rationally-with">now cure</a> some fatal genetic diseases, treat cancer more effectively with better diagnoses, have a successful drug against obesity, shop at well-stocked grocery stores, text friends, stream movies, and fly to visit family on holidays. Had the human reference genome been locked up behind a paywall, our society would have been sicker and poorer for it.</p><p>For those interested in the early history of the technologies that made the Human Genome Project possible, I&#8217;ve transcribed <a href="https://www.thisgenomiclife.org/p/technologies-that-made-the-human">a talk</a> given by another former Genetics Department Head, Mark Johnston, who had a front row seat and made some critical contributions of his own. Click the link for the video and annotated transcript.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[Are we making AI too human?]]></title><description><![CDATA[A big goal of AI is to achieve human-like intelligence but we may need something else for applications that aren't well-suited to human intelligence.]]></description><link>https://www.thisgenomiclife.org/p/are-we-making-ai-too-human</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/are-we-making-ai-too-human</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Wed, 16 Apr 2025 18:24:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>If you pay any attention to AI in the news you know that many influential people believe that AGI (artificial general intelligence) is on the horizon. In early March, <em>The New York Times</em>&#8217; Ezra Klein <a href="https://www.nytimes.com/2025/03/04/opinion/ezra-klein-podcast-ben-buchanan.html?unlocked_article_code=1._k4.KPtP.-17Sb4pF5eyl&amp;smid=url-share">said on his podcast</a> that:</p><blockquote><p>For the last couple of months, I have had this strange experience: Person after person &#8212; from artificial intelligence labs, from government &#8212; has been coming to me saying: It&#8217;s really about to happen. We&#8217;re about to get to artificial general intelligence.</p></blockquote><p>Klein defines AGI as AI that is &#8220;capable of doing basically anything a human being could do behind a computer &#8212; but better.&#8221; Achieving AGI is clearly a major goal of the big players in AI. Google, clearly concerned about being perceived as responsible in its quest for AGI, recently released a <a href="https://blog.google/technology/google-deepmind/agi-safety-paper/">paper on AGI safety</a>. The underlying premise is that work pursued by Google and others is definitely on the path to AGI. </p><p>George Mason University economist Tyler Cowen also <a href="https://x.com/tbpn/status/1912280076235993218">thinks AGI is imminent</a>, and he also defines it in terms of human intelligence:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ebcd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ebcd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic 424w, https://substackcdn.com/image/fetch/$s_!Ebcd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic 848w, https://substackcdn.com/image/fetch/$s_!Ebcd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic 1272w, https://substackcdn.com/image/fetch/$s_!Ebcd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ebcd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic" width="558" height="546.5733788395904" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1148,&quot;width&quot;:1172,&quot;resizeWidth&quot;:558,&quot;bytes&quot;:107067,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161022131?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ebcd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic 424w, https://substackcdn.com/image/fetch/$s_!Ebcd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic 848w, https://substackcdn.com/image/fetch/$s_!Ebcd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic 1272w, https://substackcdn.com/image/fetch/$s_!Ebcd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722fc83f-31e0-4cfe-ac98-11de330da4e0_1172x1148.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Others are skeptical that there is any clear threshold that would count as achieving AGI. Arvind Narayanan and Sayash Kapoor, authors of <em>AI Snake Oil</em>, <a href="https://www.aisnakeoil.com/p/ai-scaling-myths?utm_campaign=post&amp;utm_medium=web">argue</a> that we&#8217;ll see AI models climb a &#8220;ladder of generality,&#8221; with new models capable of new tasks; however, they are skeptical that we&#8217;ll get to anything one might call AGI simply by building bigger LLMs (large language models):</p><blockquote><p>An unknown number of steps lie ahead before we can reach a level of generality where AI can perform any economically valuable job as effectively as any human (which is one definition of AGI).</p></blockquote><p>Why all the fuss about AGI? A big part of the answer is that AGI has been the holy grail of AI throughout its entire <a href="https://www.science.org/doi/10.1126/science.ado7069">history</a> because the field is specifically motivated by <em>human </em>intelligence. There are many ways that you can use probability and statistics to build powerful models of the world. But the AI field, starting at least with the very first artificial neurons has proceeded on the premise that we can learn how human intelligence works by building computer models with similar capabilities. In other words, one purpose of AI is to simply understand how intelligence works&#8211;and thus AGI would represent an important milestone.</p><p>But AI clearly has value when it&#8217;s applied to many tasks that humans <em>don&#8217;t</em> do especially well. And so it&#8217;s worth asking, is the focus on human-like capabilities holding back AI progress in applications that don&#8217;t depend on recreating features of human reasoning? To be more specific, in the context of genomics, are AI architectures cribbed from language models and image recognition tools really the right way to go?</p><p>Borrowing a Jetsons analogy from <a href="https://www.harperacademic.com/book/9780063286344/a-brief-history-of-intelligence/">Max Bennet</a>, are we wasting effort trying to build a humanoid robot that can hold a conversation and clean the house (left) when what you really need is a one-armed robot (right) that can do complex but specialized tasks in a factory?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wpLK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wpLK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic 424w, https://substackcdn.com/image/fetch/$s_!wpLK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic 848w, https://substackcdn.com/image/fetch/$s_!wpLK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic 1272w, https://substackcdn.com/image/fetch/$s_!wpLK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wpLK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic" width="598" height="418.10714285714283" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1018,&quot;width&quot;:1456,&quot;resizeWidth&quot;:598,&quot;bytes&quot;:128429,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161022131?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wpLK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic 424w, https://substackcdn.com/image/fetch/$s_!wpLK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic 848w, https://substackcdn.com/image/fetch/$s_!wpLK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic 1272w, https://substackcdn.com/image/fetch/$s_!wpLK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91f5aa9e-0b99-44b4-b4f2-8d0aa4e4158a_1994x1394.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Left: Rosey the Robot, from The Jetsons (Warner Bros Entertainment). Right: Industrial robot from KUKA Roboter.</figcaption></figure></div><p>I don&#8217;t pretend to have a serious answer to this question, but it seems useful to raise it. In the rest of this post I want to explore the question by thinking about thoughtful benchmarks. To measure progress and spur innovation, we need to think carefully about tests of AI that are not merely based on whatever evaluations are convenient to make, but rather on tasks that are specifically designed to reflect our big goals for AI. The punchline is that progress in biological applications of AI is going to depend on devising much better ways to test our models.</p><h2>Benchmarking AI on fluid intelligence</h2><p>While Tyler Cowen thinks AGI might be here in a few days, AI scientist Fran&#231;ois Chollet does not. <em>The Atlantic</em> recently ran a <a href="https://www.theatlantic.com/technology/archive/2025/04/arc-agi-chollet-test/682295/?gift=i_RjWLDY_BE3FbT184VWrYSu8sHd6hnu3flRJa9Nlp8&amp;utm_source=copy-link&amp;utm_medium=social&amp;utm_campaign=share">story</a> profiling Chollet, called &#8220;The Man Out to Prove How Dumb AI Still Is&#8221;. The piece is largely about a famous set of benchmarks that Chollet created with his collaborators, ARC-AGI. (ARC stands for Abstraction and Reasoning Corpus.) The ARC benchmarks are based on an interesting <a href="https://arxiv.org/abs/1911.01547">2019 paper</a> by Chollet called &#8220;On the Measure of Intelligence.&#8221; He argued that most tests of AI&#8217;s capabilities overestimate how good it is, because they they are too heavily based on prior knowledge:</p><blockquote><p>[S]olely measuring skill at any given task falls short of measuring intelligence, because skill is heavily modulated by prior knowledge and experience: unlimited priors or unlimited training data allow experimenters to "buy" arbitrary levels of skills for a system, in a way that masks the system's own generalization power.</p></blockquote><p>Showing that GPT  can score in the 90th percentile on the LSAT is not really a significant test of intelligence, because the GPT was trained on the entire corpus of the internet. What we should be measuring, Chollet argued, is the capacity to solve problems that <em>don&#8217;t</em> depend on lots of prior skill and knowledge, something called fluid intelligence. The ARC benchmarks were designed to test fluid intelligence and, as the <em>Atlantic </em>piece points out, AI models completely bombed when they were first tested. </p><p>The ARC benchmarks have spurred innovation and AI now does <a href="https://lewish.io/posts/arc-agi-2025-research-review">much better</a>. OpenAI&#8217;s o3 showed <a href="https://arcprize.org/blog/oai-o3-pub-breakthrough">impressive gains</a>, but it still fails to solve puzzles that most elementary school students could solve. And when it does solve these problems it takes on average 14 minutes of compute time to do something that takes most humans just a few moments of inspection. Here&#8217;s an example of a puzzle o3 couldn&#8217;t solve:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ihen!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ihen!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic 424w, https://substackcdn.com/image/fetch/$s_!ihen!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic 848w, https://substackcdn.com/image/fetch/$s_!ihen!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic 1272w, https://substackcdn.com/image/fetch/$s_!ihen!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ihen!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic" width="1456" height="790" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:790,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:129745,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161022131?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ihen!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic 424w, https://substackcdn.com/image/fetch/$s_!ihen!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic 848w, https://substackcdn.com/image/fetch/$s_!ihen!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic 1272w, https://substackcdn.com/image/fetch/$s_!ihen!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b054e1b-3d29-4566-957c-fec813c6c604_1932x1048.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">ARC-AGI puzzle <a href="https://arcprize.org/blog/oai-o3-pub-breakthrough">not solved</a> by o3.</figcaption></figure></div><p>You can <a href="https://arcprize.org">try these puzzles</a> yourself. Spend a few minutes with them and you&#8217;ll quickly realize that they test features of intelligence that are much more universal among humans and more profound than what&#8217;s needed to get a high score on the LSAT. </p><p>Thinking about the ARC-AGI benchmarks is useful for a few reasons. First, it should deflate any expectations that AGI is just around the corner. My ninth-grader solved the above puzzle in a few seconds. Second, ARC-AGI shows that thoughtful benchmarks, as opposed to benchmarks of convenience like the LSAT, can spur innovation. And third, they highlight the heavy emphasis of AI development on achieving features of <em>human</em> intelligence. ARC-AGI is easy for humans but hard for AI. What about applications of AI that are <em>not</em> easy for humans? Does it make sense to rely almost exclusively on AI methods that are developed to achieved human-like capabilities? </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h2>What&#8217;s the equivalent of fluid intelligence for molecular reasoning?</h2><p>One could reply to the above question by pointing to the Nobel Prize won by the Demis Hassabis and John Jumper for AlphaFold. AlphaFold&#8217;s ability to accurate predict 3D protein structures is incredibly useful, and its is based on decades of human-intelligence-inspired AI development. However, AlphaFold does include some <a href="https://www.nature.com/articles/s41586-021-03819-2">critical innovations</a> in its architecture, so it&#8217;s not just an off-the-shelf LLM. That&#8217;s not always true of the increasingly common genomic LLMs that seem to appear in the literature every week. Not all, but much of the AI in biology is taken almost off-the-shelf from applications in computer vision or human language.</p><p>I&#8217;m not an AI expert and definitely not qualified to discuss innovations in AI architecture. But as a field, we should ask ourselves, how can we spur innovation that&#8217;s relevant for biology? Humans are good at solving the ARC-AGI puzzles, but we&#8217;re terrible at finding subtle patterns in megabases of genomic sequence. What&#8217;s the genomic equivalent of fluid reasoning that we want genomic AI to achieve?</p><p>Right now, the field relies on the equivalent of benchmarking GPT on the LSAT. We use convenient large-scale datasets to evaluate our models. Sometimes this makes sense, because we want AI models that can accurately predict the outcomes measured in those datasets. But, to push the LSAT analogy, an AI model that gets a perfect score on the LSAT likely won&#8217;t be able to engage in real-life legal reasoning if it lacks the kind of fluid intelligence that ARC-AGI tests for. Similarly, it&#8217;s one thing for a genomic model to predict gene expression levels in a handful of cell types; it&#8217;s something else to be able to predict gene expression in <em>any</em> cell type, in any genetic background or disease state. One definition of genomic AGI might be AI that can replace our need to do an experiment, because the model predicts what happens in a new context with as much accuracy as our experimental measurements would have had.</p><p>Again, I don&#8217;t have the answer. But to highlight the problem, I&#8217;ll summarize some important genomic benchmarking papers that are relevant to considering where we go from here.</p><h3>Protein-ligand binding AI models are still just memorizing training data</h3><p>A group out of the University of Basel has <a href="https://www.biorxiv.org/content/10.1101/2025.02.03.636309v2">introduced</a> the &#8220;Runs N&#8217; Poses&#8221; benchmark dataset, a set of 2,600 high-resolution protein-ligand structures not included in the training data of the latest deep learning 3D structure predictors, including AlphaFold3. They binned the structures in their benchmark dataset according to how similar the structures were to other structures in the training data. They then asked, how accurate are the models on structures that are very different from the training data, compared with structures that are similar. The results show that these models don&#8217;t generalize well:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OMYg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OMYg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic 424w, https://substackcdn.com/image/fetch/$s_!OMYg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic 848w, https://substackcdn.com/image/fetch/$s_!OMYg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic 1272w, https://substackcdn.com/image/fetch/$s_!OMYg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OMYg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic" width="484" height="463.7865429234339" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:826,&quot;width&quot;:862,&quot;resizeWidth&quot;:484,&quot;bytes&quot;:69942,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161022131?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OMYg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic 424w, https://substackcdn.com/image/fetch/$s_!OMYg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic 848w, https://substackcdn.com/image/fetch/$s_!OMYg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic 1272w, https://substackcdn.com/image/fetch/$s_!OMYg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f53fecc-70b3-4e7d-9567-d52ab296438a_862x826.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">From figure 1 of &#352;krinjar, et al, <a href="https://www.biorxiv.org/content/10.1101/2025.02.03.636309v2.full">bioRxiv 2025.02.03.636309</a></figcaption></figure></div><p>There is a big decrease in accuracy on structures that don&#8217;t look like what the models have previously seen. This is evidence that success in this area still largely comes from memorizing the training data, rather than learning the biophysical rules that govern how proteins bind ligands. </p><h3>Predicting regulatory genetic variant effects: Big AI doesn&#8217;t beat simpler models</h3><p>A group at UC Berkeley and Genentech has <a href="https://www.biorxiv.org/content/10.1101/2025.02.11.637758v2.full">released</a> TraitGym, a benchmark dataset of non-coding regulatory DNA variants associated with a set of Mendelian diseases or a set of complex traits. For readers not steeped in genetics, Mendelian diseases are usually caused by a variant in one gene with a large effect on the phenotype, while complex traits are typically affected by variants in many different genes with more subtle effects. </p><p>The key result is that relatively simple models based on evolutionary alignments of DNA sequences. In the plot below, performance (AUPRC) is represented on the x-axis. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!uFyP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!uFyP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic 424w, https://substackcdn.com/image/fetch/$s_!uFyP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic 848w, https://substackcdn.com/image/fetch/$s_!uFyP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic 1272w, https://substackcdn.com/image/fetch/$s_!uFyP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!uFyP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic" width="1456" height="721" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:721,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:71823,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161022131?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!uFyP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic 424w, https://substackcdn.com/image/fetch/$s_!uFyP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic 848w, https://substackcdn.com/image/fetch/$s_!uFyP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic 1272w, https://substackcdn.com/image/fetch/$s_!uFyP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8b9107d-1ba6-4f31-8ddb-2ab6cf21ae54_1560x772.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Portion of figure 5 of Benegas, Eraslan, and Song, <a href="https://www.biorxiv.org/content/10.1101/2025.02.11.637758v2.full">bioRxiv 2025.02.11.637758</a></figcaption></figure></div><p>The best model, <a href="https://pubmed.ncbi.nlm.nih.gov/38183205/">CADD</a>, as the TraitGym authors describe it, is a &#8220;is a logistic regression model trained to distinguish proxy-deleterious from proxy-neutral variants,&#8221; built on a large set of feature annotations, some of which were derived from protein language models. Enformer (ranked #2 on complex traits) is a a supervised, transformer-based AI model. Notably, the big self-supervised AI models (HyenaDNA, Evo2 with 40 billion parameters, etc.) did much worse.</p><h3>Large language models don&#8217;t outperform simpler models, again</h3><p>Anshul Kundaje and his lab at Stanford found similar results in their <a href="https://arxiv.org/html/2412.05430v1">benchmark analysis</a>, DART-Eval.  The benchmarks were designed to test &#8220;sequence motif discovery, cell-type specific regulatory activity prediction, and counterfactual prediction of regulatory genetic variants.&#8221;</p><p>There are multiple interesting results in this analysis. One suggests that language models have not learned to identify DNA sequence features that define cell type-specific regulatory DNA. The UMAP plots below show the results of clustering the model embeddings of cell type specific regulatory sequences. The models were presented with a collection of cell type-specific regulatory sequences (derived from scATAC-seq data) and tried to learn which sequences are active in the same cell type. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2_qu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2_qu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic 424w, https://substackcdn.com/image/fetch/$s_!2_qu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic 848w, https://substackcdn.com/image/fetch/$s_!2_qu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic 1272w, https://substackcdn.com/image/fetch/$s_!2_qu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2_qu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic" width="1456" height="570" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:570,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:182475,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161022131?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2_qu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic 424w, https://substackcdn.com/image/fetch/$s_!2_qu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic 848w, https://substackcdn.com/image/fetch/$s_!2_qu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic 1272w, https://substackcdn.com/image/fetch/$s_!2_qu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53cc845d-b56e-4960-86dd-4d8e55433808_2044x800.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 4 pf Patel, et al., <a href="https://arxiv.org/abs/2412.05430">arXiv:2412.05430</a></figcaption></figure></div><p>There is much more in the paper, but I&#8217;ll just cite a key conclusion:</p><blockquote><p>Our systematic evaluations reveal that current annotation-agnostic DNALMs [DNA Language Models] exhibit inconsistent performance and do not offer compelling gains over alternative baseline models for most tasks, despite requiring significantly more computational resources.</p></blockquote><h3>Language model embeddings underperform</h3><p>The <a href="https://www.biorxiv.org/content/10.1101/2024.02.29.582810v2.full">last study</a> I want to highlight is from Peter Koo&#8217;s lab at Cold Spring Harbor. The authors asked whether the more complex representations learned by language models were better at predicting the activity of regulatory DNA compared to simpler representations based on one-hot encoding. The question is whether the language models learned a good (and generalizable) representation of the underlying DNA sequence grammar. The answer is that simpler representations do better:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZAr7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZAr7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic 424w, https://substackcdn.com/image/fetch/$s_!ZAr7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic 848w, https://substackcdn.com/image/fetch/$s_!ZAr7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic 1272w, https://substackcdn.com/image/fetch/$s_!ZAr7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZAr7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic" width="1410" height="698" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:698,&quot;width&quot;:1410,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:40860,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161022131?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZAr7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic 424w, https://substackcdn.com/image/fetch/$s_!ZAr7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic 848w, https://substackcdn.com/image/fetch/$s_!ZAr7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic 1272w, https://substackcdn.com/image/fetch/$s_!ZAr7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94e3f37e-fb4c-4bf2-91e5-cec2c95645ba_1410x698.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 2 of Tang, et al, <a href="https://www.biorxiv.org/content/10.1101/2024.02.29.582810v2.full">bioRxiv 2024.02.29.582810</a>.</figcaption></figure></div><p>Their conclusion:</p><blockquote><p>To assess the transferability of knowledge acquired during pre-training for current genome language models, we evaluated the predictive power of pre-trained representations from four gLMs on whole genomes (without fine-tuning) across six functional genomics prediction tasks with appropriate baselines for comparison. We found that within <em>cis</em>-regulatory elements, representations from pre-trained gLM provide little to no advantage compared to standard one-hot sequences.</p></blockquote><p></p><p>There is much more to discuss, but this post is already getting too long. What the results above illustrate is that very large models&#8211;the GPTs of genomics&#8211;don&#8217;t seem to be learning general representations of protein-ligand interactions or DNA sequence grammars. They&#8217;re not learning the biology, and in many cases, simpler machine learning tools outperform the big models. We&#8217;re far from AGI for biology, whatever that might mean. To spur innovation, we need more benchmarking analyses like those above.</p><p>Perhaps this should include new experimental datasets that are specifically designed, like ARC-AGI, to test whatever the equivalent of fluid intelligence should be for genomic reasoning. Our goal should be to achieve AI that learns meaningful representations of fundamental biology that generalize across contexts. Rather than aiming for AI that&#8217;s can do anything that humans can do, but better, in genomics we need AI that can accurately predict anything that humans can <em>measure</em> in an experiment, but haven&#8217;t.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[This week's finds: Bad AI is bad science, autism versus cancer mutations, open drug development, etc.]]></title><description><![CDATA[For the weekend, this week's finds in genomics and more, with a science fiction recommendation.]]></description><link>https://www.thisgenomiclife.org/p/this-weeks-finds-bad-ai-is-bad-science</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/this-weeks-finds-bad-ai-is-bad-science</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 11 Apr 2025 21:01:09 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ahfg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ahfg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ahfg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic 424w, https://substackcdn.com/image/fetch/$s_!ahfg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic 848w, https://substackcdn.com/image/fetch/$s_!ahfg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic 1272w, https://substackcdn.com/image/fetch/$s_!ahfg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ahfg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic" width="1456" height="1273" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1273,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:842515,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161100268?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ahfg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic 424w, https://substackcdn.com/image/fetch/$s_!ahfg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic 848w, https://substackcdn.com/image/fetch/$s_!ahfg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic 1272w, https://substackcdn.com/image/fetch/$s_!ahfg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c31d7e6-d129-4ff6-a4ce-8f3009ad609a_2166x1894.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Epicycles in the geocentric model of the solar system, from the 1771 Encyclopedia Britannica.</figcaption></figure></div><h2>1. Bad AI will be bad for science</h2><p>In this week&#8217;s issue of <em>Nature</em>, computer scientists Arvind Narayanan and Sayash Kapoor <a href="https://www.nature.com/articles/d41586-025-01067-2">warn</a> about the potential for bad AI to set science back. Narayanan and Kapoor are the authors of <em><a href="https://www.aisnakeoil.com/p/starting-reading-the-ai-snake-oil">AI Snake Oil</a></em>, a book warning about AI hype, and one that I included in my <a href="https://www.thisgenomiclife.org/p/how-to-teach-yourself-about-ai">recent post</a> of recommendations for learning about AI. The authors suggest that an overreliance on AI in science is bad for two reasons. </p><p>The first is incompetence: like conventional statistics, AI can be used badly by people who lack the training or commitment to use it properly. &#8220;Machine-learning modelling is the chainsaw to the hand axe of statistics &#8212; powerful and more automated, but dangerous if used incorrectly.&#8221; One of science&#8217;s dirty secrets is that many, if not most scientists have inadequate formal training in statistics. That includes me&#8211;I never took a stats course or even a course with a serious statistical component. Bad statistical practices like p-hacking, due in part to poor training, has caused pervasive reproducibility problems in some fields. Widespread, inexpert use of AI has the potential to be much worse. Narayanan and Sayash write that </p><blockquote><p>[E]rrors are becoming increasingly common, especially when off-the-shelf tools are used by researchers who have limited expertise in computer science. It is easy for researchers to overestimate the predictive capabilities of an AI model, thereby creating the illusion of progress while stalling real advancements.</p></blockquote><p>One of the critical problems is &#8220;leakage&#8221;, which means that the test dataset is not fully independent of the training data. If information is shared between test and training data, AI models will score well by just memorizing the training data, and then fail to generalize to other datasets when someone tries to use the model for another analysis. The authors created <a href="https://reproducible.cs.princeton.edu/#rep-failures">a list</a> of recent studies across disciplines that identify common problems with data leakage in published research. Everyone should read a few of those papers to learn about problems in their field.</p><p>It is common in genomic AI modeling to evaluate models on held-out data from the same experiment, such as RNA reads from a subset of chromosomes in a single RNA-sequence experiment. While this may not formally count as data leakage, I don&#8217;t think it&#8217;s great for the field to almost exclusively evaluate AI models on test data taken from the same experiments as the training data. We should hold ourselves to higher standards and evaluate our models either on independent benchmark datasets, or, if feasible, just do another experiment to generate your test data.</p><p>The other problem that Narayanan and Kapoor worry about this that we&#8217;ll confuse AI predictive success for real conceptual progress:</p><blockquote><p>AI excels at producing the equivalent of epicycles. All else being equal, being able to squeeze more predictive juice out of flawed theories and inadequate paradigms will help them to stick around for longer, impeding true scientific progress</p></blockquote><p>In other words, we shouldn&#8217;t outsource our thinking to AI models. This gets to the ultimate goal of scientific theories, which, aside from their practical uses, is to describe nature in a way that is comprehensible by humans. Science is for human minds. Centuries of scientific progress show that a program of ruthless experimental testing of big ideas about the world leads to new discoveries. In genomics, AI can be a powerful discovery tool that will help us learn how genomes function, but in the end what we want is an <em>understanding</em> that explains how DNA sequence, interacting with other molecular players, leads to cellular and organismal phenotypes.</p><p>Aside from a commitment to the empirical foundations of science, one of the most important solutions to the problems raised by Narayanan and Kapoor is education. Everyone, especially everyone who does work or aspires to work in science should <a href="https://www.thisgenomiclife.org/p/how-to-teach-yourself-about-ai">learn about AI</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h2>2. Different diseases, same genes: mutations in cancer and autism</h2><p>My WashU Genetics Department colleague Tychele Turner and her collaborator Rachel Karchin of Johns Hopkins have a <a href="https://doi.org/10.1016/j.xgen.2025.100807">paper</a> out in <em>Cell Genomics</em> looking at genetic variation genes that are associated with both cancer and neurodevelopmental disorders (NDDs) like autism. They sought to learn why mutations or genetic variants in the same genes can lead to different outcomes (cancer or NDDs). They looked at missense variants, whose effects are often difficult to figure out because these variants cause single amino acid changes in a protein, as opposed to larger, more obviously damaging mutations. </p><p>The challenge with any genetic variant analysis is that there are many variants that exist in cancer or in individuals with NDD that have no effect whatsoever on disease. Thus geneticists try to come up with clever ways to sift the wheat from the chaff. In this case, Turner and Karchin hypothesized that causal variants for either NDD or cancer would cluster in similar parts of the protein. That is, if we take a protein that plays a role in both cancer development and NDD, we would see that mutations that genuinely lead to NDD would disrupt the protein in similar ways. Mutations that cause cancer would affect the same protein but in different a way, and thus would cluster in a different part of the protein. That is indeed what they found:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CLHH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CLHH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic 424w, https://substackcdn.com/image/fetch/$s_!CLHH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic 848w, https://substackcdn.com/image/fetch/$s_!CLHH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic 1272w, https://substackcdn.com/image/fetch/$s_!CLHH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CLHH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic" width="463" height="362" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:362,&quot;width&quot;:463,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:26236,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161100268?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CLHH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic 424w, https://substackcdn.com/image/fetch/$s_!CLHH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic 848w, https://substackcdn.com/image/fetch/$s_!CLHH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic 1272w, https://substackcdn.com/image/fetch/$s_!CLHH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87c23871-a3f1-4601-8c85-ee9f6563e4b2_463x362.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 2B from Ng, <em>et al.</em>, <em><a href="https://doi.org/10.1016/j.xgen.2025.100807">Cell Genomics</a></em><a href="https://doi.org/10.1016/j.xgen.2025.100807"> 5:100807</a>. CC BY-NC-ND 4.0</figcaption></figure></div><p>In the figure above, showing two views of the same protein, blue represents sites of mutations found in cancers, while red represents genetic variants found in NDD patients. You can see that the sites of the clustered mutations are different for different diseases,  suggesting that causal mutations for each disease affect different parts of the protein.</p><p>The study illustrates the power of AI for discovery in genomics&#8211;the scientists used AlphaFold structure predictions to perform their 3D clustering analysis. Looking at the 3D structure is important because mutations that may be distant in the linear sequence of the protein may be close together in folded, 3D structure. The big takeaway is that while both NDD and some cancers are associated with missense mutations in the same gene, the patterns of missense mutations, at least for some genes, differ between the two diseases. The same gene can be damaged in different ways and lead to dramatically different outcomes.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/this-weeks-finds-bad-ai-is-bad-science?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/this-weeks-finds-bad-ai-is-bad-science?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h2>3. Data problems for AI models in drug development</h2><p>In 2023, a group of news organization <a href="https://www.npr.org/2025/01/14/nx-s1-5258952/new-york-times-openai-microsoft">sued</a> OpenAI and Microsoft for copyright infringement, claiming that GPT was trained on millions of copyrighted works without permission from the copyright holders. The suit highlights one of the challenges of cutting edge progress in AI: you need enormous amounts of data, on a scale that couldn&#8217;t possibly be generated by one organization. At the same time, it&#8217;s probably not feasible in most cases to pay for access to all of the data you need, when that data is scattered among many different owners. If the data you need is a bunch of online content, then, like OpenAI and others, you just scrape everything you can get from the internet and figure you can settle the lawsuits later.</p><p>But this strategy doesn&#8217;t work if your AI needs to be trained on big datasets obtained from experiments, which aren&#8217;t always so east scrape from the web. A <a href="https://www.science.org/doi/10.1126/science.adx0339">comment piece</a> in the latest issue of <em>Science</em> highlights the lack of large, high quality open datasets that could be used to advance AI methods for drug discovery. Not even the largest pharmaceutical companies will, at least any time soon, have their own datasets at the scale needed to implement truly cutting-edge AI. The whole drug discovery enterprise would be better off if the players involved collectively supported the development of high quality, <em>open</em> datasets that would allow the field, including industry AI teams, to innovate:</p><blockquote><p>To help AI mature, developers need nonproprietary, open, large, high-quality datasets to train and validate models, managed by independent organizations.</p></blockquote><p>The authors draw an analogy to the bad old days of gene patents, when a lot of useful genomic data was locked up in proprietary databases. Open genomic data has spurred an enormous amount of innovation, and the same thing could happen for AI-based drug discovery if academic and industry players worked together to contribute open data. There is an opportunity here to avoid the problems at the center of the news versus OpenAI lawsuit, and to make deliberate decisions about how to support AI innovation in this field. While I&#8217;m on board with the open datasets, I do think the authors of the opinion piece underestimate the impact of task-specific AI models that can be trained on smaller datasets. Not everything needs to be GPT or AlphaFold. </p><h2>4. Can statistical models really just regress out all confounding factors?</h2><p>Some time ago I read a piece by Brown University health economist Emily Oster that has stuck with me. I can&#8217;t remember the original piece, but there is a <a href="https://parentdata.org/why-i-look-at-data-differently/">2024 version</a> on her blog that confronts a critical question about epidemiological studies: is it really possible to control for all of the confounding factors in a statistical model? Oster suggests that confounding is much worse than we typically think.</p><p>There are many common examples epidemiological claims that are a longstanding source of controversy, such as the question of whether moderate drinking is better or worse than no drinking, whether there is a risk between certain foods and cancer, etc. For example, are people who drink more at a higher risk for cancer because of the alcohol, or because those who drink more also tend to smoke? To answer that, epidemiologists try to control for smoking (and age, sex, exercise, etc.) in their analyses of what alcohol&#8217;s effects are.</p><p>Oster gives another example of this kind of reasoning:</p><blockquote><p>[T]he reason we know that the processed food groups differ a lot is that the authors can see the characteristics of individuals. But because they <em>see </em>these characteristics, they can adjust for them (using statistical tools). While it&#8217;s true that education levels are higher among those who eat less processed food, by adjusting for education we can come closer to comparing people with the same education level who eat different kinds of food.</p></blockquote><p>But what about controlling for the variables that you don&#8217;t see? This is of course something that is well known to statisticians, called residual confounding. Oster says:</p><blockquote><p>We all agree that this is a concern. Where we differ is in how much of a limitation we believe it to be. <em>In my view, in these contexts (and in many others), residual confounding is so significant a factor that it is hopeless to try to learn causality from this type of observational data.</em></p></blockquote><p>She argues that many of the differences among human study subjects that statisticians treat as random (after controlling for confounders) are actually not random. They are much more highly correlated with confounders than scientists think and, importantly, subject to social feedbacks in a way that is not sufficiently appreciated. </p><p>Oster gives an example of studies looking at a link between breastfeeding and intelligence:</p><blockquote><p>The link between breastfeeding and IQ is a good example. This is a research space where you can find many, many papers showing a positive correlation. The concern, of course, is that moms who breastfeed tend to be more educated, have higher income, and have access to more resources. These variables are also known to be linked to IQ, so it&#8217;s difficult to isolate the impacts of breastfeeding.</p></blockquote><p>So what happens to this link when you control for confounding variables? She points to a <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC1633819/">2006 paper</a> that controlled for parent IQ and demographics, as most papers do. But then they kept adjusting for additional effects, such as within family effects (comparing siblings, only one of whom was breastfed). After these adjustments, <em>the correlation between breastfeeding and IQ disappeared entirely.</em></p><p>How bad is this problem across all observational studies in economics, health, sociology, etc? Oster thinks it&#8217;s very bad, &#8220;that a huge share of the correlations we see in observational data are not close to causal.&#8221; These poor studies lead to bullshit headlines and bad media coverage, and populace that is confused by a series of flip-flopping scientific claims about moderate drinking, processed foods, breastfeeding, etc. I&#8217;m inclined to agree&#8211;we should be very skeptical about the ability of scientists to statistically control for all of the relevant confounding variables in observational studies. While in some cases observational studies are the best we can do, being &#8216;the best we can do&#8217; should not be an excuse to simply accept flawed evidence.</p><h2>5. The 100-year old science fiction of the Radium Age.</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hlWt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hlWt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic 424w, https://substackcdn.com/image/fetch/$s_!hlWt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic 848w, https://substackcdn.com/image/fetch/$s_!hlWt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic 1272w, https://substackcdn.com/image/fetch/$s_!hlWt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hlWt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic" width="188" height="273.2878048780488" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:596,&quot;width&quot;:410,&quot;resizeWidth&quot;:188,&quot;bytes&quot;:29761,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/161100268?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hlWt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic 424w, https://substackcdn.com/image/fetch/$s_!hlWt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic 848w, https://substackcdn.com/image/fetch/$s_!hlWt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic 1272w, https://substackcdn.com/image/fetch/$s_!hlWt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F008409de-622a-434c-8a0b-fdd6d8429234_410x596.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To round off the newsletter, here&#8217;s something for fans of science fiction. Most people who are into science fiction are aware of the so-called Golden Age, when pulp magazines dominated the genre. This ran largely from the 1930&#8217;s (John Campbell became editor of <em>Astounding</em> in 1937) through first decade or so after WWII when mass market paperbacks took over. But before the Golden Age was the fascinating Radium Age, which was both more innovative and less male-dominated than what came later. Science fiction of the first quarter of the 20th century, called the <a href="https://www.hilobrow.com/2012/12/23/the-radium-age/">Radium Age</a> by publisher Joshua Glenn and characterized by him as filled with a &#8220;dizzying, visionary blend of acerbic social commentary and shock tactics.&#8221; Most of the classic plots and character types that we ascribe to the Golden Age were already present in the Radium Age. There is a <a href="https://www.hilobrow.com/sisters/">long list</a> of really interesting works by women of this era. Consider the <a href="https://www.hilobrow.com/radium-age/">plot</a> of a book written in 1909:</p><blockquote><p>When Mary Hatherley, an intrepid British explorer, is kicked in the head by the camel she was riding through the Arabian desert, she finds herself transported to what seems to be an alternate version of Earth. Arriving in Armeria, she discovers a society in which the very concept of gender is unknown.</p></blockquote><p>How about a 1935 Bengali <a href="https://mitpress.mit.edu/9780262547611/the-inhumans-and-other-stories/">cult classic</a> describing lost race of Bengali supermen in Uganda?</p><p>The era also includes a <a href="https://thefinchandpea.com/2015/01/26/apocalypse-1912-naturalisms-post-apocalypse-novel/">post-apocalyptic plague novel</a> by Jack London, a satirical cyborg novel, H.G. Wells&#8217; 1913 novel about nuclear war, a 1926 eugenic dystopia, and much more. MIT Press is publishing many of these works, so if this is sounds interesting to you, go check out <a href="https://www.hilobrow.com/radium-age/">the series.</a> </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Treating disease rationally with antisense oligo therapy]]></title><description><![CDATA[How nucleic acids make rational drug design easier.]]></description><link>https://www.thisgenomiclife.org/p/treating-disease-rationally-with</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/treating-disease-rationally-with</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 04 Apr 2025 21:01:53 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!sMav!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>People in every society in history have used drugs to try to treat illness. Sometimes they were successful, but more often they were not. Until recently, the discovery of effective drugs was very slow, and those that worked were used alongside ineffective folk remedies. In truth, effective drugs <em>were</em> folk remedies that happened to work. By the 19th century, medicinal chemists <a href="https://collections.countway.harvard.edu/onview/exhibits/show/apothecary-jars/nineteenth-century-drugs">were attempting</a> to make drug discovery more systematic by performing experiments on animals and making rigorous observations. But they were not particularly successful, largely because the molecular underpinnings of disease were almost completely invisible to 19th century biologists. </p><p>Drug development became much more effective in the 20th century as medical scientists began to rationally design potential therapies grounded in a molecular-level understanding of disease. One of the great successes of 20th century medicine is the treatment of childhood acute lymphoblastic leukemia, which was almost 100% fatal in 1900. Millennia of human efforts to treat disease had been completely ineffective at finding a successful treatment for what we now call childhood leukemia. In the mid-20th century, <a href="https://www.dana-farber.org/about/history/sidney-farber">Sidney Farber</a> and his colleagues transformed the treatment of childhood leukema when they introduced folic acid antagonists as chemotherapy. This treatment was based on their growing understanding of the role of folic acid metabolism in leukemia. After decades of further improvement, nearly 90% of children diagnosed with childhood leukemia now survive.</p><p>As our understanding of the molecular basis of disease has grown, so has our ability to rationally design drugs. While we still rarely know all of the relevant molecular variables involved in any disease, we do have a significant capacity to rationally design potential therapies based on our knowledge of the proteins and other molecules that are active in pathogenic pathways. We get a lot of mileage out of a fairly simple framework: Disease can often be treated by inhibiting or activating specific macromolecules (proteins and RNA). Therefore, we can devise new treatments by designing therapeutics that physically bind to a target molecule, in order to inhibit or activate it.</p><p>It&#8217;s hard to state how profoundly different this framework is from what came before. In 1949, Linus Pauling, Harvey Itano, and Ibert Wells published a <a href="https://www.jstor.org/stable/1676635">classic paper</a> in <em>Science</em>, titled &#8220;Sickle Cell Anemia, a Molecular Disease.&#8221; In it they compared the chemical and physical properties of hemoglobin from patients with sickle cell disease to hemoglobin from unaffected people. They proposed, for the first time in history, an evidence-based molecular mechanism of disease, which, they said, &#8220;if it is correct, it supplies a direct link between the existence of &#8216;defective&#8217; hemoglobin molecules and the pathological consequences of sickle cell disease.&#8221;</p><p>Of course, the route from an understood molecular disease mechanism to a successful cure can be a long one. Just last year, a <a href="https://www.thisgenomiclife.org/p/the-first-crispr-gene-therapy-is?r=bwtw3&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcomeOnShare=false">CRISPR gene therapy</a> for sickle cell disease became the first CRISPR-based therapy approved by the FDA, 75 years after the paper by Pauling and his colleagues. But our improving capacity for rational drug design allows scientists come up with potential therapies much faster, and enables us to treat diseases that no amount of human ingenuity could have solved under the old paradigm of drug discovery.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/treating-disease-rationally-with?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/treating-disease-rationally-with?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h2>Rational drug design with nucleic acids</h2><p>Today, rational drug design most often refers to using 3D models of protein structure to find potential surfaces to which a designed chemical or peptide inhibitor could bind. One of the most famous examples is <a href="https://www.nature.com/scitable/topicpage/gleevec-the-breakthrough-in-cancer-treatment-565/">Gleevec</a>, a molecule that was <a href="https://www.nature.com/scitable/content/Effects-of-a-selective-inhibitor-of-the-26670/">specifically designed</a> to inhibit the ATP binding pocket of the Abl protein kinase that drives chronic myelogenous leukemia. This kind of rational drug design is still difficult, because most proteins do not have an obvious druggable surface or binding pocket. The development of therapeutic antibodies and, more recently, <a href="https://www.nature.com/articles/d41586-024-01383-z">AI models</a> of protein structure do help, but the fact remains that it is very difficult to design specific inhibitors or activators of particular proteins.</p><p>But one class of therapeutics avoids the complex problem of 3D modeling altogether&#8211;DNA and RNA. Because nucleic acids interact (usually) through standard Watson-Crick base pairing, you don&#8217;t need to solve a complex 3D puzzle to design a DNA or RNA molecule that binds a specific target; all you need is the sequence of A&#8217;s, C&#8217;s, G&#8217;s, and T&#8217;s from a genome database. And because every protein that you might want to inhibit is derived translation from RNA, there is an RNA target for everything. It is completely trivial to design an RNA or DNA molecule that will bind to an RNA target. Perhaps the most striking example of the ease of RNA design is the COVID mRNA vaccine, which took just two days after the virus sequence was posted.</p><p>I don&#8217;t want to get too carried away&#8211;mRNA vaccines and oligonucleotide therapies rest on decades of work. It may be simple to design a DNA or RNA sequence, but raw DNA and RNA delivered into human tissues are not effective. They need to be chemically modified to increase their stability, reduce their toxicity, etc. It was this work that led to the 2023 Nobel Prize awarded to Katalin Karik&#243; and Drew Weissman for their work on mRNA vaccines.</p><h2>Rationally treating disease with antisense oligonucleotides </h2><p>In 2017, the <em>New England Journal of Medicine</em> published an <a href="https://pubmed.ncbi.nlm.nih.gov/29091570/">astonishing example</a> of rational drug design. The study reported results of a new treatment for a neurodegenerative disease called spinal muscular atrophy (SMA). In its severe form, symptoms of motor neuron degeneration appear in infants, who are never able to sit up and eventually can&#8217;t breathe unassisted. SMA was first recognized as a distinct disease in the 1890&#8217;s, and it was invariably fatal before a child turned one or two. (There are &#8220;milder&#8221; forms of the disease with symptoms that appear in adults and become progressively worse.) In one tragic example, described in 1902, four out of eight children in the same family <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC10418635/">died of SMA</a>:</p><blockquote><p>In 1902, Beevor described a male patient with a more severe form of SMA. The patient was the eighth child in a family with eight children, and he was the fourth affected child. The first affected sibling (sister) was noticed to be paralyzed all over at the end of the first month of life and died before 4.5 months of age. The second affected sibling (sister) developed symptoms at 6 months of age, and died at 8 months of age. The third affected sibling (gender unknown) developed symptoms at 6 weeks of age and died at 7 months of age. The patient was noticed to be paralyzed all over, excepting the diaphragm, before 5 weeks of age, and died after living eight weeks.</p></blockquote><p>In the 2017 study, led by Richard Finkel of the Nemours Children&#8217;s Hospital in Orlando, Florida, 51% of infants treated with a new therapeutic reached a developmental milestone of motor function, compared to 0% of infants who received the placebo. The effect was so impressive that the study was terminated early, so that other children could be treated. A <a href="https://pubmed.ncbi.nlm.nih.gov/37409780/">related study</a>, which began treating infants with the disease before the onset of symptoms found that all 25 children in the study were alive several years later, at a median age of 4.9. The new therapeutic that successfully treated this fatal disease was nusinersen, an antisene oligonucleotide (ASO). Nusinersen was not the first ASO treatment, but it has become the most successful. The nusinersen story illustrates the power of rational nucleic acid drug design </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sMav!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sMav!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic 424w, https://substackcdn.com/image/fetch/$s_!sMav!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic 848w, https://substackcdn.com/image/fetch/$s_!sMav!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic 1272w, https://substackcdn.com/image/fetch/$s_!sMav!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sMav!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic" width="1179" height="752" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/26589f14-46f1-4258-b150-86939cb77305_1179x752.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:752,&quot;width&quot;:1179,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:51601,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/160543204?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sMav!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic 424w, https://substackcdn.com/image/fetch/$s_!sMav!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic 848w, https://substackcdn.com/image/fetch/$s_!sMav!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic 1272w, https://substackcdn.com/image/fetch/$s_!sMav!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26589f14-46f1-4258-b150-86939cb77305_1179x752.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 1 of Finkel, <em>et al.</em>, <em>N Engl J Med</em> 2017;377:1723-1732</figcaption></figure></div><p>The concept of an ASO to treat disease is simple, though development is still challenging. Thinking back on our framework for rational drug design, the goal is typically to design a drug that either inhibits or activates a molecular target. In the case of ASOs, the molecular target is the RNA transcript of a gene. The ASO is designed to have a sequence that is the reverse complement (&#8216;antisense&#8217;) of a short region of the RNA molecule. When the ASO binds to the RNA, the RNA can be affected in several ways. The most straightforward effect is shown below: ASO-bound RNA is degraded by RNase H1, which breaks down RNA/DNA double helices. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0jqM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0jqM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic 424w, https://substackcdn.com/image/fetch/$s_!0jqM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic 848w, https://substackcdn.com/image/fetch/$s_!0jqM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic 1272w, https://substackcdn.com/image/fetch/$s_!0jqM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0jqM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic" width="436" height="314.8888888888889" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:442,&quot;width&quot;:612,&quot;resizeWidth&quot;:436,&quot;bytes&quot;:12184,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/160543204?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0jqM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic 424w, https://substackcdn.com/image/fetch/$s_!0jqM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic 848w, https://substackcdn.com/image/fetch/$s_!0jqM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic 1272w, https://substackcdn.com/image/fetch/$s_!0jqM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4944c7-3bcf-4315-ad44-5dca7a91fdc5_612x442.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 1A from Crooke, <em>et al.</em>, <em><a href="https://pubmed.ncbi.nlm.nih.gov/33600796/">J Biol Chem </a></em><a href="https://pubmed.ncbi.nlm.nih.gov/33600796/">2021 Jan-Jun:296:100416</a>, CC BY-NC-ND 4.0</figcaption></figure></div><p>Other ways that an ASO destabilized an RNA target is prevent translation of the RNA into protein by physically blocking the binding of the ribosome, or by preventing the RNA from being properly spliced, leading to nonsense-mediated decay. You can also stabilize an RNA in various ways, including by physically obstructing a binding site for a negatively-regulating microRNA (miRNA), thereby preventing RNA degradation. This <a href="https://pubmed.ncbi.nlm.nih.gov/33600796/">recent review</a> has a nice discussion of different approaches. The key point is that, while a big fraction of proteins in their 3D forms might be undruggable, <em>every</em> messenger RNA that codes for those proteins can likely be inhibited or stabilized by an ASO. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><p>In the case of nusinersen, a rational design was based on careful knowledge of the underlying cause of SMA. In the 1990&#8217;s scientists mapped the genetic basis of SMA to a gene called <em>SMN1</em>, which is necessary for the survival of motor neurons. When an infant is born with two loss-of-function mutations in <em>SMN1</em>, they develop the disease. However, there is a gene that can compensate for the loss of <em>SMN1</em>, called, sensibly enough, <em>SMN2</em>. The problem is that <em>SMN2</em> rarely produces a functional protein, because it is spliced improperly, and one of the <a href="https://www.musculardystrophyuk.org/research/what-is-exon-skipping-and-how-does-it-work/">critical exons is skipped</a>. In 2006, a team led by Ravindra Singh at U Mass Worcester <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC1367187/">described</a> an ASO that that blocked the site on the RNA that caused exon-skipping. The ASO thereby led to the production of functional SMN2 protein. In patients whose <em>SMN1 </em>genes are non-functional, nusinersen increases the expression of functional <em>SMN2</em>, which compensates for the missing <em>SMN1</em>. </p><p>It took 10 years to go from this original ASO design to an FDA-approved drug. For a previously untreatable, fatal genetic disease, that is a fairly fast drug development timeline. Since the approval of nusinersen, gene therapy to cure the disease and a small molecule drug that has the same effect as nusinersen have been approved. That&#8217;s great news for SMA patients, and nusinersen is now being considered a bridge therapy to keep patients healthy before receiving gene therapy.</p><h2>Future ASO Therapeutics</h2><p>I don&#8217;t want to sound too pollyannish; identifying individuals with ASO-correctable diseases and finding RNA sites that can be effectively targeted with ASOs requires careful development. And delivery of ASOs can also be a challenge. To treat neurological diseases, ASOs, which can&#8217;t get through the blood-brain barrier, need to be injected intrathecally (into the spinal cord). SMA patients have a miracle drug available, but it requires such injections every four months. </p><p>But the tremendous advantages of rational drug design with ASOs, which now include a massive collection of biotechnological and computational tools that didn&#8217;t exist in 2006 when nusinersen was designed, has created a growing opportunity. One of these includes the ability to create organoids derived from a patient's cells, which allows scientists to <a href="https://www.nature.com/articles/s41586-024-08462-1">rapidly screen</a> many different ASOs for effectiveness in the lab. These new tools have created <a href="https://www.nature.com/articles/s41392-024-02112-8?fromPaywallRec=false">broader opportunities</a>, and ASOs are in development to treat <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC11592788/">various cancers</a> and <a href="https://pubmed.ncbi.nlm.nih.gov/31283897/">other neurological diseases</a> like Huntington&#8217;s.</p><p>One of the most interesting opportunities is the potential to treat very rare genetic diseases, even diseases for which the size of the patient population might only be 1. In a <a href="https://www.nature.com/articles/s41467-024-54077-5">review article</a> published last year, a group called the N=1 Collaborative, led by Timothy Yu at Boston Children&#8217;s Hospital, argued that the potential to create rationally designed, individualized therapies with of ASOs and other nucleic acids requires the medical community to think about how to evaluate treatments designed for a single person. There are thousands of extremely rare genetic diseases, which cumulatively affect a lot of people. Many of these diseases could be treated with ASOs, but there are too few individuals in each case to assemble a statistically well-powered clinical trial to evaluate the therapy. Yu and his colleagues layout a framework for developing rigorous N=1 trials. </p><p>In a sense, such efforts at individualized drug testing are a throwback to the era before rational drug design, when potential therapies were tried on small numbers of patients, and physicians would carefully follow the patients to see whether they got better or worse. Without a foundation of knowledge of the biological basis of disease, this approach wasn&#8217;t very effective. But with today&#8217;s molecular knowledge and capacity for rational drug design, we  have the chance to reclaim part of that older paradigm and design afforable, individualized treatments for thousands of rare diseases.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[How to teach yourself about AI]]></title><description><![CDATA[Recommended reading from easy to graduate level.]]></description><link>https://www.thisgenomiclife.org/p/how-to-teach-yourself-about-ai</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/how-to-teach-yourself-about-ai</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Mon, 31 Mar 2025 22:36:12 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!F5g7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em><strong>Production note:</strong> Posting has been slow due to travel and grant/teaching deadlines. In the pipeline are some upcoming posts on anti-sense oligos as therapy, neoantigen vaccines, new developments in spatial transcriptomics, benchmarking AI models of gene regulation, etc. This newsletter is meant to be much more than link roundups and book recommendations, but today I need to make due with some book recommendations. I hope these are useful.</em></p><p>One of the most important abilities you need for a successful career is knowing how to update your skills. As I tell my kids, this isn&#8217;t just important advice for scientists, who work in a profession that is supposed to generate new knowledge and new technology, and thus is always changing. The <em>New York Times</em> <a href="https://www.nytimes.com/interactive/2025/03/28/style/gen-x-creative-work.html?unlocked_article_code=1.8E4.76ac.YT0mGnleljN6&amp;smid=url-share">recently covered</a> the &#8220;Gen X career meltdown&#8221;, describing the challenges of media professionals who entered journalism, advertising, film, etc. in the 1990&#8217;s and early 2000&#8217;s. The media business looks nothing like it did 20-30 years ago. </p><p>The same is true of much of biology. When I started grad school, a little over 20 years ago, I had no sense whatsoever of the pace of change in science. While some of the skills I learned in grad school (<em>e.g.</em>, yeast molecular genetics) are <a href="https://pubmed.ncbi.nlm.nih.gov/39480920/">still useful</a>, most of them, in my graduate field of functional and structural genomics, haven&#8217;t withstood the transformative impact of next-generation sequencing, CRISPR, cryoEM, deep learning, long-read sequencing, and high-throughput imaging. What <em>did</em> withstand those transformations was solid training in scientific reasoning and in the ability to teach myself what I needed to learn. Back then, when the draft human genome sequence was still hot off the press, some of my fellow grad students and I decided to teach ourselves Perl and R so that we could be better participate in this trendy field called genomics.</p><p>In that spirit, I want to encourage everyone learn more about AI, including scientists whose training and work may be mostly non-computational. Especially if you&#8217;re still early in your career, you have time to build your skills and bring deep learning into your work. To be clear, if you&#8217;re serious about doing computational biology, you&#8217;ll need to do more than just read some books; you&#8217;ll need to spend some time training with real computational biologists. But it is possible to, in the words of Harvard computational biologist Sean Eddy, &#8220; go where a question takes you, not where your training left you.&#8221; Eddy described his own background, in this great <a href="https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.0010006">2005 piece</a>, &#8220;Antedisciplinary Science&#8221;:</p><blockquote><p>I've been a computational biologist for about 15 years now. We're still not quite sure what &#8220;computational biology&#8221; means, but we seem to agree that it's an interdisciplinary field, requiring skills in computer science, molecular biology, statistics, mathematics, and more. I'm not qualified in any of these fields. I'm certainly not a card-carrying software developer, computer scientist, or mathematician, though I spend most of my time writing software, developing algorithms, and deriving equations. I do have formal training in molecular biology, but that was 15 years ago, and I'm sure my union card has expired. For one thing, they all seem to be using these clever, expensive kits now in my wet lab, whereas I made most of my own buffers (after walking to the lab six miles in the snow, barefoot).</p></blockquote><p>For the antedsciplinary scientists (and non-scientists) out there, here are my recommendations for books to learn AI. They range from gentle and popular introductions to fantastic textbooks that rigorously cover the math. (Eddy&#8217;s piece, by the way, helped tip the scales in favor of my decision to come to Washington University in St. Louis, where Eddy was at the time. I wanted to be around people who thought like that.)</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h2>1. The Hundred-Page Machine Learning Book, Andriy Burkov</h2><p>If you want to spend one weekend jump starting your AI education, <a href="https://themlbook.com">this is the book</a>. Anyone with decent high school math skills can pick this up and learn about probability, linear and logistic regression, decision trees, SVMs, basic learning algorithms, and your standard neural networks. There are some small code examples in Python. It&#8217;s well-written, well-illustrated and covers all of the basics. After reading this book you&#8217;ll know the basic vocabulary of machine learning, have a general idea of what the basic algorithms are, and know how they are applied to some standard problems. I recommend this book as your first foray into AI because it includes accessible math, rather than only vague verbal descriptions of what AI does. You can download chapters for free to try before you buy.</p><h2>2. AI Snake Oil, Arvind Narayanan and Sayash Kapoor</h2><p>A colleague serving on a university AI committee with me recommended this book. <a href="https://www.aisnakeoil.com/p/starting-reading-the-ai-snake-oil">Read this book</a>, by two computer scientists, to teach yourself not to be credulous about AI. The authors write a popular Substack bearing the same name of the book (see the previous link), and their big argument is this: predictive AI is wildly overrated, performing worse than advertised, while generative AI is much more useful. They are very skeptical that AGI (artificial general intelligence) is imminent, and find AGI itself to not be a very coherent concept. It&#8217;s a good book, another one you can read in a weekend.</p><p>The most important message, not emphasized enough in contemporary discussions of AI, is that we should be very skeptical about AI performance claims, because so often models that perform well on benchmark datasets used during development will fail when applied in another context. Domain shift is when a model is asked to make predictions in a setting that is systematically different from its training data, and it is one of the most difficult problems to solve in AI. But that fact doesn&#8217;t stop snake oil salesmen from overselling their models to predict everything from insurance risks to recidivism to product recommendations. As a culture, we should be <em>much</em> more skeptical of predictive AI.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/how-to-teach-yourself-about-ai?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/how-to-teach-yourself-about-ai?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h2>3. Why Machines Learn, Anil Anathaswamy</h2><p>AI pioneer Geoffrey Hinton blurbed this book as &#8220;a masterpiece,&#8221; and he&#8217;s right. If you want to read a popular-level book that covers the math and history of AI, in a non-technical way, this is the <a href="https://anilananthaswamy.com/why-machines-learn">book to read</a>. It&#8217;s pitched at about the level of Cornell mathematician Steven Strogatz&#8217;s book <em>Infinite Powers</em> or Johns Hopkins physicist Sean Carroll&#8217;s <em>Biggest Ideas in the Universe</em> books. Anathaswamy takes the story of AI from the perceptron in the 1950&#8217;s up to today&#8217;s deep learning models, while very gently presenting the math along the way. He&#8217;s interviewed many of the pioneers in the field, who describe what it felt like to work at various stages in machine learning history. Read this book to build your intuition for the field, learn some history, and to get a lucid layperson&#8217;s explanation of Bayes&#8217; theorem, matrices and vectors, maximum likelihood estimation, and the other important math that makes AI work.</p><h2>4. The Science of Deep Learning, Iddo Drori</h2><p>Now we get to real textbooks. Iddo Drori wrote this <a href="https://www.thescienceofdeeplearning.org">concise textbook</a> based on courses he taught. If you have a decent background in calculus and probability, you can dive right in to deep learning with this book. It starts with forward and backpropagation, and covers optimization, various deep learning architectures, including transformers, generative models, and reinforcement learning. If you&#8217;re busy with your day job but want to learn the math behind the critical backpropagation algorithm, or to understand how transformers work, this textbook has what you want.</p><p>Along with the clear explanations and rigorous math, there are very helpful illustrations. When it comes to math, I&#8217;m a visual learner. I can parse the math in a condensed description of backpropagation, but I can&#8217;t get an intuition about how it works without pictures. Drori&#8217;s illustrations are very helpful for building intuition. Some of the chapters are free online - see the link above.</p><h2>5. Probabilistic Machine Learning, Kevin Murphy</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!F5g7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F5g7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic 424w, https://substackcdn.com/image/fetch/$s_!F5g7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic 848w, https://substackcdn.com/image/fetch/$s_!F5g7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic 1272w, https://substackcdn.com/image/fetch/$s_!F5g7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F5g7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic" width="1456" height="977" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:977,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:395738,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/160291467?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F5g7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic 424w, https://substackcdn.com/image/fetch/$s_!F5g7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic 848w, https://substackcdn.com/image/fetch/$s_!F5g7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic 1272w, https://substackcdn.com/image/fetch/$s_!F5g7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa38e8b82-284e-44c1-9899-9add90d50061_2546x1708.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is a <a href="https://probml.github.io/pml-book/book1.html">recent update</a> of a classic textbook. In contrast to Drori&#8217;s concise book, which starts <em>in media res</em> with backpropagation, this book starts at the beginning with a rigorous presentation of basic probability and statistics. While in principle anyone could pick up this book and learn probability from scratch, the reality is that the notation and concepts, while clearly presented, will be hard to follow if you don&#8217;t have some college-level background in this field. However, the good news is that, if you have had some introductory, college-level calculus, probability, and statistics, this book has everything you need to learn the math of AI. </p><p>The book is <a href="https://probml.github.io/pml-book/book1.html">free online</a> via Github and has tons of figures and code examples, all of which can be downloaded as jupyter notebooks. For advanced, up-to-date topics, there is the 1300-page volume 2. I have a whole shelf full of math and statistics textbooks, few of which are written as clearly as this one by Kevin Murphy. If you are really serious about learning the mathematical foundations of AI, and ready to invest a substantial chunk of time, this is the book to use.</p><p>The one thing missing from these recommendations is programming for AI. There are plenty of good books out there; I like <a href="https://www.packtpub.com/en-us/product/deep-learning-with-tensorflow-and-keras-3rd-edition-9781803245713">this one</a> from the publisher Packt.</p><p>If you like recommendations for learning scientific topics like this, I suggest checking out John Baez&#8217;s <a href="https://math.ucr.edu/home/baez/books.html">How to Learn Math and Physics</a>.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[This week's finds: the purpose of health research, the mystery of long distance enhancers, GMO regulations, and more.]]></title><description><![CDATA[Some worthwhile links to start your week.]]></description><link>https://www.thisgenomiclife.org/p/this-weeks-finds-the-purpose-of-health</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/this-weeks-finds-the-purpose-of-health</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Mon, 10 Mar 2025 12:01:36 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!VfLU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>1. What is the purpose of academic health research?</h2><p>Our new dean of the school of public health at Washington University in St. Louis, Sandro Galea, lays out the real stakes in the current head-on assault of our health institutions in an <a href="https://jamanetwork.com/journals/jama/fullarticle/2830676">editorial</a> he co-authored with UC San Francisco&#8217;s Kirsten Bibbins-Domingo. They start out by correctly noting that the Trump administration&#8217;s crudely implemented policies, such as the proposed cut of indirect costs to 15%, would be &#8220;a near-existential threat to academic health research as we know it.&#8221; They also note that it is not clear how much, if any, of the proposed changes will stick. I&#8217;m still inclined to think that most of them won&#8217;t, but it&#8217;s a low-confidence opinion.</p><p>One way to think about the value of health research is through a cost-benefit lens. The authors point out that the scale of NIH funding, which is greater than any other funding body in the world enables large-scale projects and research infrastructure that wouldn&#8217;t be possible otherwise. Being a country that leads in such large-scale projects brings enormous benefits to the U.S. Publicly funded advances in vaccine technology and public health efforts for routine childhood vaccination <a href="https://pubmed.ncbi.nlm.nih.gov/24590750/">prevent</a> tens of thousands of deaths and tens of millions of illness over the lifetime of each year&#8217;s birth cohort, saving billions of dollars in societal costs.</p><p>But there is more to say in defense of academic health research than cost-benefit analyses. Galea and Bibbins-Domingo ask &#8220;How should we consider the value of academic health research, and perhaps, implicitly, of health itself?&#8221; Their answer is that the purpose of fostering health among all of our citizens is to give people the opportunity to have more agency over their lives:</p><blockquote><p>The point of this work is to allow people to be healthy so they can live, however they choose to. Hence, in a world of ideological and partisan division, health should be the ultimate nonpartisan good. We do the research we do so that more people can live long, healthy lives, and live them as they wish to live them.</p></blockquote><p>Those of us in academic health research should make it clear to our fellow citizens that we take this purpose seriously. We value the trust we&#8217;ve been given, largely through tax-payer dollars, to do work that is ultimately meant to make <em>everyone&#8217;s</em> lives better. As health researchers, regardless of what motivated us to go into science, the ultimate purpose of our work is to allow people and democratic society more broadly to thrive by reducing the burden of disease:</p><blockquote><p>If a country sees itself as a robust, vibrant, thriving, and growing country&#8212;as certainly the US of myth and nationalistic narrative suggests it to be&#8212;it requires a strong academic health research enterprise to allow it to inhabit that vision of itself. That should make academic health research as core to the national identity as our vision of a democratic country that permits and encourages self-determination.</p></blockquote><h2>2. The mystery of long distance enhancers deepens.</h2><p>Most of the functional genome consists of regulatory DNA, but exactly how regulatory DNA works is still one of the biggest mysteries in biology. Distal enhancers, which, in linear genomic sequence, are often located very far away from the genes they regulate, are among the most puzzling functional elements of the genome. We know a few things about them: they are bound by DNA-binding proteins, they are transcribed by RNA polymerase to produce enhancer RNAs (whose function, if any, is unclear), and their chromatin epigenetic state is often distinctly different from the DNA around it. </p><p>We also know that distal enhancers are typically very cell type-specific. Their specificity is why the same genome can specify the gene expression programs of the great diversity of cell types in the human body, which include motor neurons that transmit an electrical signal across the ~1 meter distance between the human foot and the base of the spine, and a ~100 &#181;m cardiomyocyte that keeps up an unrelenting rhythmic contraction every moment of our life. </p><p>But what exactly happens, biochemically, when an enhancer activates its target genes? This is still mysterious. What does an enhancer deliver to the target gene promoter? How close in 3D space, and for how long, does it need to be to deliver whatever it is that it delivers? Why do enhancers often skip by the nearest gene to target another, more distant one?</p><p>One source of clues about enhancers are cases when one long distance enhancer regulates more than one gene. In these cases, two different target genes seem to compete for the enhancer, suggesting that the enhancer can only do whatever it does at one gene at a time.</p><p>A <a href="https://genesdev.cshlp.org/content/39/5-6/348.full">new paper</a> in <em>Genes &amp; Development</em> casts doubt on that idea. The paper describes a single enhancer that regulates two functionally unrelated genes, and it can do so without any competition between the target gene promoters.</p><p>The study, led by Wouter de Laat of the University Medical Center Utrecht in the Netherlands, describes an impressive series of deletion experiments that tease out the role of this particular enhancer in shaping the local 3D genome structure, thereby allowing its two target genes to come into close proximity.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VfLU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VfLU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic 424w, https://substackcdn.com/image/fetch/$s_!VfLU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic 848w, https://substackcdn.com/image/fetch/$s_!VfLU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic 1272w, https://substackcdn.com/image/fetch/$s_!VfLU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VfLU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic" width="605" height="488" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:488,&quot;width&quot;:605,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:27114,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/158547810?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VfLU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic 424w, https://substackcdn.com/image/fetch/$s_!VfLU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic 848w, https://substackcdn.com/image/fetch/$s_!VfLU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic 1272w, https://substackcdn.com/image/fetch/$s_!VfLU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fb94ee-1166-4d38-badb-7c3d737d591e_605x488.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 3C of <a href="https://genesdev.cshlp.org/content/39/5-6/348.full">Huang, et al.  Genes &amp; Dev. 2025. 39: 348-363</a>, showing the impact of a CTCF deletion on enhnacer-promoter proximity. CC BY-NC 4.0</figcaption></figure></div><p>Those who care about the details should check out the paper, but I&#8217;ll just make two comments about this investigation into one of the most critical molecular processes at the foundation of life: First, the enhancer itself, and not just the normal CTCF/Cohesin machinery seems to be important for shaping the compact 3D genomic domain that brings the enhancer and its target genes close together. Not all enhancers do this, as far as we can tell. And second, there seems to be no competition between the enhancer&#8217;s target genes, meaning that whatever it is that this enhancer brings to a target gene promoter, there is enough of it to go around. When the investigators inhibited one target gene, there was no effect on the other, which is not what we expect if the gene promoters were competing for the enhancer&#8217;s attention.</p><p>This is further evidence that not all enhancers work in the same way, and that is what makes these regulatory DNA elements so mysterious. Enzymes aren&#8217;t like this - alcohol dehydrogenases all more or less work the same way (to the best of my knowledge). But &#8216;enhancers&#8217; may not be a natural biophysical category&#8211;different regulatory DNA sequences may have evolved to activate their target genes in different ways.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/this-weeks-finds-the-purpose-of-health?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/this-weeks-finds-the-purpose-of-health?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h2>3. Will GMO regulations slow down new therapeutic biotechnologies?</h2><p>The March 2025 issue of <em>Trends in Biotechnology </em>has several pieces reflecting on the 50-year legacy (or notoriety) of the <a href="https://en.wikipedia.org/wiki/Asilomar_Conference_on_Recombinant_DNA">Asilomar meeting</a> on the then-emerging technology of recombinant DNA. For some scientists, Asilomar is an example of the community showing the ability to regulate itself: the rapidly developing ability to move genes around among species had potentially catastrophic consequences (in the view of some), and so scientists agreed on a moratorium until the issues could be more carefully considered. Others consider Asilomar the epitome of scientific arrogance, because scientists assumed that they, and not legislators, should be the ones to decide how to handle the risks of recombinant DNA. (This is for another day, but I think neither of those positions is correct.)</p><p>A <a href="https://doi.org/10.1016/j.tibtech.2024.12.006">piece</a> by Hans-Georg Dederer, a law professor at the University of Passau, Germany, discusses how GMO regulations might inhibit medical advances based on new genetic technologies. He begins with a premise that I agree with (and, again, will discuss another day):</p><blockquote><p>The 1975 Asilomar conference contributed to the misperception that recombinant DNA (rDNA) technology is inherently risky to human health and the environment.</p></blockquote><p>Dederer argues that Asilomar confused the risks of a <em>process</em>&#8211;methods and technologies that involve recombinant DNA&#8211;with risks of specific <em>products</em> that should be evaluated on their own merits, and not by virtue of the fact that recombinant DNA was used to create them. He writes about how this confusion, a legacy of Asilomar, could obstruct the approval of new therapies based on CRISPR or other gene editing technologies, or pharmaceuticals that are produced by what Europe categorizes GMOs. While Europe has made some modifications to their laws to address concerns like this, Dederer argues that the law has not kept up with the rapid technological progress, and thus may work to hinder the arrival of say, new cancer therapies based on genetically engineered cells.</p><p>It&#8217;s a thought-provoking article that puts current debates within the larger historical context of post-Asilomar views on genetic engineering.</p><h2>4. Early origins of the human genome project</h2><p>Bob Waterston, former chair of the Department of Genetics at Washington University in St. Louis, was one of the early movers and shakers of the Human Genome Project. His insights were crucial to developing many of the important technologies that got the project off the ground, and his role is why we here at Washington University have a Genome Institute today. Waterston later moved from Washington University in St. Louis to the University of Washington in Seattle, where he became chair of the Department of Genome Sciences. (The fact that he was chair at both places does not help the common confusion of the University of Washington with Washington University.)</p><p>Waterston has a <a href="https://www.annualreviews.org/content/journals/10.1146/annurev-genom-101822-012945">memoir </a>out in the <em>Annual Review of Genomics and Human Genetics</em>. It&#8217;s an insightful and well-told account of how one of the most consequential efforts in biology came to be. If, like me, you&#8217;re inspired by frank accounts of how stuff really gets done in science, don&#8217;t miss this. Also, for a second perspective, I posted <a href="https://thisgenomiclife.substack.com/p/technologies-that-made-the-human">a transcript</a> of a talk by another of our former chairs, Mark Johnston, who was deeply involved in these early days of genomics.</p><h2>5. A Brief History of Intelligence</h2><p>I have been a <em>New York Times </em>subscriber since the 90&#8217;s. For most of that time, one of my favorite sections was the book review. But something there seems to have changed&#8211;these days I rarely find reviews of anything that interests me. Part of the problem is that they&#8217;ve turned half of the book review into a listicle section, and the other problem is that they now rarely (not never, just rarely) review serious books on science or intellectual history that connects to science. They also review basically no fiction that interests me. The whole book review section has become boring.</p><p>I now get my book review fix from <em>The New York Review of Books</em>, a list of interesting literature posters on X that I follow, and from recommendations on Substack. I found something interesting this week that I&#8217;ll pass along here&#8211;Max Bennett&#8217;s <em>A Brief History of Intelligence</em>. It came out in 2023, but if it was reviewed in the <em>NYT</em> I missed it. (And a search didn&#8217;t turn up anything.) I found the book due to a <a href="https://open.substack.com/pub/stetson/p/a-history-of-intelligence?r=bwtw3&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcomeOnShare=false">recommendation</a> by Stetson, author of the interesting Substack newsletter <em>Holodoxa</em>, and the economist Jason Furman:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rGOd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rGOd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic 424w, https://substackcdn.com/image/fetch/$s_!rGOd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic 848w, https://substackcdn.com/image/fetch/$s_!rGOd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic 1272w, https://substackcdn.com/image/fetch/$s_!rGOd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rGOd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic" width="440" height="663.5031847133758" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:947,&quot;width&quot;:628,&quot;resizeWidth&quot;:440,&quot;bytes&quot;:72635,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.thisgenomiclife.org/i/158547810?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rGOd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic 424w, https://substackcdn.com/image/fetch/$s_!rGOd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic 848w, https://substackcdn.com/image/fetch/$s_!rGOd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic 1272w, https://substackcdn.com/image/fetch/$s_!rGOd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe56b0379-7363-45f9-87a0-33ed9a0ef718_628x947.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here is the book&#8217;s description:</p><blockquote><p>Artificial intelligence entrepreneur Max Bennett chronicles the five &#8220;breakthroughs&#8221; in the evolution of human intelligence and reveals what brains of the past can tell us about the AI of tomorrow. In the last decade, capabilities of artificial intelligence that had long been the realm of science fiction have, for the first time, become our reality. AI is now able to produce original art, identify tumors in pictures, and even steer our cars. And yet, large gaps remain in what modern AI systems can achieve&#8212;indeed, human brains still easily perform intellectual feats that we can&#8217;t replicate in AI systems. How is it possible that AI can beat a grandmaster at chess but can&#8217;t effectively load a dishwasher?</p></blockquote><p>Max Bennett, the book&#8217;s author, is a tech entrepreneur and not a historian or scientist. I don&#8217;t usually buy big think books by people who aren&#8217;t scholars in the field, but I&#8217;m making an exception for AI. The field is moving really fast, and so much of the innovation is taking place outside of academia. Anyway, I&#8217;m on an AI reading binge. While I haven&#8217;t read this one yet, I thought the recommendation was worth passing on.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[What it means to stand up for science]]></title><description><![CDATA[My take on what we should stand for to maintain trust and increase prosperity.]]></description><link>https://www.thisgenomiclife.org/p/what-it-means-to-stand-up-for-science</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/what-it-means-to-stand-up-for-science</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 07 Mar 2025 23:07:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!kfvA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kfvA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kfvA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic 424w, https://substackcdn.com/image/fetch/$s_!kfvA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic 848w, https://substackcdn.com/image/fetch/$s_!kfvA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic 1272w, https://substackcdn.com/image/fetch/$s_!kfvA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kfvA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic" width="370" height="459.8758865248227" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1402,&quot;width&quot;:1128,&quot;resizeWidth&quot;:370,&quot;bytes&quot;:149672,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thisgenomiclife.substack.com/i/158618020?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kfvA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic 424w, https://substackcdn.com/image/fetch/$s_!kfvA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic 848w, https://substackcdn.com/image/fetch/$s_!kfvA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic 1272w, https://substackcdn.com/image/fetch/$s_!kfvA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d542975-25f6-42d0-bff0-4c1e55ddedcf_1128x1402.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Today there were nationwide &#8220;Stand Up for Science&#8221; rallies. The organization <a href="https://standupforscience2025.org/science-stories/">website</a> suggests that we explain why we stand up for science. Here is my take. </p><p>I studied for my PhD at the University of Rochester, whose motto is <em>meliora</em>, a Latin adjective that the University translates to mean &#8220;ever better&#8221;. When I first heard the motto, I associated it with the more familiar <em>ameliorate</em>&#8211;to make better. If I were to sum up what it means to stand up for science, it would be that motto: meliora. When we stand up for science, we should be standing up for a better life for all. More specifically, we should stand up for the following:</p><h3>1. Science as the long-term engine economic progress.</h3><p>200 years ago, most people lived on less than the modern equivalent of two dollars a day. The reason why more than 90% of us today are <em>not</em> that poor is because of science-driven technological progress that thoroughly transformed the material conditions of human life. No amount of education, hard work, free markets, or government reform would have achieved this transformation without scientific advances that increased crop yields, reduced the burden of disease, and are the basis of the materials and technologies involved in modern transportation and communication. We should stand up for the institutions and approach to knowledge that make up this engine of progress.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h3>2. Public trust that makes modern science possible.</h3><p>Those of us who work in science are able to do so thanks to decades of public trust underlying the state-funded work that is the seedbed of the scientific enterprise. That public trust is founded on the expectation of <em>meliora</em>&#8211;that what we do as scientists can, over the long run, make all of our fellow citizens&#8217; lives better. We can&#8217;t control the wise or unwise decisions of politicians that have the biggest impact on how broadly prosperity is shared. But as scientists we can stand up for the idea that our work is for everyone, regardless of partisan affiliation, for minorities and majorities. Standing up for that principle means acknowledging that people are different, and that, to take the example of health research, we do need to study differences between sexes, genders, races, etc. Everyone deserves evidence-based medicine.</p><h3>3. Honesty, rigor, and a commitment to evidence.</h3><p>The reason science has been the engine of progress is because, when its institutions are healthy, it is reliable. Those of us who work in science need to be absolutely clear about our commitment to the values that make science work. It means speaking out against fraud, arguing with but not canceling dissenting views, not shelving negative results, especially not with the excuse that we did so to avoid &#8220;weaponization&#8221; of the results, and not trying to pretend that low quality studies supporting our prior views are in fact strong evidence. It means not our pulling punches against cranks, like the one who now runs HHS, but it also means standing up against attempts to foist on science unreliable epistemic values that come from more radical corners of the humanities&#8211;such as <a href="https://www.science.org/doi/10.1126/science.adi9606">risible claims</a> that &#8220;indigenous&#8221; knowledge should be taught alongside science.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/what-it-means-to-stand-up-for-science?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/what-it-means-to-stand-up-for-science?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h3>4. A willingness to be accountable for our work.</h3><p>We should be willing to answer honest questions. Yes, the world is filled with bad-faith trolls who are trying to discredit science, and they should be treated as such. But the attitude we shouldn&#8217;t take is something I saw on a sign in a photo of a protest&#8211; &#8220;In science we trust.&#8221; &#8220;Trust the science&#8221; is too often used as a way to brush off criticism and avoid answering tough questions. While politicians, journalists, and the general public do not have our disciplinary expertise, we owe it to them to explain what we&#8217;re doing and take critical questions seriously. One of the worst recent examples is the effort to shut down debate over the origins of Covid. Philosopher Peter Godfrey-Smith <a href="https://petergodfreysmith.com/wp-content/uploads/2023/01/Climatic-and-Other-Catastrophes-PGS-2023-E.pdf">describes</a> what happened as &#8220;a degree of insistence and refusal to doubt that I think is very discouraging&#8230; an attempt to <em>pre-label </em>future dissent as incompetence or deception.&#8221; Every one of us with a Ph.D. in science was trained to answer tough questions. We owe the public more than just &#8220;trust the science.&#8221;</p><h3>5. Stand up for our students and colleagues.</h3><p>I am as much an unbeliever as Richard Dawkins, but I greatly appreciate my wife&#8217;s church. Each week, when the congregation is invited to come up for communion, the pastor finishes his lines by says &#8220;<em>all</em> are welcome.&#8221; And he means it. I&#8217;m sure many churches do this, but I know for a fact that a great many don&#8217;t. All should be welcome in science too, and we should stand up for them. I know we&#8217;re not going to fix misogyny and racism with platitudes, but at least as individuals we can make it clear that our student, technician, postdoc, and faculty colleagues belong here, that they are evaluated on how they do science and not on who they are. Almost everyone here could be making more money doing something else. From a purely mercenary perspective, we get a better scientific workforce when we draw talent from all across society. From a humane perspective, we should defend our community from malicious outside efforts to label our non-white colleagues as DEI hires, or to deny respect to our gay and trans colleagues.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Can generative AI write genomes on demand?]]></title><description><![CDATA[A reality check for generative DNA models]]></description><link>https://www.thisgenomiclife.org/p/can-generative-ai-write-genomes-or</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/can-generative-ai-write-genomes-or</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Tue, 04 Mar 2025 23:00:50 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!htfV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!htfV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!htfV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic 424w, https://substackcdn.com/image/fetch/$s_!htfV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic 848w, https://substackcdn.com/image/fetch/$s_!htfV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic 1272w, https://substackcdn.com/image/fetch/$s_!htfV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!htfV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic" width="554" height="552.083044982699" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1152,&quot;width&quot;:1156,&quot;resizeWidth&quot;:554,&quot;bytes&quot;:155036,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thisgenomiclife.substack.com/i/158372065?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!htfV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic 424w, https://substackcdn.com/image/fetch/$s_!htfV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic 848w, https://substackcdn.com/image/fetch/$s_!htfV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic 1272w, https://substackcdn.com/image/fetch/$s_!htfV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf789faa-f898-459b-bfca-b2b6e6c92209_1156x1152.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Not very realistic AI-generated engine blueprint</figcaption></figure></div><p>If you <a href="https://press.asimov.com/articles/evo-2">scanned</a> the headlines of some leading science outlets the other week, you could be forgiven for thinking that the generative AI can now design a working genome. Here is <a href="https://www.nature.com/articles/d41586-025-00531-3">a story</a> in <em>Nature</em> describing the <a href="https://arcinstitute.org/news/blog/evo2">release</a> the Arc Institute and NVIDIA&#8217;s&#8217;s <a href="https://www.biorxiv.org/content/10.1101/2025.02.18.638918v1">new foundation model</a>, Evo 2:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DM7q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DM7q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic 424w, https://substackcdn.com/image/fetch/$s_!DM7q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic 848w, https://substackcdn.com/image/fetch/$s_!DM7q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic 1272w, https://substackcdn.com/image/fetch/$s_!DM7q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DM7q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic" width="1456" height="1072" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1072,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:70754,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://thisgenomiclife.substack.com/i/158372065?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!DM7q!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic 424w, https://substackcdn.com/image/fetch/$s_!DM7q!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic 848w, https://substackcdn.com/image/fetch/$s_!DM7q!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic 1272w, https://substackcdn.com/image/fetch/$s_!DM7q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3dbfb18f-1cff-4754-b9a6-0038a9c1419c_1472x1084.heic 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This story reached beyond the bounds of the scientific community and was even mentioned on <em>The Ezra Klein Show</em> this week. But can Evo 2 (or any generative AI) design completely new genomes that actually <em>function</em>? The answer is that we have no idea. Nobody has shown it yet.</p><p>I&#8217;m not writing this post to pick on Evo 2. Brian Hie, Patrick Hsu, Hani Goodzari and the Arc Institute team (working with NVIDIA) are pushing the envelope of genomic foundation models. Their innovations in model design are described in a <a href="https://arcinstitute.org/manuscripts/Evo2-ML">companion paper</a> to Evo 2. I am enthusiastic about this work. But we need to be clear: while ever larger generative AI models (40 billion parameters for Evo 2) can natural-looking sequences at scale, there is very little evidence&#8211;and in some cases none at all&#8211;showing that these sequences function as expected. The rate-limiting step in genome design is still the experiment.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><p>So why are we hearing claims that generative models are writing genes and genomes? At this point, the field largely relies on comparisons to existing datasets, rather than a thorough program of experimental testing. In the case of Evo 2, the team prompted the model with various sequences,such as a bacterial genome, and then examined the generated sequence to see how well it resembled natural sequences. In the case of the generated bacterial genome (prompted with the very small <em>M. genitalium</em> genome), the authors show that a model-generated 580 kb sequence has a gene density resembling the natural  genome, and that the synthetic gene sequences show structural matches to natural enzyme genes. </p><p>But could this genome function coherently, with properly regulated, fully-functioning genes? We won&#8217;t know until someone synthesizes the entire thing and puts it into a bacterial cell. The experiment is feasible with today&#8217;s technology (actually with <a href="https://pubmed.ncbi.nlm.nih.gov/18218864/">2008 technology</a>), but it is resource intensive. My intuition is that no generated genome will actually function without a lot more tinkering to make it work, or without training models in a different way (more on that below). I could be wrong! Be we don&#8217;t know because the experiment hasn&#8217;t been done. </p><p>Here is a more specific example of one of the problems. The Arc team had Evo 2 generate a mitochondrial genome. For those not familiar with mitochondrial genomes, they are independent, mini genomes present in the eukaryotic cells&#8217; mitochondria. They encode a handful of genes, many of which work in complexes. </p><p>Evo 2 generated a set of mitochondrial genomes that seemed to have the right complement of genes. But there are three issues. First, we don&#8217;t know which of the generated genes are actually functional. They resemble the natural mitochondrial genes, but they may have variation in them that leave them inactive. Second, regulation of genes in the very compact mitochondrial genome is complex, and we have no idea whether Evo 2 produced functional regulatory sequences. And third, even if we assume that all these genes are indeed functional, each one most closely resembles a natural gene from a different species, as seen in Table S6 of the paper:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UoVd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UoVd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic 424w, https://substackcdn.com/image/fetch/$s_!UoVd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic 848w, https://substackcdn.com/image/fetch/$s_!UoVd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic 1272w, https://substackcdn.com/image/fetch/$s_!UoVd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UoVd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic" width="1456" height="638" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:638,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:92696,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://thisgenomiclife.substack.com/i/158372065?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UoVd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic 424w, https://substackcdn.com/image/fetch/$s_!UoVd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic 848w, https://substackcdn.com/image/fetch/$s_!UoVd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic 1272w, https://substackcdn.com/image/fetch/$s_!UoVd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2eab174-648b-4cf5-96b0-a158a99419e7_1478x648.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Those species are the muntjac (a deer native to China), a type of mouse-eared bat, a species of labeonins (freshwater fish), a harbor porpoise, a moose, a yak, the Balkhash marinka (freshwater fish), the Sambar deer, the domestic sheep, a species of antelope, the zebu (a type of cow), a species of eelpouts (marine fish), and a species of carp found in Thailand. While these are all vertebrates, and mitochondrially encoded genes are generally well-conserved, I am very skeptical that you can put together a functioning <a href="https://en.wikipedia.org/wiki/Electron_transport_chain">electron transport chain</a> with a mix of fish and mammalian genes. </p><p>The problem is that Evo 2, trained on over 8 trillion bases of sequences from many species, may not have learned how different members of a complex within a <em>single</em> species genome work together. Yes, Evo 2 learned which genes go together in the mitochondrially encoded genome. But the 45 carp genes for mitochondria Complex I have evolved to work with each other in carp cells, and the 45 harbor porpoise genes evolved to work each other in porpoise cells. The carp genes did not evolve to work with the porpoise genes-and Evo 2 probably doesn&#8217;t know that.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/can-generative-ai-write-genomes-or?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/can-generative-ai-write-genomes-or?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p>Again, I could be wrong, but until the experiment is done, we do not know how realistic the genomes generated by Evo 2 are. They could be genuinely functional, or they could be like the non-sensical AI-generated engine blueprint up at the top of this post. We just don&#8217;t know until we&#8217;ve done the experiments.</p><p>To be fair, the Evo 2 team is working on a set of experiments, and in their <a href="https://www.science.org/doi/10.1126/science.ado9336">Evo 1 paper</a> last fall, they tested some designed CRISPR-Cas complexes and a new transposable system. But AI-designed sequences at the scale of even a small genome have not yet been tested.</p><p>One of most successful gene designers is the recent Nobel laureate David Baker, who has spent decades investing in tools for both careful design and experimental synthesis and testing. In their most <a href="https://www.biorxiv.org/content/10.1101/2024.08.29.610300v4">recent preprint</a>, the Baker lab tested their AI-designed peptide inhibitors of the proto-oncogene Ras. They characterized the designed proteins <em>in vitro</em>, and showed that these inhibitors bound their targets in live cells. </p><p>Thorough experimental testing needs to become the standard in this field. I was surprised to see <a href="https://genome.cshlp.org/content/34/9/1411.abstract">a paper</a> last fall in <em>Genome Research</em>, which claimed to design realistic sequences but did not show that any of them function. Again, this paper was from a great team (at Genentech) whose work I admire, and I understand the value of papers that report innovation in model architecture and training. There are non-trivial computational problems to solve. But without experiments, we do not know whether these models work at all.</p><p>I suspect that large models trained on large sequence datasets will not be enough. There are real efforts to design and physically build synthetic genomes, the most ambitious of which is the <a href="https://syntheticyeast.github.io">yeast Sc2.0 project</a>. It has taken years of debugging and re-synthesis to build a synthetic yeast genome, which is, by eukaryotic standards, pretty small. AI design will require <em>a lot</em> more human intervention to be successful, at least in the near future. Successful design will also require skilled experimentalists who can show that the designed sequences work.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[This week's finds in genomic life: Genomes and autoencoders, regulating AI in medicine , and more]]></title><description><![CDATA[Five links for the weekend on AI, biomedicine, and the NIH budget.]]></description><link>https://www.thisgenomiclife.org/p/weekend-links-the-genome-as-autoencoder</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/weekend-links-the-genome-as-autoencoder</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 21 Feb 2025 13:02:44 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!IDC_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>1. The genome as a generative model</h2><p>Understanding how genotype relates to phenotype has been one of the fundamental challenges in biology for more than a century. More explicitly, the challenge is to understand how the linear DNA sequence of the genome encodes the necessary information to robustly generate an adult organism, and how variation in DNA sequences leads to differences between members of the same species.</p><p>Kevin Mitchell (at Trinity College Dublin, and author of multiple books on neurobiology, including most recently <em><a href="https://www.kjmitchell.com/books">Free Agents</a></em>) and Nick Cheney (at the University of Vermont) have a <a href="https://doi.org/10.1016/j.tig.2025.01.008">thought-provoking piece</a> in <em>Trends in Genetics</em> that addresses these questions by working out an extended analogy between the genome and generative neural networks, specifically <a href="https://www.ibm.com/think/topics/variational-autoencoder">variational autoencoders</a> (VAEs). I like this comparison because it is a very specific way of seeing the genome as a compressed representation of an organism that is decoded during development.</p><p>To motivate their argument, Mitchell and Cheney point out the flaws in the more common metaphors of the genome&#8211;as a program, a recipe, or a blueprint. Here they are on the genome as a blueprint:</p><blockquote><p>First, an architectural or engineering blueprint is isomorphic with the desired product, that is, distinct parts of the blueprint correspond directly and specifically to distinct parts of the product. In this way, the blueprint concept is almost preformationist, with the genome containing a direct mapping of the final product. Second, a blueprint does not usually contain instructions on how to build the object in question, it only has information on what it should look like when completed. This clearly leaves a major question unanswered: how are the processes of development specified so as to yield the desired outcome? And finally, a blueprint typically specifies an object in such detail as to be effectively deterministic, leaving little room for the kind of variability in developmental trajectories and outcomes that is typically observed even in genetically identical organisms raised in highly controlled, effectively identical environments.</p></blockquote><p>A vivid example of the genome as a compressed representation is the single-celled zygote, the state in which we all began.</p><p>What does the genome encode directly? Sequences of proteins and functional RNAs, as well as regulatory elements (whose encoding in DNA sequences is still not well understood). In all of these sequences, Mitchell and Cheney say that the key feature that is encoded is &#8220;differential affinities.&#8221; Proteins and RNAs function by forming selective interactions with other proteins, RNAs, DNA, and small molecules. Back in a <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC528649/">1958 paper</a>, a few years before the genetic code was cracked, D.L. Nanney made a similar argument, that the genome is a &#8220;library of specificities.&#8221;</p><p>Mitchell and Cheney argue that, like a VAE which reconstructs new images or other outputs by running the compressed information through a series of higher dimensional neural network layers, genomic information is realized through a succession of cell types as development and differentiation occur. I haven&#8217;t considered this enough, but I&#8217;m not sure how induced pluripotent stems cells and direct reprogramming of one type of cell into another fit into the analogy of a compressed representation being decoded through a series of developmental states. With direct reprogramming, you skip directly from one state to another. </p><p>Mitchell and Cheney do make it clear that they&#8217;re not arguing for a perfect analogy between any kind of neural network and development. The real strength of their conception is that it lets them set up a more specific, formalized model.</p><p>There is a lot more in the piece, including a section on evolution as the encoder in the VAE analogy. It&#8217;s worth a read for anyone interested in ways to conceptualize the relationship between genotypes and phenotypes.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h2>2. Why different mutations in the same transcription factor genes have such different clinical outcomes</h2><p>Speaking of cell biology being largely about differential affinities and specificities, a <a href="https://doi.org/10.1016/j.ajhg.2024.12.012">new paper</a> in the <em>American Journal of Human Genetics</em> shows that the phenotypic impact of mutations in the gene for the transcription factor BCL11B corresponds to whether those mutations alter the affinity or specificity of this protein&#8217;s binding to DNA.</p><p>Mutations in BCL11B cause a variety of developmental syndromes that feature cognitive, morphological, and immunological problems. The challenge is that, since all of these mutations occur in the same gene, it is very hard to predict which mutations will have severe versus mild outcomes. </p><p>The scientists analyzed pathogenic or likely pathogenic BCL11B mutations from 92 patients, and found that they could group patients into three clinical subtypes based on where in the gene the mutation occurred. Interestingly, mutations at positions in BCL11B that contact DNA, and which are responsible for its DNA binding affinity or its specificity, resulted in the more severe and variable clinical outcomes. Mutations elsewhere in the protein were relatively mild. </p><p>The results indicate that the most damaging mutations are those that change the ability of BCL11B to target the right set of genes, rather than simply attenuating its ability to activate genes. This alone isn&#8217;t so surprising: if you fail to turn on the right genes, or turn on the wrong ones in a particular context, bad things happen. What this paper shows, and what I think is less appreciated, is that different mutations in the same transcription factor will <em>not</em> affect all of its target genes in the same way. The different clinical outcomes probably reflect the fact that each mutation affects different sets of targets, with different biological consequences. (We made a similar argument in a <a href="https://pubmed.ncbi.nlm.nih.gov/38355306/">paper</a> last year.)</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IDC_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IDC_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic 424w, https://substackcdn.com/image/fetch/$s_!IDC_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic 848w, https://substackcdn.com/image/fetch/$s_!IDC_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic 1272w, https://substackcdn.com/image/fetch/$s_!IDC_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IDC_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic" width="1456" height="908" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:908,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:196835,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://thisgenomiclife.substack.com/i/156859678?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IDC_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic 424w, https://substackcdn.com/image/fetch/$s_!IDC_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic 848w, https://substackcdn.com/image/fetch/$s_!IDC_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic 1272w, https://substackcdn.com/image/fetch/$s_!IDC_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1277ecef-b82e-467d-98db-00069962cff4_2894x1804.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 1, showing patient mutations in different regions of BCL11B, of Lessel, <em>et al.</em> <em><a href="https://doi.org/10.1016/j.ajhg.2024.12.012">AJHG</a></em><a href="https://doi.org/10.1016/j.ajhg.2024.12.012"> 112:394-413 (2025)</a>, CC BY-NC-ND 4.0</figcaption></figure></div><h2>3. Regulating AI in medicine</h2><p>Former FDA Director Scott Gottlieb has a written a <a href="https://pubmed.ncbi.nlm.nih.gov/39913129/">thoughtful piece</a> about how AI in medicine should be regulated. It&#8217;s obvious to everyone that the incorporation of AI into health care is inevitable, because there are clearly enormous potential benefits, but there are also enormous risks. The issue facing everyone, but especially the FDA as a regulatory agency, is how to balance the risks.</p><p>Gottlieb is worried that AI tools &#8220;with advanced analytical capabilities&#8221; will be automatically classed as medical devices by the FDA, under recent change in its guidelines. Why is that a problem? Because it would prevent new AI capabilities from being incorporated as add-ons to existing software tools, like electronic medical record (EMR) systems or other software intended to support clinical decision making. If an AI tools is classified as a medical device, that means EMR developers would be incentivized to exclude them, because it brings a higher level of regulation to bear on the entire software suite:</p><blockquote><p>If these tools are classified as medical devices merely because they draw from multiple data sources or possess analytical capabilities that are so comprehensive and intelligent that clinicians are likely to accept their analyses in full, then nearly any AI tool embedded in an EMR could fall under regulation. The risk is that EMR developers may attempt to circumvent regulatory uncertainty by omitting these features from their software. This could deny health care clinicians access to AI tools that have the potential to transform the productivity and safety of medical care.</p></blockquote><p>Gottlieb notes that the 2016 21st Century Cures Act was written to avoid exactly this kind of outcome, in which digital health tools are regulated not based on their clinical use but rather their analytical sophistication. AI tools that don&#8217;t make autonomous diagnoses or treatment decisions shouldn&#8217;t be treated as medical devices.</p><p>The rapid growth of EMRs in clinical practice has opened up a lot of potential to improve health care, by letting different doctors for the same patient easily share information, and by facilitating medical research with very large cohorts and high statistical power. The ability of AI tools to make recommendations based on the synthesis of different data sources is another way EMRs could lead to better care. That will only happen if the incentives are there to develop these tools.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/weekend-links-the-genome-as-autoencoder?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/weekend-links-the-genome-as-autoencoder?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h2>4. What&#8217;s the return on investment for the NIH budget?</h2><p>The NIH has been a target for the new administration&#8217;s firing and spending freezes. Despite a court ruling that the NIH needs to spend the money appropriated by Congress, the works are still gummed up and study sections have by and large not been able to review new grant submissions. This is a frustrating and completely unnecessary self-own; the U.S. benefits by being the place where much of the world&#8217;s biotechnology is invented. (Though there are ways we could benefit more&#8211;choices made by our government have left us with higher drug prices than elsewhere.)</p><p>Any time the NIH or NSF or NASA budgets come up, the perennial question returns to the discourse: what do we get for our research dollars? One of the best places to find an answer to that question is at Matt Clancy&#8217;s New Things Under the Sun Substack page, where he has a <a href="https://www.newthingsunderthesun.com/pub/d4ggviu4/release/1">continuously updated piece</a> on the ROI of government spending on R&amp;D. His answer:</p><blockquote><p>This is very challenging to estimate, but a variety of research points to an additional dollar of government sponsored R&amp;D generating $2-$5 in benefits via economic growth.</p></blockquote><p>If you want the details of the calculation, with references to the literature on this subject, check it out. </p><p>As someone who has received multiple NIH grants, there are a few points about these grants I think are worth highlighting:</p><ol><li><p>I, and most colleagues that I am aware of, value the trust placed in us by the taxpayers whose money funds our labs. The NIH funds our work in order to benefit <em>everyone</em> through advances in knowledge and technology, and I take that seriously. We all should&#8211;the knowledge generated by science should benefit everyone, regardless of partisan affiliation, identity, etc.</p></li><li><p>Grant budgets are tight. There is not a lot of slack in a standard NIH R01. You need at least two funded R01s to run a modestly-sized lab these days. It does not cover all of costs of research, some of which are in fact borne by the universities and hospitals we work for. This is not intended as a complaint. The point is that government-funded scientists have very strong incentives to spend their grant money efficiently, because if you waste it you won&#8217;t have enough to accomplish your scientific aims.</p></li><li><p>Over the decades, the U.S. has developed world-leading scientific research institutions, and, even though there are some aspects many of us would reform, we are still the place where scientists from around the world want to work. We have our edge in biomedical research thanks to a longstanding, bipartisan commitment to government-funded science. This is an enormous national resource that we shouldn&#8217;t recklessly discard, and we shouldn&#8217;t assume it can easily be rebuilt if we break it through ignorance and carelessness.</p></li></ol><p>On the related issue of indirect costs paid by the NIH to universities, Harvard geneticist Sasha Gusev, who write <em>The Infinitesimal</em>, has a <a href="https://open.substack.com/pub/theinfinitesimal/p/a-faq-on-federal-research-cuts?r=bwtw3&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcomeOnShare=false">detailed discussion</a> of indirect costs and much more. Indirect costs have been a big source of misunderstanding:</p><blockquote><p>A research contract is work: when the government hires General Dynamics to build a jet, they pay for the hangar; and when the government hires a university to study cancer, they pay for the lab space.</p></blockquote><p>In his conclusion, which I agree with, he doesn&#8217;t mince words:</p><blockquote><p>What happens while everything is shut down? Excellent funding applications that took months to develop are summarily rejected because their mechanism was cancelled; high-scoring grants are passed over for funding because the review council cannot meet; effective drugs are delayed for approval because the FDA has a staff shortage; longitudinal cohorts that have been collected for decades collapse and their data becomes polluted; and &#8212; most importantly &#8212; the next generation of talented people who planned to pursue research for the public good are cruelly let go or not hired and drift away. In most cases we may not know for years that a study or a dataset was important and should not have been abandoned. And because the whole thing is being conducted without rigor, even when cuts increase efficiency they will be confounded by all the other functions grinding to a halt.</p></blockquote><h2>5.  What is convolution and why is it important in deep learning?</h2><p>Convolutional neural networks are everywhere these days, and they often have impressive predictive power compared to more standard statistical models, like linear regression. So why is a <em>convolutional</em> neural network so special? If you&#8217;ve wondered just what convolution is, Christopher Olah has a superb series of blog posts on the concepts behind neural networks, including <a href="http://colah.github.io/posts/2014-07-Understanding-Convolutions/">convolution</a>. It was written in 2014, but it&#8217;s still a great explanation of the idea. </p><p>If you&#8217;ve had any statistics, you are probably familiar with some of the ideas behind convolution. If you have two random variables, each with their own probability density function, what is the probability density function of the product of these two variables? Let&#8217;s take a discrete case: if I roll two dice, a six-sided die and twenty-sided die, what is the probability that the product of the two rolls is 18? Well, if you roll a 1 with the first die, then you only get 18 if you roll 18 on the second die. But you could also get 18 by rolling a 2 and a 9, etc. You get the idea. To see why this matters, check out Olah&#8217;s post.</p><p></p><p>OK, there was one other piece of big news this week in the world of AI and biology. The Arc Institute and Nvidia <a href="https://arcinstitute.org/tools/evo/evo-designer">announced</a> a new foundation model, Evo-2. I&#8217;ll have some things to say about it next week, along with some commentary about what standards we should have for judging the performance of generative DNA models. Have a great weekend.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[A Medical Milestone: The First Non-Opioid Drug for Severe Pain ]]></title><description><![CDATA[How publicly-funded science, pharmaceutical companies, and the FDA brought us a new first-in-class non-opioid analgesic]]></description><link>https://www.thisgenomiclife.org/p/the-first-non-opioid-drug-for-severe</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/the-first-non-opioid-drug-for-severe</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Tue, 18 Feb 2025 13:01:31 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!AYFy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AYFy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AYFy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic 424w, https://substackcdn.com/image/fetch/$s_!AYFy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic 848w, https://substackcdn.com/image/fetch/$s_!AYFy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic 1272w, https://substackcdn.com/image/fetch/$s_!AYFy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AYFy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic" width="570" height="557.2767857142857" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:657,&quot;width&quot;:672,&quot;resizeWidth&quot;:570,&quot;bytes&quot;:106215,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AYFy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic 424w, https://substackcdn.com/image/fetch/$s_!AYFy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic 848w, https://substackcdn.com/image/fetch/$s_!AYFy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic 1272w, https://substackcdn.com/image/fetch/$s_!AYFy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21ca007a-a5df-40d5-be4c-b3e6798a7e7b_672x657.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://alphafold.ebi.ac.uk/entry/Q9Y5Y9">Alphafold structure</a> of the sodium channel alpha subunit NaV 1.8</figcaption></figure></div><p>Why was there an opioid addiction epidemic in America? <a href="https://www.nytimes.com/2020/10/21/health/purdue-opioids-criminal-charges.html?unlocked_article_code=1.xE4.RRgO.YAAD2XGZdm6X&amp;smid=url-share">One cause</a> was Purdue Pharma&#8217;s aggressive efforts to convince physicians to freely prescribe a drug that the company knew was addictive. But another cause is that treating pain effectively is an unsolved problem. Millions of Americans suffer from not only acute pain, after surgery or an injury, but also debilitating <a href="https://pubmed.ncbi.nlm.nih.gov/20797916/">chronic pain</a>. For awhile, I was one of them. Terrible ergonomic habits at work caused me <a href="http://goodmenproject.com/health/500-million-years-of-pain/">to develop</a> severe neck and shoulder pain that lasted for years. If you had asked me, I would have rated my pain as at least a six out of ten on most days, and it was rarely better than that. Chronic pain is not just uncomfortable but distracting, making it difficult to focus on anything. I suffered, but my productivity also suffered, as it does for many people with chronic pain. Across the U.S., according to a <a href="https://pubmed.ncbi.nlm.nih.gov/22607834/">2012 study</a>, the total annual cost of untreated pain, including both health care costs and lost productivity, is more than half a trillion dollars.</p><p>For a long time, the only drugs on the market to treat severe pain were opioids. Because they can be addictive, and because they lose their efficacy over time as people develop tolerance, opioids are not a solution for chronic pain, but people suffering from chronic pain take them anyway. I was once prescribed opioids after an outpatient procedure. I didn&#8217;t need them to recover from the procedure, but I took them anyway to get some relief from my incessant neck and shoulder pain. I stopped taking opioids when my prescription ran out, but others aren&#8217;t so lucky. In <a href="https://pubmed.ncbi.nlm.nih.gov/31349994/">one study</a> of Medicare patients, 10% continued to persistently use opioids after receiving a short-term prescription with a surgery. When you combine an unmet need for pain treatment with lax prescribing practices for an addictive drug, you get the origins of an opioid epidemic.</p><p>America&#8217;s opioid epidemic, now fueled by cartel-produced fentanyl, has developed beyond its origins in the abuse of prescription pain medicine. But the need for better drugs to treat severe pain is still unmet. That is why the FDA&#8217;s <a href="https://www.fda.gov/news-events/press-announcements/fda-approves-novel-non-opioid-treatment-moderate-severe-acute-pain">first approval</a> of a non-opioid drug for severe pain is an impotant milestone. Approved at the end of January, suzetrigine (commercial name is Journavx, don&#8217;t ask me how to pronounce it) is a first-in-class, non-opioid analgesic from Vertex Pharmaceuticals, the company known for its breakthrough cystic fibrosis drugs. Suzetrigine was <a href="https://pubmed.ncbi.nlm.nih.gov/37530822/">found to be effective</a> for treating pain in two randomized, double-blind trials of patients who had undergone a surgical procedure. These trials looked at efficacy for acute, post-surgical pain, but ongoing trials will determine whether the drug is safe and effective for other indications, such as diabetic neuropathy and other forms of chronic pain. If it is, then this drug, and <a href="https://pubmed.ncbi.nlm.nih.gov/39835903/">likely others</a> in the pipeline, will transform how we treat pain. </p><p>Reaching this milestone is something that could only<em> </em>be accomplished on a deep foundation of neurobiology built with decades of publicly-funded research. The story behind Suzetrigine shows how enterprising and forward-looking industry research programs, working in tandem with academic scientists, can turn scientific knowledge into therapeutics that make people&#8217;s lives better.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h2>Blocking pain before it reaches the brain</h2><p>Suzetrigine is a small molecule that works by inhibiting a voltage-gated sodium channel called Na<sub>V</sub>1.8. Sodium channels mediate the electrical signaling of neurons by allowing sodium ions to pass through the cell membrane. (Wikipedia has a <a href="https://en.wikipedia.org/wiki/Action_potential">nice primer</a> if you need it.) By blocking the action of Na<sub>V</sub>1.8 sodium channels, Suzetrigine prevents pain signals from being transmitted to the brain.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!63R_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!63R_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic 424w, https://substackcdn.com/image/fetch/$s_!63R_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic 848w, https://substackcdn.com/image/fetch/$s_!63R_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic 1272w, https://substackcdn.com/image/fetch/$s_!63R_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!63R_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic" width="422" height="294" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:294,&quot;width&quot;:422,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:10134,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!63R_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic 424w, https://substackcdn.com/image/fetch/$s_!63R_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic 848w, https://substackcdn.com/image/fetch/$s_!63R_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic 1272w, https://substackcdn.com/image/fetch/$s_!63R_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ab8fe0d-6bb4-44e5-b92f-3d8a87ad0178_422x294.heic 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Suzetrigine from <a href="https://pubchem.ncbi.nlm.nih.gov/compound/Suzetrigine">PubChem</a></strong></figcaption></figure></div><p>This sounds simple, but the contrast with opioids illustrates the challenge of inhibiting the neuronal signaling in very specific ways. Opioids <a href="https://www.nejm.org/doi/full/10.1056/NEJMra1507771">act on &#181;-opioid receptors</a>, many of which are located in pain-processing regions of the brain. Unfortunately, these receptors are also present in the reward systems of the brain, as well as in the brain stem, which controls our breathing, and in the small intestine. Hence, opioids not only affect our sensation of pain, but they cause euphoria, produce constipation, and depress breathing. In other words, opioids are dangerous because they aren&#8217;t <em>selective</em> enough for those parts of our nervous system responsible for signaling pain. </p><p>The history of the past few decades of pharmaceutical research for analgesics is basically a quest for drugs that are selective inhibitors of pain signaling. Suzetrigine is a medical milestone because it is <a href="https://pubmed.ncbi.nlm.nih.gov/39322410/">highly selective</a> for Na<sub>V</sub>1.8 sodium channels, which are almost exclusively present in the peripheral nervous system, and not the brain. By inhibiting sodium channels in the peripheral nervous system, primarily at the <a href="https://www.ncbi.nlm.nih.gov/books/NBK532291/">dorsal root ganglion</a> where the peripheral nervous system connects with the spinal cord, Suzetrigine blocks pain signals before they reach the central nervous system. The drug thus leaves the pain and reward centers of the brain unaffected, and avoids the side effects that make opioids so dangerous.</p><p>Before we get into the scientific backstory, it is important to emphasize another important contrast between Suzetrigine and opioids. This is the difference between drugs that are developed to be <em>specific</em> using the molecular knowledge of modern science, and <em>non-specific</em> drugs based on a phenomenological understanding of medicines common in pre-scientific thought. 8000-year-old cuneiform tablets from Sumer <a href="https://www.neurology.org/doi/10.1212/WNL.92.15_supplement.P4.9-055">describe the use of opium</a> as medicine. For most of human existence, people discovered and used plant products as medical treatments without knowing anything at all about how they worked. While there are undoubtedly useful medicines that were discovered this way, all drug action happens at the molecular level. It is <em>much</em> more effective to develop drugs based on a molecular-level understanding of biology. </p><p>This may seem obvious, but we live in a society in which loud so-called health influencers with big platforms want to re-mystify biology and medicine <a href="https://open.substack.com/pub/pauloffit/p/understanding-rfk-jr?r=bwtw3&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcomeOnShare=false">by invoking pre-scientific theories</a> like miasmas. Just like you can&#8217;t fix a car that won&#8217;t start unless you know the parts of the ignition system, we can&#8217;t develop medicines that address challenging medical conditions without detailed knowledge of the specific molecular interactions at the foundation of biology. <em>The knowledge of those molecular interactions is the product of decades of publicly-funded research.</em> In the rest of this post, I will walk through the important discoveries that made Vertex Pharmaceuticals&#8217; success possible.</p><h2>Discovery 1: Sodium channels</h2><p>Alan Hodgkin and Andrew Huxley, at the University of Cambridge, <a href="https://physoc.onlinelibrary.wiley.com/doi/abs/10.1113/jphysiol.1952.sp004717?sid=nlm%3Apubmed">discovered</a> the existence of transmembrane sodium currents in neurons in 1952. They received the 1963 Nobel Prize for their fundamental work on the action potential of neurons. Bertile Hille (University of Washington) and Clay Armstrong (University of Rochester) began <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC3424717/">characterizing the propertie</a>s of sodium channels in the 1960&#8217;s and 70&#8217;s, including how these channels were acted on by anesthetics. Daniel Beneski and William Catterall (University of Washington, supported by NIH grant HL-22234 and an NIH postdoctoral fellowship) <a href="https://pubmed.ncbi.nlm.nih.gov/6928649/">characterized </a>the protein components of sodium channels. </p><p>Over the subsequent decades, researchers discovered different subtypes of sodium channels, eventually identifying nine in humans. The gene for the subtype targeted by Suzetrigine, Na<sub>V</sub>1.8, was cloned from rat neural tissues in 1996 by two groups, one at <a href="https://pubmed.ncbi.nlm.nih.gov/8538791/">University College London</a> (funded by the Welcome Trust) and the other at <a href="https://pubmed.ncbi.nlm.nih.gov/8626372/">Roche</a>. Sodium channels are critical throughout the central and peripheral nervous systems, and different subtypes are expressed in different neural tissues. The discovery of Na<sub>V</sub>1.8, not expressed in the brain but rather in the peripheral nervous system, was an important advance.</p><h2>Discovery 2: Link between Na<sub>V</sub>1.8 and pain</h2><p>In 1999, the University College London group (led by John Wood) <a href="https://pubmed.ncbi.nlm.nih.gov/10448219/">showed</a> that mice missing the gene for Na<sub>V</sub>1.8 were less sensitive to pain. A University of Arizona group, led by John Hunter, <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC33594/">showed</a> a similar effect in rats. Three years later, this same group, in collaboration with Roche, <a href="https://pubmed.ncbi.nlm.nih.gov/11790477/">showed</a> that a suppressing the production of  the Na<sub>V</sub>1.8 with an <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC7355792/">anti-sense oligonucleotide</a> induced resistance to pain in rats. Further establishing the link between Na<sub>V</sub>1.8 and pain, an international group led by Stephen Waxman at Yale <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC3511073/">discovered</a> that some people with painful neuropathies cary gain-of-function mutations in Na<sub>V</sub>1.8 (funded in part by Pfizer and by the U.S. Department of Veterans Affairs). Many other academic groups further characterized the role of Na<sub>V</sub>1.8 in pain signals in the peripheral nervous system, such as <a href="https://pubmed.ncbi.nlm.nih.gov/28450535/">work</a> led by Richard Carr and Matthias Ringkamp at Johns Hopkins University and the University of Heidelberg (supported by NIH grants R01NS09722 and F32DA036991.)</p><p>With the link between pain sensation and Na<sub>V</sub>1.8 established, <a href="https://pubmed.ncbi.nlm.nih.gov/17483457/">Abbott laboratories</a>, <a href="https://pubmed.ncbi.nlm.nih.gov/25625641/">Pfizer</a>, and others, developed molecules that specifically targeted Na<sub>V</sub>1.8, in the hope that these could lead to new analgesic drugs.</p><h2>Discovery 3: Na<sub>V</sub>1.8 is expressed only in the peripheral nervous system</h2><p>The critical piece of the success of Suzetrigine is that it selectively targets a sodium channel that is not expressed at significant levels in the brain or other non-targeted organs like the heart. It is this feature of Na<sub>V</sub>1.8 that made it the focus decades of research by both pharma and academic scientists, who knew years ago that a drug that could successfully inhibit only Na<sub>V</sub>1.8 would almost certainly avoid the dangerous side effects of opioids.</p><p>The specific expression of Na<sub>V</sub>1.8 in the peripheral nervous system was established early on. But two important publicly funded human expression atlas projects were cited by Vertex in their clinical trial report as evidence of the expression pattern of Na<sub>V</sub>1.8: the <a href="https://www.proteinatlas.org/ENSG00000185313-SCN10A/brain">Human Protein Atlas</a> and <a href="https://gtexportal.org/home/gene/SCN10A">GTEx</a>, two publicly funded projects that set out to comprehensively catalog the distribution of every gene and protein among human tissues. The knowledge of where in the body each gene and protein is expressed is crucial for developing selective drugs that act only where you want them to act.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/the-first-non-opioid-drug-for-severe?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/the-first-non-opioid-drug-for-severe?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h2>Converting knowledge into therapy </h2><p>Fundamental knowledge of sodium channels, and Na<sub>V</sub>1.8 specifically, is what enabled pharma scientists to choose Na<sub>V</sub>1.8 as an attractive target for drug development. Vertex Pharmaceuticals tested several compounds and discovered one that was much more selective than the rest. Other companies also have Na<sub>V</sub>1.8 inhibitors in development. Getting a working drug across the finish line is difficult, even with a deep foundation of molecular knowledge, in part because our knowledge is always imperfect. One reason, though not the only one, that drug development is <a href="https://pubmed.ncbi.nlm.nih.gov/32300238/">so expensive</a> is that the failure rate is high. An <a href="https://pubmed.ncbi.nlm.nih.gov/29348681/">improvement in success rates</a> for taking a drug from pre-clinical work to FDA approval will help bring down the cost producing new drugs. One of the most effective ways to improve success rates is to base drug discovery on <a href="https://pubmed.ncbi.nlm.nih.gov/35804044/">genetic knowledge</a>, most of which is produced by publicly funded academic scientists. A <a href="https://pubmed.ncbi.nlm.nih.gov/31830040/">2019 analysis</a> found that using human genetic evidence to target drug development increased approval rates by more than two-fold. The story of Suzetrigine illustrates why. Extensive evidence in model organisms and in human patients with Na<sub>V</sub>1.8 mutations established a strong scientific foundation for a drug development program. </p><p>The final important piece of drug development is the regulatory agency. The FDA has an obligation to keep unsafe and ineffective drugs off the market, but equally important is its mission to facilitate the release of drugs that are safe and effective. The agency established important <a href="https://www.fda.gov/patients/learn-about-drug-and-device-approvals/fast-track-breakthrough-therapy-accelerated-approval-priority-review">guidelines and processes</a> for drugs that could treat urgent, unmet medical needs, and suzetrigine was approved under the FDA&#8217;s Breakthrough Therapy, Fast Track, and Priority review pathways.</p><p>The opioid epidemic was driven in part by the <a href="https://www.nytimes.com/2020/10/21/health/purdue-opioids-criminal-charges.html?unlocked_article_code=1.xE4.RRgO.YAAD2XGZdm6X&amp;smid=url-share">criminal activity</a> of one pharmaceutical company, but a major, unmet medical need was another important driver. There are still many unmet medical needs that could be solved with better drugs. Those future drugs will, more often than not, emerge from the fertile soil of our rapidly growing, publicly funded stock of genetic and genomic knowledge.</p><p>If you want to learn more about the development of Na<sub>V</sub>1.8 sodium channel inhibitors, The New England Journal of Medicine&#8217;s podcast Intent to Treat put out an <a href="https://www.nejm.org/doi/full/10.1056/NEJMp2305759">informative episode</a> with Yale neurobiologist Stephen Waxman back in 2023, when the suzetrigine study was published. Waxman has long been one of the leaders of the Na<sub>V</sub>1.8 field.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[How to train your model ]]></title><description><![CDATA[...when you don't have enough data. A new preprint presents a clever simulation.]]></description><link>https://www.thisgenomiclife.org/p/how-to-train-your-model</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/how-to-train-your-model</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 14 Feb 2025 13:03:36 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!609e!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>It&#8217;s probably not controversial to say that the limiting resource for better AI tools in biomedical research is not GPUs or new AI architectures, but data. As a <a href="https://www.oneusefulthing.org/p/scaling-the-state-of-play-in-ai?utm_campaign=post&amp;utm_medium=web">general rule</a>, more training data is needed to build more capable models. The problem is that there <a href="https://www.nature.com/articles/d41586-024-03990-2">may not be enough data</a> in the world to achieve the capabilities we want from AI. <a href="https://arxiv.org/abs/2211.04325">This paper</a> predicts that the biggest large language models (LLMs) could run out of human text data as early as 2026. And then there are specific AI applications, like reading CT scans or tissue sections, which (at least for now) rely heavily on supervised training that requires human-labeled data (images) that the model learns. Labeling images to create these datasets requires time-consuming and expensive human effort.</p><p>An obvious solution to the data problem is to simulate training examples. With a good simulation, you could generate as much training data as you need. How do you create a good simulation? For LLMs like GPT-4, you could use the trained model to generate much more text, and then train the next on the generated text, etc. That kind of recursive training risks sending your model into a strange place. A <a href="https://www.nature.com/articles/s41586-024-07566-y">paper</a> published last year reported that AI models collapse when trained on data they generated. The authors wrote that &#8220;the model becomes poisoned with its own projection of reality.&#8221;</p><p>An alternative to recursive training is to simulate your data with a different model, one that is not tied into deep learning model that you are attempting to train. My WashU Genetics colleague Willie Buchser and his lab and collaborators at the McDonnell Genome Institute (MGI) have a <a href="https://www.biorxiv.org/content/10.1101/2024.08.02.605366v2.full">preprint</a> out that uses an interesting approach: they use the popular 3D computer graphics software <a href="https://www.blender.org/about/">Blender</a> to simulate biophysically realistic tissue samples. They then use the simulated data to train a convolutional neural network (CNN) that learns to automatically annotate real tissue samples.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/how-to-train-your-model?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/how-to-train-your-model?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h3>Using all of the data in spatial &#8216;omics</h3><p>The context in which Willie and his colleagues are trying to automate tissue annotation is the exploding new field of spatial &#8216;omics. The ability to <a href="https://pubmed.ncbi.nlm.nih.gov/18516045/">measure the expression of all genes</a> in cells and bulk tissues was transformative back when RNA-seq arrived in the scene in 2008. But tissues are composed of complex arrangements of cells of different types, and even cells of the same type may be in different gene expression states at different locations in a tissue. The invention of technologies to <a href="https://pubmed.ncbi.nlm.nih.gov/26000488/">perform RNA-seq on single cells</a> was an important step forward because now different cell types in a tissue could be distinguished based on their gene expression profiles. But while single-cell RNA-seq resolves the identity of individual cells, information about the spatial distribution of these cells within a tissue or a tumor is lost.</p><p>Barely a decade after the introduction of RNA-seq, RNA sequencing <a href="https://pubmed.ncbi.nlm.nih.gov/30385464/">within</a> <a href="https://pubmed.ncbi.nlm.nih.gov/29930089/">intact</a> tissues arrived, known as spatial transcriptomics. Science advances in large part by the ability to make measurements at new scales, and the capacity to determine cell types and their expression states on the spatial scale of whole tissues is beginning to transform disease genomics.</p><p>A detailed discussion of these technologies will be a post for another day. In the context of the new work by Willie and his colleagues, an important advantage of spatial transcriptomics is that it provides not just gene expression values and spatial coordinates, but actual images of tissue slices whose features can be annotated with labels that mark known anatomical features. But this anatomical information isn&#8217;t often used because most spatial &#8216;omics analyses take what the authors call a bottom-up approach. When working from the bottom up, the molecular sequencing data is used to group individual cells by their gene expression patterns, and then only afterwards are cell clusters mapped back onto spatial coordinates. Instead of directly using prior information about anatomical structure of the tissue, anatomical regions are roughly reconstructed by the clusters of cell types mapped back on to the tissue coordinates.  The MGI group argues instead for a top-down approach, in which cells are first labeled by the annotated anatomical region from which they came, after which gene expression patterns can be compared:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!609e!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!609e!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic 424w, https://substackcdn.com/image/fetch/$s_!609e!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic 848w, https://substackcdn.com/image/fetch/$s_!609e!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic 1272w, https://substackcdn.com/image/fetch/$s_!609e!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!609e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic" width="502" height="333.91460674157304" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:592,&quot;width&quot;:890,&quot;resizeWidth&quot;:502,&quot;bytes&quot;:41839,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!609e!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic 424w, https://substackcdn.com/image/fetch/$s_!609e!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic 848w, https://substackcdn.com/image/fetch/$s_!609e!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic 1272w, https://substackcdn.com/image/fetch/$s_!609e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbc277c-42ad-43d4-bde1-d942ce5814d0_890x592.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 1A from Ali, <em>et al</em>., <a href="https://doi.org/10.1101/2024.08.02.605366">https://doi.org/10.1101/2024.08.02.605366</a>. CC-BY-NC-ND 4.0</figcaption></figure></div><h3>Automation is great but annotation is expensive</h3><p>The top-down approach would have many important applications, such as comparing healthy and pathological brain samples. But the limitation is that annotating tissue sections is time-consuming and requires a pathologist or some other expert to manually mark the boundaries of different regions on each new image. This is just not feasible in most settings, which is why the top-down approach isn&#8217;t widely used. But the result is that, as the MGI group points out, information in the dataset is left on the table.</p><p>Software that could automatically segment tissue images into different anatomical regions would make the top-down approach feasible. Image analysis is a task that neural networks excel at, so such automatic segmentation of tissue images would be a natural job for a CNN. This, however, takes us back to the data problem I mentioned at the outset: obtaining labeled data to train neural networks is expensive, and in the case of tissue annotation, the data doesn&#8217;t exist at the scale you&#8217;d need to train a sufficiently good model.</p><p>The solution that Willie and his colleagues came up with is the main point of this post: they developed a method to simulate realistic images of tissue sections, and then used the simulated data to train a CNN capable of automatically segmenting anatomical regions in images of tissue slices. Here is the outline of how it works:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E37y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E37y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic 424w, https://substackcdn.com/image/fetch/$s_!E37y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic 848w, https://substackcdn.com/image/fetch/$s_!E37y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic 1272w, https://substackcdn.com/image/fetch/$s_!E37y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E37y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic" width="1456" height="428" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:428,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:167697,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E37y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic 424w, https://substackcdn.com/image/fetch/$s_!E37y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic 848w, https://substackcdn.com/image/fetch/$s_!E37y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic 1272w, https://substackcdn.com/image/fetch/$s_!E37y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47cc49cd-5707-4b7d-8051-7caa4e86437c_2860x840.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 7 from Ali, <em>et al.</em>,  <a href="https://doi.org/10.1101/2024.08.02.605366">https://doi.org/10.1101/2024.08.02.605366</a>, CC-BY-NC-ND 4.0</figcaption></figure></div><p>They being by simulating individual cells with different sizes, shapes, and intensities, determined by the simulated cell type. This microscale simulation is then used to build a regional simulation, in which populations of cells are built up into larger regions based on simulation parameters for cell density and variation in cell types. Regions are then built up into whole tissue sections, such as the simulated coronal section of a mouse brain shown in panel C above. All of this is done with the CAD software Blender, an open-source suite of tools used for game and movie animations. </p><p>Once the 3D tissue simulations are put together, the MGI group then simulates the process of microscopic imaging, including adding debris, tears, optical distortions, etc.:</p><blockquote><p>To do this, we use optics and the rendering engine within Blender. Specifically, we employ a virtual camera with a 15.5 mm focal length and a low f-stop (0.01) to focus on the tissue section from about 1.4 mm away. Then the rendering engine &#8216;Cycles&#8217; calculates the light being emitted by or bouncing off the samples and uses it to produce an image. After the image layer is produced, we use a compositor layer to add varying amounts of white noise (0-95%), making the image slightly harder to interpret. This process is repeated (per &#8216;frame&#8217;) for the total number of images and produces the simulated &#8216;microscope&#8217; images for our dataset.</p></blockquote><p>The simulated images are then used to train the segmentation CNN, which they call Si-Do-La (<strong>Si</strong>mulate <strong>Do</strong>n&#8217;t <strong>La</strong>bel). Using a few desktop computers, they generated ~4000 simulated images per hour.</p><p>How well does Si-Do-La predict the boundaries of cells and anatomical regions? For the detailed performance assessment, check out the paper, but the authors show that Si-Do-La performs well at different scales on samples of mouse brain and spinal cord, and pig sciatic nerve. Here is an example from the paper:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zyEn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zyEn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic 424w, https://substackcdn.com/image/fetch/$s_!zyEn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic 848w, https://substackcdn.com/image/fetch/$s_!zyEn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic 1272w, https://substackcdn.com/image/fetch/$s_!zyEn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zyEn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic" width="1456" height="746" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:746,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:151264,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zyEn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic 424w, https://substackcdn.com/image/fetch/$s_!zyEn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic 848w, https://substackcdn.com/image/fetch/$s_!zyEn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic 1272w, https://substackcdn.com/image/fetch/$s_!zyEn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083ddcb5-516f-44f4-9557-95c98a29159e_1488x762.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">From Fig. 3A of Ali<em>, et al.</em>, <a href="https://doi.org/10.1101/2024.08.02.605366">https://doi.org/10.1101/2024.08.02.605366</a>. CC-BY-NC-ND 4.0</figcaption></figure></div><p>I like this paper because it is a clever example of a realistic simulation that is independent of the machine learning model that will be trained on the output. Unlike recursive training, this approach avoids the risk that the model will fall ever deeper into its own mistaken projection of reality. Realistic simulations probably can&#8217;t be done to address every type of data shortage, but it will be worthwhile to put effort into developing good simulations where they are feasible. This might include genome sequence, since the human genome itself is <a href="https://pubmed.ncbi.nlm.nih.gov/38093018/">too small and lacks the sequence diversity </a>we&#8217;ll need to achieve many of our modeling goals. (Though not all&#8230; see AlphaFold.) Simulating data is only one tool in <a href="https://journalofbigdata.springeropen.com/articles/10.1186/s40537-023-00727-2">the toolbox</a> when it comes to dealing with limited training data, but there is surely more potential in simulations that could be realized with clever methods like the one described by my MGI colleagues.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Darwin and The Cultural Rise of Science]]></title><description><![CDATA[An appreciation of The Voyage of the Beagle to celebrate Darwin Day.]]></description><link>https://www.thisgenomiclife.org/p/darwin-and-the-cultural-rise-of-science</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/darwin-and-the-cultural-rise-of-science</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Wed, 12 Feb 2025 14:03:43 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!MiR-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MiR-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MiR-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic 424w, https://substackcdn.com/image/fetch/$s_!MiR-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic 848w, https://substackcdn.com/image/fetch/$s_!MiR-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic 1272w, https://substackcdn.com/image/fetch/$s_!MiR-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MiR-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic" width="522" height="558.2887700534759" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1000,&quot;width&quot;:935,&quot;resizeWidth&quot;:522,&quot;bytes&quot;:181178,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MiR-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic 424w, https://substackcdn.com/image/fetch/$s_!MiR-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic 848w, https://substackcdn.com/image/fetch/$s_!MiR-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic 1272w, https://substackcdn.com/image/fetch/$s_!MiR-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffec4e640-8c0c-4d0e-98d5-e912886eff6c_935x1000.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Sitting on a bookshelf in my living room are the fifty volumes of my great-grandfather's Harvard Classics. This series, first published in 1909-1910, was selected and edited by Harvard President Charles Eliot, not as an attempt to pick the &#8220;best books of all time&#8221;, as he put it, but rather &#8220;to give, in twenty-three thousand pages or thereabouts, a picture of the progress of the human race within historical times, so far as that progress can be depicted in books.&#8221; At the time the Harvard Classics came out, my great-grandfather was a young Latvian political refugee fleeing the violence of the 1905 Russian Revolution. He arrived in America in 1906 at the age of 19, and eventually became an accomplished bacteriologist at Merck Sharp &amp; Dohme.</p><p>I never met him, but I suspect that my great-grandfather would have subscribed to Eliot's notion of human progress, progress that is the result of, as Eliot put it, &#8220;observing, recording, inventing, and imagining." The name &#8220;Harvard Classics&#8221; may sound pretentious today, but Dr. Eliot&#8217;s "five foot shelf of books" was mass-marketed to an American middle class that was presumed to share Eliot&#8217;s optimism about the trajectory of our civilization. The Harvard Classices were billed as a means by which, with just fifteen minutes of reading a day, anyone could gain access to the &#8220;prodigious store of recorded discoveries, experiences, and reflections which humanity in its intermittent and irregular progress from barbarism to civilization has acquired and laid up.&#8221; Since he owned these books, I presume my great-grandfather agreed with this general idea.</p><p>To describe what he was looking for in his selection of books, Eliot invoked the methods of science: &#8220;observing, recording, inventing, and imagining.&#8221; This is no accident &#8212; the Harvard Classics were released at a time when the modern scientific view of the world was arriving at a dominant position in our culture, carrying at least as much cultural heft as the major religions. In a cultural conquest as significant as the rise of major world religions in earlier centuries, science&#8217;s description of the world has become a near-universal baseline with which every other belief system, religious or not, is forced to reckon. Most items in the vast and surprising inventory of the scientific Big Picture&#8212;electrons, atoms, chromosomes, cells, galaxies, black holes, microwaves, mantle plumes, coronaviruses&#8212;are accepted as real by almost everyone, even though much of this inventory was unknown when my great-grandfather was born in 1887. (One striking example: scientists didn&#8217;t establish that there was more than one galaxy until the 1920&#8217;s.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kJfQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kJfQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic 424w, https://substackcdn.com/image/fetch/$s_!kJfQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic 848w, https://substackcdn.com/image/fetch/$s_!kJfQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic 1272w, https://substackcdn.com/image/fetch/$s_!kJfQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kJfQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic" width="270" height="480" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:480,&quot;width&quot;:270,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:31334,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!kJfQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic 424w, https://substackcdn.com/image/fetch/$s_!kJfQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic 848w, https://substackcdn.com/image/fetch/$s_!kJfQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic 1272w, https://substackcdn.com/image/fetch/$s_!kJfQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e1f200a-e348-448e-a8e0-7510dcb0df78_270x480.heic 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">My great-grandfather in his lab.</figcaption></figure></div><p>There is one volume in the Harvard Classics that most prophetically anticipates the coming cultural influence of scientific thought: Darwin's <em>The Voyage of the Beagle</em>. Too often, the <em>Voyage</em> is left out of Great Books lists in favor of Darwin&#8217;s more famous <em>On the Origin of Species</em>. The essayist Adam Kirsch, in an <a href="http://harvardmagazine.com/2001/11/eliots-elect-the-harvard.html">insightful piece</a> on the Harvard Classics, chided Eliot for the excessive enthusiasm that led him to devote two of the fifty volumes to Darwin. Kirsch suggests that a modern edition of the Classics should "doubtless" keep <em>On the Origin of Species</em> and ditch the <em>Voyage</em> for James Watson's <em>Double Helix</em>. Darwin's <em>Origin</em> is often included in collections of great books by default, because it is one of the very few important primary scientific documents that can be read without specialized training. But if I had to pick one book by Darwin to recommend, it would be the <em>Voyage</em>. The <em>Origin</em> is an argument for a specific scientific theory. The <em>Voyage</em> is a much more general foreshadowing of science's cultural force in the modern world, written before science&#8217;s overwhelming presence in modern life was fully established.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/darwin-and-the-cultural-rise-of-science?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/darwin-and-the-cultural-rise-of-science?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h3><strong>Observations</strong></h3><p><em>The Voyage of the Beagle</em> is Darwin's account of the five formative years abroad (1831-1836) with the crew of the HMS Beagle, an experience that transformed him into the scientist who would discover natural selection. The book was first published in 1839 as the most popular part of a multi-volume record of the Beagle's exploits, and later in 1845 as the definitive edition that we read today. The book is a hybrid, a scientific account fused to the ever-popular travel narrative. Darwin wrote both to summarize his scientific findings and to sell books to a reading public eager for accounts of exotic regions that most would never visit. The result is one of literature's great voyages, much like <em>Moby Dick</em> in its mass of scientific detail and similarly concerned with the meaning of nature and our place in it. What makes Darwin's <em>Voyage</em> stand out in this genre is his authentic, uncompromising scientific vision. Herman Melville in <em>Moby Dick</em> provides a stark contrast. While presenting pages and pages on the science and technology of whales and whaling, Melville agonizes over Providence, fate, purpose, and ultimate meaning in the universe. Darwin also has profound things to say about our place in the universe, but he does so without much fuss over the kinds of unanswerable questions that tormented Melville. Darwin patiently observes, records, and imagines, and in doing so sees a vast, complex planet that nevertheless can ultimately be explained.</p><p>Darwin's considerable powers of observation are immediately evident in <em>The Voyage of the Beagle</em>. His aim as a scientist is to infer the story underneath the visible events of nature, which is why he captures the small, but high-information details that allow him to narrow the space of possible explanations. Imagine arriving on the scene of the chaotic aftermath of a city devastated by an earthquake, with misery everywhere. While another writer might have written about human lives being subject to the whims of fate, here is Darwin on the 1835 earthquake that destroyed the Chilean city of Concepci&#243;n:</p><blockquote><p>The town of Concepci&#243;n was built in the usual Spanish fashion, with all the streets running at right angles to each other; one set ranging south-west by west, and the other set north-west by north. The walls in the former direction certainly stood better than those in the latter; the greater number of the masses of brickwork were thrown down towards the north-east. Both these circumstances perfectly agree with the general idea of the undulations having come from the south-west; in which quarter subterranean noises were also heard; for it is evident that the walls running south-west and north-east which presented their ends to the point whence the undulations came, would be much less likely to fall than those walls which, running north-west and south-east, must in their whole lengths have been at the same instant thrown out of the perpendicular...</p><p>The different resistance offered by the walls, according to their direction, was well exemplified in the case of the Cathedral. The side which fronted the north-east presented a grand pile of ruins... The side walls (running south-west and north-east), though exceedingly fractured, yet remained standing; but the vast buttresses (at right angles to them, and therefore parallel to the walls that fell) were in many cases cut clean off, as if by a chisel, and hurled to the ground. Some square ornaments on the coping of these same walls were moved by the earthquake into a diagonal position.</p></blockquote><p>Instead of chaos, Darwin sees geometry in the patterns in the surviving walls, down to the angles of the ornamental bricks. These patterns allow him to infer the behavior of the earthquake. It is this kind of observation and inference that eventually leads to the capacity to build more earthquake-resistant buildings.</p><p>Here is Darwin observing a wasp hunting a spider:</p><blockquote><p>The wasp made a sudden dash at its prey, and then flew away: the spider was evidently wounded, for, trying to escape, it rolled down a little slope, but had still strength sufficient to crawl into a thick tuft of grass. The wasp soon returned, and seemed surprised at not immediately finding its victim. It then commenced as regular a hunt as ever hound did after fox; making short semicircular casts, and all the time rapidly vibrating its wings and antennae. The spider, though well concealed, was soon discovered, and the wasp, evidently still afraid of its adversary's jaws, after much manoeuvring, inflicted two stings on the under side of its thorax. At last, carefully examining with its antennae the now motionless spider, it proceeded to drag away the body.</p></blockquote><p>Darwin notes where and how often the spider has been stung, and discovers the rationale behind the wasp's motion that others might have taken to be a random walk. Driving Darwin's method is a belief that the world has an underlying rationale, one that can be inferred if we look closely enough. The behavior of wasps and spiders is not unfathomable; it is a careful survival strategy. Earthquakes don't randomly knock down walls; they travel in waves and exert forces on buildings at particular angles. Darwin's observations are precise because the world is precise, locked into tight constraints of cause and effect, and therefore we use the tools of reason and observation to understand why the world behaves the way it does.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vfpX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vfpX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic 424w, https://substackcdn.com/image/fetch/$s_!vfpX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic 848w, https://substackcdn.com/image/fetch/$s_!vfpX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic 1272w, https://substackcdn.com/image/fetch/$s_!vfpX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vfpX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic" width="524" height="430.06868131868134" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1195,&quot;width&quot;:1456,&quot;resizeWidth&quot;:524,&quot;bytes&quot;:285298,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vfpX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic 424w, https://substackcdn.com/image/fetch/$s_!vfpX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic 848w, https://substackcdn.com/image/fetch/$s_!vfpX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic 1272w, https://substackcdn.com/image/fetch/$s_!vfpX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041f2872-ca17-4847-9582-772b9c3f50a6_1816x1490.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image by Tony Wills, <a href="https://commons.wikimedia.org/wiki/File:Wasp_and_spider_02.jpg">Wikimedia Commons</a>, CC BY 2.5 </figcaption></figure></div><p>Darwin showed a disciplined commitment to avoiding unjustified shortcuts in the pursuit of answers. Curiosity may be a common human trait, but rigorously and reliably satisfying that curiosity is not. Darwin often had difficulty convincing people that he was genuinely driven to understand the workings of nature, and that he wasn't secretly prospecting for valuable ores:</p><blockquote><p>I found the most ready way of explaining my employment was to ask them how it was that they themselves were not curious concerning earthquakes and volcanos?--why some springs were hot and others cold?--why there were mountains in Chile, and not a hill in La Plata? These bare questions at once satisfied and silenced the greater number; some, however (like a few in England who are a century behindhand), thought that all such inquiries were useless and impious; and that it was quite sufficient that God had thus made the mountains.</p></blockquote><p>Science depends upon a restless, intellectual dissatisfaction with inadequate, speculative answers. It was this dissatisfaction that drove Darwin to brave thousands of miles while enduring discomforts, seasickness, earthquakes, severe thirst, cold nights on beds of rocks, vile food, and even violent political upheaval, with an energy that is evident on every page of the <em>Voyage</em>.</p><h3><strong>People</strong></h3><p>It would be wrong to assume that Darwin's scientific instincts make him a passionless observer of the world. In fact, much of the power of the <em>Voyage</em> comes from Darwin's capacity to find emotional resonance in the targets of his scrutiny. His knack for seeing precisely works with his sense of humanity to make Darwin particularly skilled at rendering people, and describing their poignant and ironic moments as they respond to their natural and social environments.</p><p>This is perhaps most vividly seen in Darwin's account of his encounters with the natives at the end of the world in Tierra del Fuego. The crew of the Beagle, on a previous voyage, had basically kidnapped several Fuegians, and the Beagle's captain was bringing them back home. One of these natives, 'Jemmy Button', was purchased as a boy for the price of a pearl button. Now a young man, he is about to return to his stone age culture, after having spent three years within one of the world's most technologically sophisticated civilizations. He hardly remembers his native language and speaks broken English, but, like any teenager, is very impressed with the fact that he can make himself look sophisticated:</p><blockquote><p>Jemmy was short, thick, and fat, but vain of his personal appearance; he used always to wear gloves, his hair was neatly cut, and he was distressed if his well-polished shoes were dirtied. He was fond of admiring himself in a looking glass; and a merry-faced little Indian boy from the Rio Negro, whom we had for some months on board, soon perceived this, and used to mock him: Jemmy, who was always rather jealous of the attention paid to this little boy, did not at all like this, and used to say, with rather a contemptuous twist of his head, &#8220;Too much skylar<em>k.&#8221;</em></p></blockquote><p>Jemmy is struggling with acute and ambiguous feelings about his background: he is defensive about his people, yet he wants to prove that despite his family roots, he can be at least as civilized as any of his British shipmates. And now he's about to be unceremoniously dropped back among his kinfolk. In the mid-nineteenth century, too many thought nothing of such callous treatment; for them, there was a natural order to things, and Jemmy was best off in his proper place. But Darwin manages, almost inadvertently, to poignantly render the ambiguity of Jemmy's reunion with his family:</p><blockquote><p>We had already perceived that Jemmy had almost forgotten his own language. I should think there was scarcely another human being with so small a stock of language, for his English was very imperfect. It was laughable, but almost pitiable, to hear him speak to his wild brother in English, and then ask him in Spanish ("no sabe?") whether he did not understand him.</p></blockquote><p>Some time later, when the Beagle briefly returns to the region before leaving for good, Jemmy shows up again in a canoe, unclothed, scrawny, and long-haired. His first action is to turn his back on his former shipmates in embarrassment.</p><p>Darwin's account of the Fuegians is unforgettable because Darwin was more unsettled by his encounters with these people than with any others during his five year voyage. He is honest about his rattled feelings and about the difficulty he has making sense of what appear to him to be lives of unnecessary misery:</p><blockquote><p>A woman, who was suckling a recently-born child, came one day alongside the vessel, and remained there out of mere curiosity, whilst the sleet fell and thawed on her naked bosom, and on the skin of her naked baby! These poor wretches were stunted in their growth, their hideous faces bedaubed with white paint, their skins filthy and greasy, their hair entangled, their voices discordant, and their gestures violent. Viewing such men, one can hardly make oneself believe that they are fellow-creatures, and inhabitants of the same world. It is a common subject of conjecture what pleasure in life some of the lower animals can enjoy: how much more reasonably the same question may be asked with respect to these barbarians! At night five or six human beings, naked and scarcely protected from the wind and rain of this tempestuous climate, sleep on the wet ground coiled up like animals. Whenever it is low water, winter or summer, night or day, they must rise to pick shellfish from the rocks; and the women either dive to collect sea-eggs, or sit patiently in their canoes, and with a baited hair-line without any hook, jerk out little fish. If a seal is killed, or the floating carcass of a putrid whale is discovered, it is a feast; and such miserable food is assisted by a few tasteless berries and fun<em>gi.</em></p></blockquote><p>Putting ourselves in Darwin&#8217;s place, if we were confronted with such people who seemed thoroughly alien, but are nonetheless as human as anyone else in their intelligence, their desires, and their needs, how would we react? In this case, Darwin's method of observing and then inferring the story again serves him well, because he does not try to fit the Fuegians into some morality tale. Instead, recognizing them as fully human, he attempts to understand how it could be that they exist in such an apparently miserable condition.</p><blockquote><p>[O]ne asks, Whence have they come? What could have tempted, or what change compelled, a tribe of men, to leave the fine regions of the north, to travel down the Cordillera or backbone of America, to invent and build canoes, which are not used by the tribes of Chile, Peru, and Brazil, and then to enter on one of the most inhospitable countries within the limits of the globe? Although such reflections must at first seize on the mind, yet we may feel sure that they are partly erroneous. There is no reason to believe that the Fuegians decrease in number; therefore we must suppose that they enjoy a sufficient share of happiness, of whatever kind it may be, to render life worth having. Nature by making habit omnipotent, and its effects hereditary, has fitted the Fuegian to the climate and the productions of his miserable country.</p></blockquote><h3><strong>Language</strong></h3><p>Darwin is a scientific stylist, writing a travel narrative that shows how the practice of science is not an impersonal, algorithmic process, but is instead a human act of imagination. The <em>Voyage</em> shows, at the fine level of sentences and paragraphs, a scientific imagination churning away, observing, sifting, and finally inferring the story of nature's driving forces. As Darwin puts it, &#8220;The limit of man's knowledge in any subject possesses a high interest, which is perhaps increased by its close neighbourhood to the realms of imagination.&#8221;</p><p>In this, Darwin compares favorably with another first-rate travel writer, Herman Melville. Both Darwin and Melville cut their teeth producing best-selling accounts of their voyages to the Southern hemisphere. (In Melville's case the accounts were lightly fictionalized.) Both knew what would sell among the travel-narrative-reading public, but the differences in their styles highlight the power that Darwin's scientific instincts bring to his language.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IEm7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IEm7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic 424w, https://substackcdn.com/image/fetch/$s_!IEm7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic 848w, https://substackcdn.com/image/fetch/$s_!IEm7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic 1272w, https://substackcdn.com/image/fetch/$s_!IEm7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IEm7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic" width="1456" height="961" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:961,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1287425,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IEm7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic 424w, https://substackcdn.com/image/fetch/$s_!IEm7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic 848w, https://substackcdn.com/image/fetch/$s_!IEm7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic 1272w, https://substackcdn.com/image/fetch/$s_!IEm7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dbcfd05-808d-4550-9360-3f06549b3f5b_2770x1828.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">San Crist&#243;bal, Galapagos, by Diego Delson via <a href="https://commons.wikimedia.org/wiki/File:Paisaje_en_Punta_Pitt,_isla_de_San_Crist&#243;bal,_islas_Gal&#225;pagos,_Ecuador,_2015-07-24,_DD_53.JPG">Wikimedia Commons</a>, CC BY-SA </figcaption></figure></div><p>Melville's imagery is impressionistic, as in this description of the Galapagos from <em>The Encantadas</em>:</p><blockquote><p>In many places the coast is rock-bound, or, more properly, clinker-bound; tumbled masses of blackish or greenish stuff like the dross of an iron-furnace, forming dark clefts and caves here and there, into which a ceaseless sea pours a fury of foam; overhanging them with a swirl of gray, haggard mist, amidst which sail screaming flights of unearthly birds heightening the dismal din. However calm the sea without, there is no rest for these swells and those rocks; they lash and are lashed, even when the outer ocean is most at peace with, itself. On the oppressive, clouded days, such as are peculiar to this part of the watery Equator, the dark, vitrified masses, many of which raise themselves among white whirlpools and breakers in detached and perilous places off the shore, present a most Plutonian sight. In no world but a fallen one could such lands exist.</p></blockquote><p>Darwin, while also drawing a comparison with iron-foundries, doesn&#8217;t make vague observations about "blackish and greenish stuff." His imagery depends on observational precision and tight organization of his thoughts. His words, while not always technical, are usually specific, and descriptions are often given in geometrical terms of symmetries, regularities, lines, and circles. Here is Darwin on Chatham Island:</p><blockquote><p>The entire surface of this part of the island seems to have been permeated, like a sieve, by the subterranean vapours: here and there the lava, whilst soft, has been blown into great bubbles; and in other parts, the tops of caverns similarly formed have fallen in, leaving circular pits with steep sides. From the regular form of the many craters, they gave to the country an artificial appearance, which vividly reminded me of those parts of Staffordshire where the great iron-foundries are most numerous. The day was glowing hot, and the scrambling over the rough surface and through the intricate thickets was very fatiguing; but I was well repaid by the strange Cyclopean scene. As I was walking along I met two large tortoises, each of which must have weighed at least two hundred pounds: one was eating a piece of cactus, and as I approached, it stared at me and slowly walked away; the other gave a deep hiss, and drew in its head. These huge reptiles, surrounded by the black lava, the leafless shrubs, and large cacti, seemed to my fancy like some antediluvian animals.</p></blockquote><p>Another strategy that Darwin uses to powerful effect is the shocking contrast between violence and beauty so commonly found in the world. In this case, the murder of a sea captain intrudes on a serene moment in the Galapagos:</p><blockquote><p>The water is only three or four inches deep and rests on a layer of beautifully crystallised, white salt. The lake is quite circular, and is fringed with a border of bright green succulent plants; the almost precipitous walls of the crater are clothed with wood, so that the scene was altogether both picturesque and curious. A few years since the sailors belonging to a sealing-vessel murdered their captain in this quiet spot; and we saw his skull lying among the bushes.</p></blockquote><p>Note that the lake is not fringed with generic green plants, but green <em>succulent</em> plants. The walls of the crater are "almost precipitous"; here precipitous means literally perpendicular, forming a 90 degree angle with the surface. And contrasting with this very specific, orderly scene is a messy reminder of human conflict.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h3><strong>God</strong></h3><p>In the early 17th century, Francis Bacon warned that &#8220;from [an] unwholsome mixture of things human and divine there arises not only a fantastic[al] philosophy but also a heretical religion.&#8221; It&#8217;s a warning that scientists have taken seriously ever since. Darwin, in all of his arguments, inferences, hypotheses, and narratives of natural history, quietly refuses to ever invoke God as an explanation. The geographical distribution of animals, the causes of extinctions, the composition of mountain ranges, the layout of the plains of the South American Pampas, are all explained exclusively in terms of natural processes. These processes operate over vast scales of space and time, and are thus often not directly observable. They are inferred from observable evidence: raised beds of fossilized sea shells, thousands of miles from any ocean; folded layers of various types of rock exposed in the great mountains of the Cordillera; the resemblance of the skeletons of long-extinct, gigantic quadrupeds to those of living species. It is by inferences made from observations like these that Darwin and many others developed the grand explanations of the origin of the world around us, explanations that have now thoroughly infiltrated and in many cases completely supplanted those provided by other systems of belief.</p><p>Here is Darwin inferring a thoroughly materialistic creation story of the great mountain ranges of the South American Cordillera:</p><blockquote><p>As the beds of the conglomerate have been thrown off at an angle of 45 degrees by the red Portillo granite (with the underlying sandstone baked by it), we may feel sure that the greater part of the injection and upheaval of the already partially formed Portillo line took place after the accumulation of the conglomerate, and long after the elevation of the Peuquenes ridge. So that the Portillo, the loftiest line in this part of the Cordillera, is not so old as the less lofty line of the Peuquenes. Evidence derived from an inclined stream of lava at the eastern base of the Portillo might be adduced to show that it owes part of its great height to elevations of a still later date. Looking to its earliest origin, the red granite seems to have been injected on an ancient pre-existing line of white granite and mica-slate.</p></blockquote><p>The <em>Voyage</em> makes it clear that scientists' refusal to invoke divine agency as a causal explanation does not arise from an innate or personal opposition to religion. At issue is what suffices as an explanation. Darwin refuses to settle for vagueness, and rightly considers statements like 'it was born in nature', or 'God made it' to be no better (or perhaps worse) than saying 'I don't know.' This refusal to accept the unfathomable as an explanation is the first step in the scientific process, and it clears the ground for observation, hypothesis, inference, and experiment to operate. It is based on a belief that the world, unlike God, is not inscrutable, and that our minds can comprehend the detailed links of cause and effect that happen behind the scenes of the everyday, observable world. What you read in the <em>Voyage</em> is an almost fanatical demand for precision and rigor, a curmudgeonly insistence that we don't fill in our stories with what scientists call 'hand waving.' Again and again Darwin rejects such hand-waving, content-free explanations.</p><p>Of petrified trees:</p><blockquote><p>How surprising it is that every atom of the woody matter in this great cylinder should have been removed and replaced by silex so perfectly that each vessel and pore is preserved! These trees flourished at about the period of our lower chalk; they all belonged to the fir-tribe. It was amusing to hear the inhabitants discussing the nature of the fossil shells which I collected, almost in the same terms as were used a century ago in Europe,--namely, whether or not they had been thus "born by nature."</p></blockquote><p>Boiling water:</p><blockquote><p>At the place where we slept water necessarily boiled, from the diminished pressure of the atmosphere, at a lower temperature than it does in a less lofty country; the case being the converse of that of a Papin's digester. Hence the potatoes, after remaining for some hours in the boiling water, were nearly as hard as ever. The pot was left on the fire all night, and next morning it was boiled again, but yet the potatoes were not cooked. I found out this by overhearing my two companions discussing the cause, they had come to the simple conclusion "that the cursed pot (which was a new one) did not choose to boil potatoes."</p></blockquote><p>Today, scientific explanations of the world are overwhelmingly what we teach and what we research, and it is these exaplanations on which we base our technologies, our medical practice, and much of our social and and economic organization. This was not yet true when Darwin was writing his travel narrative, but the eventual cultural success of science was probably inevitable. Galileo, Newton, and their 17th century colleagues invented a method, rooted in extremely precise observation and careful inference, to reliably learn about the world. Three centuries of steady growth in reliable knowledge brought about the most astonishing material transformation of human life in the history of our species. By the time of the <em>Voyage</em>, science was making serious inroads into matter and earth's history, and it was about crack open biology in a spectacular way. </p><p>My great-grandfather, during his career as a bacteriologist, witnessed the first great taming of infectious disease by the development of antibiotics and new vaccines. Within the past few years, scientists have made the first direct observation of a black hole, invented new therapies to treat disease by editing a patient&#8217;s DNA, and built AI that can write code and design proteins. One hundred eighty years after the publication of the <em>Voyage</em>, science is a relentless current in our society whose power can be compared with the streams that Darwin observed in the Chilean Cordillera:</p><blockquote><p>As often as I have seen beds of mud, sand, and shingle, accumulated to the thickness of many thousand feet, I have felt inclined to exclaim that causes, such as the present rivers and the present beaches, could never have ground down and produced such masses. But, on the other hand, when listening to the rattling noise of these torrents, and calling to mind that whole races of animals have passed away from the face of the earth, and that during this whole period, night and day, these stones have gone rattling onwards in their course, I have thought to myself, can any mountains, any continent, withstand such waste?</p></blockquote><p>Pick up a free, electronic copy of <em>The Voyage of the Beagle</em> over at Project Gutenberg. (<a href="http://www.gutenberg.org/ebooks/3704">http://www.gutenberg.org/ebooks/3704</a>)</p><p><em>This essay is adapted from a much earlier piece published at my old blog, The Finch &amp; Pea.</em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[This Week’s Finds: Replication, Enhancer Testing, More AI]]></title><description><![CDATA[Five Friday links to items that caught my eye in genomics and beyond.]]></description><link>https://www.thisgenomiclife.org/p/this-weeks-finds-replication-enhancer</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/this-weeks-finds-replication-enhancer</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 17 Jan 2025 20:30:58 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!QtaJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QtaJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QtaJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic 424w, https://substackcdn.com/image/fetch/$s_!QtaJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic 848w, https://substackcdn.com/image/fetch/$s_!QtaJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic 1272w, https://substackcdn.com/image/fetch/$s_!QtaJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QtaJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic" width="516" height="453.1895223420647" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1140,&quot;width&quot;:1298,&quot;resizeWidth&quot;:516,&quot;bytes&quot;:93089,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QtaJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic 424w, https://substackcdn.com/image/fetch/$s_!QtaJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic 848w, https://substackcdn.com/image/fetch/$s_!QtaJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic 1272w, https://substackcdn.com/image/fetch/$s_!QtaJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fada34065-54a1-4c63-9bbc-81e3d6ca42af_1298x1140.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I&#8217;m skeptical of exact replication efforts but sometimes it works! A comparison of the same experiment performed in two different labs.</figcaption></figure></div><h2>1. Replication initiatives conflate replication with reproducibility</h2><p>I&#8217;m a little late to this, but last December <em>Science</em> <a href="https://www.science.org/content/article/nih-launches-initiative-double-check-biomedical-studies">reported</a> on a new <a href="https://nexus.od.nih.gov/all/2024/10/29/a-pilot-approach-to-replicating-research-the-nih-common-funds-replication-to-enhance-research-impact-initiative-provides-funding-to-support-research-replication-activities/">NIH initiative</a> to fund contract research organizations (CROs) to replicate studies. The NIH invited researchers to apply for $50,000 to be used to help a CRO re-run their experiments. Apparently there were few takers. I think anyone who has spend time trying to get a difficult assay to work in the lab might understand the reluctance to participate. The <em>Science </em>article quotes one researcher who sums up the problem well:</p><blockquote><p>Sean Morrison of the University of Texas Southwestern Medical Center, who edited some of the papers resulting from that project, notes the contract labs didn&#8217;t have the resources to do pilot experiments or repeat studies to work out kinks. And such labs often lack &#8220;the expertise of academic laboratories, especially when it comes to advanced or specialized techniques,&#8221; he says. In some cases, &#8220;this led to uninterpretable results.&#8221; The murky outcome of some replication attempts may have left scientists behind the original studies feeling their reputation was unfairly tarnished, he adds.</p></blockquote><p>While there are some specific types of studies that <em>should </em>be replicated (replication cohorts for GWAS are good), I and others have been <a href="https://thefinchandpea.com/2014/12/29/why-reproducibility-initiatives-are-misguided/">arguing</a> for a <a href="https://psmag.com/social-justice/trustworthy-published-science-97180/">long</a> <a href="https://thefinchandpea.com/2015/07/01/the-cancer-reproducibility-project-is-incredibly-naive-probably-useless-and-potentially-damaging-2/">time</a> that it is foolish to conflate <em>replication</em> with <em>reproducibility </em>in science. Chris Drummond of the Canadian National Research Council summed it up well in a <a href="https://web-archive.southampton.ac.uk/cogprints.org/7691/7/ICMLws09.pdf">piece </a>written way back in 2009 (PDF):</p><blockquote><p>A critical point of reproducing an experimental result is that irrelevant things are intentionally not replicated. One might say, one should replicate the result not the experiment&#8230;The sharing of all the artifacts from people&#8217;s experiments is not a trivial activity.</p></blockquote><p><em>Cell Reports</em> in 2014 <a href="https://www.sciencedirect.com/science/article/pii/S2211124714001211">published an example</a> of just how hard it is to exactly replicate someone&#8217;s experiments:</p><blockquote><p>Our two laboratories, one on the East and the other on the West Coast of the United States, decided to collaborate on a problem of mutual interest&#8212;namely, the heterogeneity of the human breast. Despite using seemingly identical methods, reagents, and specimens, our two laboratories quite reproducibly were unable to replicate each other&#8217;s fluorescence-activated cell sorting (FACS) profiles of primary breast cells. Frustration mounted, given that we had not found the correct answer(s), even after a year. Rather than giving up or each publishing our data without the other laboratory, we decided to work together to solve these differences, even traveling from one laboratory to the other in order to perform experiments side by side on the same human breast tissue sample. This exercise confirmed our suspicions and resolved our problem. Here, we summarize our cautionary tale and provide advice to our colleagues.</p></blockquote><p>Reproducibility, I contend, is usually achieved when multiple labs that use <em>different</em> methods show that a phenomenon is robust. Reproducibility can be poor when journals publish papers with poor controls, p-hacking, and other bad statistical practices. The solution to that is <em>not</em> replication initiatives but rather holding papers to higher standards for publication.</p><p>To reiterate, there are specialized cases in biomedical science where something closer to exact replication is indeed important, but those cases generally involve studies with methods that can and should be standardized: GWAS, clinical trials, pre-clinical testing of therapeutics, etc. But exactly replicating a basic science study like this <a href="https://pubmed.ncbi.nlm.nih.gov/20371346/">2010 </a><em><a href="https://pubmed.ncbi.nlm.nih.gov/20371346/">Cell</a></em><a href="https://pubmed.ncbi.nlm.nih.gov/20371346/"> paper</a> (one included in the Cancer Reproducibility Project) is a waste of time and resources.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h2>2. Yet another DNA language model</h2><p>Yun Song and colleagues at UC Berkeley have a <a href="https://pubmed.ncbi.nlm.nih.gov/39747647/">paper</a> in <em>Nature Biotechnology</em> that describes a DNA language model trained on the multiz multi-species alignment of 100 vertebrates. What makes their model stand out, the authors claim, is the computational efficiency of the approach compared the big foundation models out there. (I covered one of them <a href="https://thisgenomiclife.substack.com/p/this-weeks-finds-in-genomics-and">here</a>.) Their method is efficient because they didn&#8217;t actually train their model on 100 vertebrate genomes, but rather only on the top 5% conserved regions, plus some randomly selected regions. </p><p>They tested their model by asking it to classify pathogenic variants from various databases (ClinVar, COSMIC, OMIM, etc.). Their method sometimes does about as well as, and occasionally better than some other standard variant predictors, like CADD and ESM-1b. None of the methods do particularly well on deep mutational scanning data. They scored all 9 billion possible SNVs in the human genome, and have thus created another database of predicted variant effects.</p><p>There is nothing special about the performance here, but as this field develops, it is useful to watch the performance of two different approaches to language models: multi-species models, like this one, and functional genomics models, like Enformer. It&#8217;s not clear which approach is more effective.</p><h2>3. Accelerating <em>in vivo</em> enhancer assays</h2><p>Evgeny Kvon, of the <a href="https://pubmed.ncbi.nlm.nih.gov/27768887/">snakefied ZRS enhancer</a> fame, has a cool <a href="https://www.nature.com/articles/s41467-024-55500-7">new paper</a> out describing an efficient, two-color assay to test enhancer variants <em>in vivo</em>: </p><blockquote><p>Here we introduce dual-enSERT, a robust Cas9-based two-color fluorescent reporter system which enables rapid, quantitative comparison of enhancer allele activities in live mice in less than two weeks.</p></blockquote><p>They make one-color reporter mice, each carrying either the wild-type or alternate allele of an enhancer. The mice are crossed to generate two-color offspring, which allows you to measure both versions of the enhancer in the same mouse. This promises to make <em>in vivo</em> enhancer assays much more efficient and <em>reproducible</em>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/this-weeks-finds-replication-enhancer?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/this-weeks-finds-replication-enhancer?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h2>4. Training data for deep learning beyond the genome</h2><p>I don&#8217;t intend to use too much of this space to discuss my own work, but we have a <a href="https://www.sciencedirect.com/science/article/pii/S2405471224003685?via%3Dihub">paper</a> out in <em>Cell Systems </em>this week that, imho, works on an angle in genomics and machine learning that I think is a under-appreciated. The paper is the thesis work of Ryan Friedman, a fantastic former graduate student who is now a postdoc in Cole Trapnell&#8217;s lab. Ryan used a technique called active machine learning to iteratively train models of regulatory DNA on successive rounds of functional assays performed in the mouse retina:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XCdt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XCdt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic 424w, https://substackcdn.com/image/fetch/$s_!XCdt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic 848w, https://substackcdn.com/image/fetch/$s_!XCdt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic 1272w, https://substackcdn.com/image/fetch/$s_!XCdt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XCdt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic" width="426" height="378.4846153846154" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:924,&quot;width&quot;:1040,&quot;resizeWidth&quot;:426,&quot;bytes&quot;:41761,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XCdt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic 424w, https://substackcdn.com/image/fetch/$s_!XCdt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic 848w, https://substackcdn.com/image/fetch/$s_!XCdt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic 1272w, https://substackcdn.com/image/fetch/$s_!XCdt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0576fd3-bbdf-4643-ae64-13102ff38b58_1040x924.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Fig 1A from Friedman, <em>et al.</em> <em>Cell Systems</em> 2025 16(1):101163</figcaption></figure></div><p>An important point of this approach is that, with functional assays and affordable DNA synthesis, we aren&#8217;t limited to training data from the genome. In fact, we can design our experiments to produce training data that is better optimized for deep learning. That the genome may not offer the best training data is something that is increasingly recognized in the field &#8212; see this <a href="https://pubmed.ncbi.nlm.nih.gov/38093018/">commentary</a> (&#8220;Hold out the genome&#8221;) by De Boer and Taipale (bioRxiv <a href="https://www.biorxiv.org/content/10.1101/2023.04.20.537701v1">here</a>), and this very <a href="https://pubmed.ncbi.nlm.nih.gov/39362779/">good review</a> by La Fleur, Shi, and Seelig. There isn&#8217;t enough sequence diversity in the genome to learn some of the things we want to learn, and a big chunk of the genome is inactive and probably not especially informative. Obviously not all (or even most) genomic assays use synthesized DNA, but we have a number of powerful functional assays that do, and we should take advantage of the opportunity to design experiments to better achieve our deep learning goals.</p><h2>5. Is it harder to make scientific progress?</h2><p>Now and again people write <a href="https://www.vox.com/future-perfect/2023/1/11/23549993/science-research-progress-studies-disruption-technology-artificial-intelligence-biotechnology">big think pieces</a> asking whether <a href="https://pubmed.ncbi.nlm.nih.gov/36600070/">scientific progress is slowing</a>. I don&#8217;t think you should generalize. AI and biology seem to be firing on all cylinders right now. But perhaps things are different in physics. Physicist Chad Orzel, who write <em><a href="https://open.substack.com/pub/chadorzel/p/everybody-is-wrong-about-the-structure-56f?r=bwtw3&amp;selection=f33513b4-5ad8-498b-898c-86db11fe3411&amp;utm_campaign=post-share-selection&amp;utm_medium=web">Counting Atoms</a></em> made the point in a way I found interesting. If you consider that huge progress in physics has been made by finding anomalies, like the Michealson-Morley experiment that failed to detect the movement of the earth, or  precession of Mercury that could not be accounted for by classical physics, then <a href="https://open.substack.com/pub/chadorzel/p/everybody-is-wrong-about-the-structure-56f?r=bwtw3&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcomeOnShare=false">this point</a> seems important:</p><blockquote><p>All the anomalies in the sixth decimal place were measured a century ago&#8212; modern physicists are looking for changes in the <em>14th decimal place</em>. You can still come up with new ideas about physics, sure, but they&#8217;re <em>incredibly</em> tightly constrained by prior experiments, and testing those theories is vastly more complicated an expensive than it was in the golden era&#8212; you used to be able to discover new particles by leaving some photographic plates out on top of a mountain, but now you need millions to billions of dollars of sophisticated electronic detectors.</p></blockquote><p>&#8220;All the anomalies in the sixth decimal place were measured a century ago.&#8221; Science is cumulative in an important sense, and the scale at which we need to search to find new phenomena in some fields becomes increasingly harder to achieve, because all of the relevant measurements at the easier scales have been made. Biology is not quite the same, but there are some parallels. Samples sizes of GWAS need to be very large to detect the associations that we&#8217;re going after these days. Bulk tissue measurements of gene expression aren&#8217;t enough; we now need single-cell resolution, and maybe some spatial information too. There are still big questions to answer, but smart people have been at this business for awhile, always pushing at the technological frontier.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p> </p>]]></content:encoded></item><item><title><![CDATA[Human embryo editing - should we do it?]]></title><description><![CDATA[Two recent papers weigh the pros and cons but don't really emphasize the main barrier.]]></description><link>https://www.thisgenomiclife.org/p/human-embryo-editing-should-we-do</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/human-embryo-editing-should-we-do</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Tue, 14 Jan 2025 23:12:37 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!QFXx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QFXx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QFXx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic 424w, https://substackcdn.com/image/fetch/$s_!QFXx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic 848w, https://substackcdn.com/image/fetch/$s_!QFXx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic 1272w, https://substackcdn.com/image/fetch/$s_!QFXx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QFXx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic" width="598" height="445.6553911205074" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:705,&quot;width&quot;:946,&quot;resizeWidth&quot;:598,&quot;bytes&quot;:118682,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QFXx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic 424w, https://substackcdn.com/image/fetch/$s_!QFXx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic 848w, https://substackcdn.com/image/fetch/$s_!QFXx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic 1272w, https://substackcdn.com/image/fetch/$s_!QFXx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cc23158-4128-46d1-a4d7-2ce5a41082d5_946x705.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A human embryo</figcaption></figure></div><p>Last week <em>Nature</em> published a <a href="https://www.nature.com/articles/s41586-024-08300-4">provocative think-piece</a> by the eminent human geneticist Peter Visscher and his colleagues that works through the potential impact of human embryo editing on disease. They assume for the sake of argument that we can reliably edit individual embryo genomes in tens or hundreds places. Reliable here means that you edit <em>all</em> and <em>only</em> the target positions, so that there is little risk of off-target or incomplete editing. Assuming that capacity, Visscher and colleagues ran a model showing that you could substantially reduce the burden of common diseases among a hypothetical genome-edited population. Disease with polygenic risks, like type 2 diabetes, coronary artery disease, Alzheimer's disease, schizophrenia, and major depressive disorder could be prevented by editing in protective alleles. </p><p>They make the bold claim that &#8220;editing 40 variants could greatly reduce an individual&#8217;s lifetime risk of AD, SCZ, T2D and CAD to less than 0.2%&#8221;. The degree of risk reduction obviously depends on the base rate of disease, but the predicted reduction is impressive. For example, editing ~10 variants to protect against type 2 diabetes could reduce its prevalence among genome-edited people by <em>50 fold</em>! The big point is that <em>if </em>such edits ever become feasible, we need to consider the potentially massive benefit to human health, at least among those whose genomes are edited.</p><h3>What about the risks? </h3><p>Anyone reading this surely is immediately thinking about a bunch of caveats, many of which Visscher and colleagues acknowledge and discuss. The model involves some important assumptions, and you could ask, how would this hold up across different environments? How much does genomic &#8220;background&#8221; matter &#8212; would these numbers hold for different populations across the world? What about pleiotropy &#8212; would editing a dozen variants to reduce your risk for major depressive disorder also increase your risk for some other condition in as yet unknown ways? And won&#8217;t routine genome editing basically bring on a dystopian Gattaca scenario in which those with the means to access editing for their kids dominate the world?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/human-embryo-editing-should-we-do?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/human-embryo-editing-should-we-do?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p>These are all good questions, and Visscher and colleagues discuss them. So do Shai Carmi, Henry Greely, and Kevin Mitchell, in a <a href="https://www.nature.com/articles/d41586-024-04105-7">responding piece</a> in the same journal issue. They are much less sanguine about the risks of genome editing than Vissher, <em>et al. </em>Their most compelling point is that even if you assume that embryo editing technology gets much better than it now is, the method would still impose unknown risks on the non-consenting future child. Mistakes and incomplete knowledge about the impact of particular alleles on development could have catastrophic effects on some children who had no say in the decision to submit their genomes to editing. We need to keep in mind that while idealizations may work for thought experiments, in practice no technology is ever mistake-free. A counterpoint would be that back in the 1960&#8217;s and 70&#8217;s, <a href="https://www.theatlantic.com/technology/archive/2012/06/the-ivf-panic-all-hell-will-break-loose-politically-and-morally-all-over-the-world/258954/">similar concerns</a> were raised about in vitro fertilization (IVF). But IVF turned out to be an enormous benefit to millions of families, including mine, and it didn&#8217;t have a detrimental impact on the health of children born by the procedure. I recognize that the parallel to embryo editing is very imperfect, but we should keep that example in mind when we talk about unknown risks.</p><p>I have followed the individual work of Carmi, Greely, and Mitchell, and I have tremendous admiration for all three of them. Greely, for example, is the author of <em><a href="https://mitpress.mit.edu/9780262543880/crispr-people/">CRISPR People: The Science and Ethics of Editing Humans</a></em>, and has clearly thought very carefully about the issues surrounding embryo editing. But I found this <em>Nature</em> perspective a bit frustrating, even <s>a little hysterical</s> too combative in its tone. They repeatedly accuse Vissher, <em>et al</em> of ignoring issues that are in fact discussed in their piece, though clearly not to the satisfaction of Carmi, Greely, and Mitchell. (Caveat: I don&#8217;t know what the editing process was, and perhaps an earlier draft of the Vissher piece, shared with those who wrote the response,  didn&#8217;t address these topics.) Carmi, Greely, and Mitchell ask whether it&#8217;s responsible to even have this discussion, concluding their piece with the following sentence: &#8220;Is it wise to distract stakeholders, including the public, with a technology that is still a long way off at best, and might never actually be safe?&#8221;</p><h3>The technology will get there but will we need it?</h3><p>While I&#8217;m glad these three wrote an important response, I&#8217;m not so sure it&#8217;s unwise to discuss the possibilities of polygenic embryo editing. We should be wary of underestimating the technological advances in genome editing that will occur over the next generation. Vissher and his colleagues suggest a time frame of 30 years. 30 years ago, the technology to complete the human reference genome was still under development, and none of today&#8217;s major sequencing and gene editing technologies existed. </p><p>There are critical applications of gene editing technology other than modifying embryos, especially in basic research and gene therapy for people suffering from catastrophic disease. These applications, not embryo editing, are motivating hundreds of innovative projects to develop more reliable gene editing. This field is moving very fast, and an enormous number of potential technologies are quickly being explored. I think reliable, polygenic gene editing will get here sooner than we might guess. In parallel, the functional genomics field is currently putting in a big effort to expand our catalog of actionable genetic variants, and in 30 years I have no doubt at all that we will have enough causal variants on hand to implement the strategy proposed by Vissher, <em>et al</em>. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><p>So I do think that, with the technologies moving as quickly as they are, it is important to start thinking about this issue now. But for me the biggest reasons to not go the route of polygenic embryo editing are ones that were hardly touched on in these two pieces:</p><p>First, while Vissher and colleagues showed that you can greatly reduce the incidence of disease among genome-edited people, there are easier ways to reduce many of these diseases now or in the near-term future. Drugs, in general, are the cheapest and easiest therapeutics. I recognize that&#8217;s not true for everything, but I propose that in 30 years, we will likely see <em>substantial</em> progress in drug treatments for all of the diseases discussed, including Alzheimer&#8217;s. Alongside that, gene therapy for already-born individuals is also likely to achieve major progress. The <a href="https://psmag.com/social-justice/gene-therapy-no-longer-pipe-dream/">first FDA-approved gene therapy</a> came out in 2018, and many, many more are in the pipeline. Not only will technology to treat disease advance, but we&#8217;ll also likely see major progress in preventative treatments. As some of my Washington University colleagues <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC3343638/">wrote back in 2012</a>, &#8220;more than half of the cancer occurring today is preventable by applying knowledge that we already have.&#8221; By 2054, editing embryos to reduce disease may seem expensive and difficult compared to other the treatments and environmental interventions available.</p><p>And finally, the second barrier to a Gattaca society is the fact that if you are going to edit your embryo, you need to do IVF. Maybe someday we&#8217;ll edit embryos <em>in utero</em>, but that is a much more distant technology. It seems implausible that there will ever be enough people who voluntary conceive their children by IVF for embryo editing to have a big impact on human health or societal inequality. Sex is cheaper, easier, more spontaneous, and much more enjoyable, while at the same time, greater forces are driving inequality in society. Embryo editing will likely never be more than a niche factor.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Erwin Schrödinger and life at low copy number]]></title><description><![CDATA[Schr&#246;dinger's biological paradox, and how new technologies to measure protein binding to individual DNA molecules might address it.]]></description><link>https://www.thisgenomiclife.org/p/erwin-schrodinger-and-life-at-low</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/erwin-schrodinger-and-life-at-low</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Tue, 07 Jan 2025 19:56:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!zh8a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zh8a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zh8a!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic 424w, https://substackcdn.com/image/fetch/$s_!zh8a!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic 848w, https://substackcdn.com/image/fetch/$s_!zh8a!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic 1272w, https://substackcdn.com/image/fetch/$s_!zh8a!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zh8a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic" width="560" height="386.9230769230769" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1006,&quot;width&quot;:1456,&quot;resizeWidth&quot;:560,&quot;bytes&quot;:127244,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zh8a!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic 424w, https://substackcdn.com/image/fetch/$s_!zh8a!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic 848w, https://substackcdn.com/image/fetch/$s_!zh8a!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic 1272w, https://substackcdn.com/image/fetch/$s_!zh8a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff90f0481-5de8-4c3b-8fef-3ed45b967800_1546x1068.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Protein-binding footprint analysis of phage lambda, <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC386950/">Fig. 1</a> of  Brenowitz, <em>et al.</em> <em>PNAS</em> 1986 Nov;83(22):8462&#8211;8466.</figcaption></figure></div><p>One of the most profound questions that biology tries to answer is, how does the organized behavior of a living cell arise from the physical interactions of non-living components? The physicist Erwin Schr&#246;dinger posed this question in a particularly compelling way in a set of public lectures given in 1943, which were published the next year as the book <em>What is Life? </em>Writing about a decade before molecular biology emerged as a distinct field, Schr&#246;dinger basically said that physicists can&#8217;t explain the picture of life developed by geneticists:</p><blockquote><p>Today, thanks to the ingenious work of biologists, mainly of geneticists, during the last thirty or forty years, enough is known about the actual material structure of organisms and about their functioning to state that, and to tell precisely why, <strong>present-day physics and chemistry could not possibly account for what happens in space and time within a living organism</strong>. <em>What is Life</em>, p. 4, emphasis added.</p></blockquote><h3>Life shouldn&#8217;t work but it does</h3><p>Why couldn&#8217;t &#8220;present-day&#8221; (1943) physics and chemistry account for what happens within a living organism? Because the motion of individual atoms is dominated by random thermal motion, and this &#8220;does not allow the events that happen between a small number of atoms to enrol [sic] themselves according to any recognizable laws (p. 10).&#8221; The orderly behavior of matter, argues Schr&#246;dinger, is described by the physics of statistical mechanics and only holds for large collections of atoms. Thermodynamics, for example, gives a very accurate description of the pressure, volume, diffusion, and heat capacity of a gas at large n, but not for ~1000 atoms. And yet, &#8220;incredibly small groups of atoms, much too small to display exact statistical laws, do play a dominating role in the very orderly and lawful events within a living organism (p. 20).&#8221; </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><p>Statistical physics as <a href="https://sethna.lassp.cornell.edu/statistical_mechanics_entropy_order_parameters_and_complexity">come a long way</a> since 1943, but the primary challenge to reconciling physics with the workings of a cell is still this paradox that Schr&#246;dinger identified: cells exhibit remarkably organized behavior that arises from the interactions of an astonishingly small number of molecules. The promoters that regulate the expression of critical genes exist in just <em>two</em> copies in each typical cell, and those promoters are bound by a handful of transcription factor proteins that activate gene transcription. The results for the organism are quite reliable. For example, <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC9771235/">99.9% of all babies</a> are born with five fingers on each hand, and this depends in part on the proper functioning of the <a href="https://pubmed.ncbi.nlm.nih.gov/12837695/">ZRS enhancer</a>, a regulatory DNA element that is present in only two copies of each cell and bound by a relatively small number of transcription factors. And not only do we need to consider that each cell operates with only two copies of each gene and its associated regulatory elements, but also that all of us start out life as only a single cell. Life works amazingly well at low copy number.</p><p>I won&#8217;t belabor the point, but I don&#8217;t think we have answered Schr&#246;dinger&#8217;s paradox (the biological one, not the cat one) in a satisfactory way. To be absolutely clear, I am not claiming that there is something happening in the cell that violates the laws of physics. The fact that we can assemble, in a test tube, operational cellular subsystems from synthetic components shows that organized behavior is an intrinsic feature of the interactions between these components. But we don&#8217;t have the quantitative models that would, I think, have satisfied Schr&#246;dinger. Sure, molecular biologists have identified millions of different types of functional molecules at work in the cell, and we are steadily mapping how they interact with each other. But from a physical perspective, I don&#8217;t think we have a great account of how a heterogenous collection of proteins, nucleic acids, and other molecules organize themselves into a coherent operation. Part of the answer, not recognized by Schr&#246;dinger but clearly appreciated <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC528649/">by the 1950&#8217;s</a>, is <em>specificity</em>. Specific, non-covalent interactions among biological molecules are a big part of the answer to Schr&#246;dinger&#8217;s question. But we don&#8217;t have accurate, generally applicable physical models that apply to the kinetics and thermodynamics of a cell. The growing interest in physics of <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC7572565/">biomolecular condensates</a> is one area of the current frontier. (Here is another classic on this general topic, <a href="https://www.damtp.cam.ac.uk/user/tong/fluids/lowreynolds.pdf">&#8220;Life at low Reynolds number&#8221;</a> from 1976 (PDF)).</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/erwin-schrodinger-and-life-at-low?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/erwin-schrodinger-and-life-at-low?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h3>Life at low copy number</h3><p>This is a long preamble to some recent work describing a set of exciting new technologies that enable us to more directly tackle the big question that Schr&#246;dinger posed: single-fiber assays. If a critical feature of life is regulating genes, which are present at two copies per cell, then we need to understand the binding events that are happening at those individual DNA molecules. So much of cell biological data comes from bulk assays, assays that measure averages over enormous numbers of cells. Single-fiber assays let biologists zoom in and see how many factors are bound on individual copies of regulatory DNA, but they do this at scale using sequencing technology, rather than imaging individual cells.</p><p>One of the <a href="https://pubmed.ncbi.nlm.nih.gov/39567683/">papers</a> I&#8217;m most excited about came out in November, work of the labs of Lacra Bintu and Will Greenleaf at Stanford. (Free <em>bioRxiv</em> version <a href="https://pubmed.ncbi.nlm.nih.gov/38352517/">here</a>.) They used one version of this technology to measure the binding states of individual regulatory DNA elements. In a nutshell, the method works by treating DNA with a methylating enzyme that marks sites that are accessible to the enzyme. If a transcription factor is bound to that site, the methylating enzyme can&#8217;t get there, and thus the site remains unmarked. By looking at the distribution of marked and unmarked sites on individual DNA molecules, you get binding statistics. For example, you can ask, if I have four DNA binding sites for a regulatory factor, how often are all four sites bound, versus fewer or no sites? How about if I increase the number of binding sites to six or eight?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!g0X-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!g0X-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic 424w, https://substackcdn.com/image/fetch/$s_!g0X-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic 848w, https://substackcdn.com/image/fetch/$s_!g0X-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic 1272w, https://substackcdn.com/image/fetch/$s_!g0X-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!g0X-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic" width="1456" height="613" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:613,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:73593,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!g0X-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic 424w, https://substackcdn.com/image/fetch/$s_!g0X-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic 848w, https://substackcdn.com/image/fetch/$s_!g0X-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic 1272w, https://substackcdn.com/image/fetch/$s_!g0X-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d0a4dd-79fe-4bf8-bc1b-d153b65ab25a_1866x786.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Schematic of single-molecule footprinting. Accessible GC&#8217;s are methylated (lollipops) while bound sites (blue regions under purple transcription factors) are protected from methylation.</figcaption></figure></div><p>What the experiment gives you is the frequencies of the different occupancy states of a DNA molecule. There is a direct analogy here to statistical thermodynamics: just like molecules in a gas occupy different energy states as described by the Boltzmann distribution, DNA molecules exist in different protein-bound or -unbound states with probabilities that can be <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC386950/">described by a biophysical model</a>. But unlike a gas, in this case we&#8217;re observing the microstates of the system directly.</p><p>With data like this, the Bintu and Greenleaf labs were able to parameterize biophysical models that describe how protein factors compete and cooperate for binding to regulatory DNA. Using this approach, they drew some conclusions about the relationship between transcription factor occupancy and gene expression and how binding of individual factors to individual sites works (binding seems to be independent). I won&#8217;t walk through the full paper here, but I do want to highlight some other interesting papers using related technologies. Andrew Stergachis, at the University of Washington, pioneered the technique called <a href="https://pubmed.ncbi.nlm.nih.gov/32587015/">Fiber-seq</a> with Stirling Churchman (at Harvard) and John Stamatoyannopoulos (University of Washington). Stergachis, who came and spoke to our department late last year, has a new <a href="https://www.biorxiv.org/content/10.1101/2024.11.06.622310v1.full">preprint</a> out describing DAF-seq, a more versatile form of Fiber-seq that lets you focus in on specific DNA elements of interest.</p><p>The DAF-seq preprint by the Stergachis lab has a number of very cool technical improvements over prior versions of this assay, part of which involved developing a very efficient new enzyme to mark the DNA. They also perform a binding analysis to show that the E-box of the <em>NAPA</em> promoter (element 1 in the figure below) nucleates the binding for elements 2 and 3. In other words, protein binding to these sites is not only cooperative, but directional, with one site driving the occupancy of the other sites.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wz8N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wz8N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic 424w, https://substackcdn.com/image/fetch/$s_!wz8N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic 848w, https://substackcdn.com/image/fetch/$s_!wz8N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic 1272w, https://substackcdn.com/image/fetch/$s_!wz8N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wz8N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic" width="606" height="366.67994505494505" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:881,&quot;width&quot;:1456,&quot;resizeWidth&quot;:606,&quot;bytes&quot;:330451,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wz8N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic 424w, https://substackcdn.com/image/fetch/$s_!wz8N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic 848w, https://substackcdn.com/image/fetch/$s_!wz8N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic 1272w, https://substackcdn.com/image/fetch/$s_!wz8N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15a67a99-c738-42c8-b513-7889e4905834_1534x928.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Excerpt of Fig. 2a from Swanson, <em>et al.</em>, <em>bioRxiv</em> <a href="https://www.biorxiv.org/content/10.1101/2024.11.06.622310v1.full">https://www.biorxiv.org/content/10.1101/2024.11.06.622310v1.full</a>. CC-BY 4.0 International license.</figcaption></figure></div><p>The idea of using DNA-modifying enzymes to mark unbound sites of DNA has been around for awhile (see <a href="https://pubmed.ncbi.nlm.nih.gov/22960375/">here</a>, <a href="https://pubmed.ncbi.nlm.nih.gov/32042188/">here</a>, <a href="https://pubmed.ncbi.nlm.nih.gov/33290745/">here</a>, and <a href="https://pubmed.ncbi.nlm.nih.gov/28212748/">here</a> for a few of the important papers). As the technology becomes democratized, we&#8217;re going to see more work directly addressing the mechanisms of gene regulation, which get to the heart of Schr&#246;dinger&#8217;s biological paradox: how a handful of binding events at a single DNA molecule give rise to reproducible, orderly behavior in the face of disorderly, random thermal motion.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p></p><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[This week's finds in genomics and beyond]]></title><description><![CDATA[Five items of interest as we get back to work in the new year]]></description><link>https://www.thisgenomiclife.org/p/this-weeks-finds-in-genomics-and-4bd</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/this-weeks-finds-in-genomics-and-4bd</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 03 Jan 2025 16:56:36 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!DD8a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h3>1. An expanded registry of regulatory DNA elements</h3><p>One of the most profound findings of genomics is that a majority of the functional portion of our genome is <a href="https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1004525">devoted to regulatory sequence rather than genes</a> that code for proteins. (This result was <a href="https://pubmed.ncbi.nlm.nih.gov/1090005/">anticipated</a> before the Human Genome Project.) Thus one of the major efforts in the field has been to catalog all of these regulatory DNA elements, which is difficult because they a) don&#8217;t follow a clear code like the genetic code for protein-coding genes and b) are often very cell type-specific and are hard to discover unless you make your measurements in the correct cell type. And the easiest, most scalable measurements we can make to detect regulatory DNA elements are only indirect measures of function, such as chromatin accessibility or epigenetic state. Thus making an inventory of all regulatory elements in the human genome is a tall order.</p><p>A <a href="https://www.biorxiv.org/content/10.1101/2024.12.26.629296v1">new preprint</a> reports an updated registry of candidate regulatory DNA elements for the human and mouse genomes. Using new analyses of <a href="https://www.encodeproject.org">ENCODE data</a>, there are now 2.35 million human sequences that qualify as candidate regulatory elements. This is one of the most thorough and systematic analyses of regulatory elements across tissues.</p><p>There are a few notable features of the updated registry: More than half of these elements are not close to the transcription start sites of genes, meaning that act at a distance to control gene expression. 90% of the candidate elements (identified by indirect measures of function) have been tested by some sort of direct functional assay (which helps one decide whether to move an element out of the &#8220;candidate&#8221; category). This to me is a surprisingly high number and a nice sign of progress. Also, this update includes a big set of silencers &#8212; repressive regulatory elements that have been under-studied and which are not well understood. </p><p>The updated registry is a useful resource, but it&#8217;s important to keep some important limitations in mind: regulatory elements from many cell types are still not represented  because the measurements haven&#8217;t been done, and the assays that directly measure regulatory function are still heavily weighted towards manipulable cell lines, rather than primary cells. In other words, there is still substantial work to do before we achieve a compete inventory of the most abundant type of functional element in the human genome.</p><h3>2. On the meaning of &#8220;post-genomic&#8221;</h3><p>One of my goals for 2025 is to work out a better science media diet, now that science Twitter has fragmented into too many sub-communities to keep track of. My old &#8216;must-read&#8217; book mark folder has links to a bunch of dead blogs and publications that no longer publish anything that captures my interest. Trying to keep track of posts from all of the people I follow on Xitter, Bluesky, Substack notes, and LinkedIn usually leaves me disoriented.</p><p><em><a href="https://undark.org/who-is-undark/">Undark</a></em>, an independent publication supported by MIT&#8217;s Knight Science Journalism Fellowship Program is an outlet that has flown under my radar, but which looks fantastic. It&#8217;s going on my daily must-read list for 2025. A <a href="https://undark.org/2024/12/05/opinion-post-genomic-age/">recent piece</a> by Yale biologist C. Brandon Ogbunu asks &#8220;What does it mean to be in the &#8216;post-genomic&#8217; age?&#8221;Ogbunu discusses the kinds of problems biologists should be thinking about in this era in which sequence data no longer seems to be rate-limiting.</p><blockquote><p>In today&#8217;s world, <em>post-genomic</em> is built on two important ideas: that, as previously mentioned, doing genomics is easier than ever, and that genetic information is not enough. Post-genomic embodies a world where we can and should focus on the next big (or small) revolutionary ideas in the study of the biological world.</p></blockquote><p>&#8220;Genetic information is not enough&#8221; &#8212; while it may seem obvious, it&#8217;s also a good prompt for creative ideas. If genetic information is not enough to understand your disease/system of interest, and if &#8220;doing genomics is easier than ever&#8221;, how should we frame our questions? Will the solutions require more large consortia, or are &#8216;omic technologies sufficiently democratized so that important advances will be most likely made in individual labs?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h3>3. How would you explain heritability to a fifth grader?</h3><p>Harvard statistical geneticist Sasha Gusev explains how to explain heritability, and much else, in an <a href="https://www.psychiatrymargins.com/p/a-critical-introduction-to-behavioral?utm_campaign=post&amp;utm_medium=web">interesting interview</a> with Awais Aftab, who write the Substack <em>Psychiatry at the Margins</em>. Those of us working in genomics these days come from many different disciplinary backgrounds (I trained in biochemistry), and the grasp of key statistical genetic concepts in the broader community is uneven, to put it charitably. And get those statistical concepts are critical in the work that the genomics community does. What does a GWAS association really mean and what mechanisms could produce one? What does it mean to say that there is &#8216;missing heritability&#8217;? I don&#8217;t think our current biomedical PhD programs do a great job teaching non-statistical geneticists these concepts.</p><p>Gusev also discusses some fascinating ongoing challenges in the field, so even if you are a statistical geneticist, this interview is worth checking out. And if you want more, Gusev writes the excellent Substack <em><a href="https://theinfinitesimal.substack.com">The Infinitesimal</a>.</em></p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:153052532,&quot;url&quot;:&quot;https://www.psychiatrymargins.com/p/a-critical-introduction-to-behavioral&quot;,&quot;publication_id&quot;:1201860,&quot;publication_name&quot;:&quot;Psychiatry at the Margins&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F30f5d1be-a3e1-4571-9ac3-93672932c080_600x600.png&quot;,&quot;title&quot;:&quot;A Critical Introduction to Behavioral Genetics: Q&amp;A with Sasha Gusev&quot;,&quot;truncated_body_text&quot;:&quot;Sasha (Alexander) Gusev is a statistical geneticist and an Associate Professor of Medicine at Harvard Medical School and the Dana-Farber Cancer Institute. His work involves the development of statistical methods for making sense of disease mechanisms and heritability from Genome-Wide Association Studies. He blogs about the genetics of complex traits at&quot;,&quot;date&quot;:&quot;2024-12-14T14:02:08.940Z&quot;,&quot;like_count&quot;:55,&quot;comment_count&quot;:17,&quot;bylines&quot;:[{&quot;id&quot;:18723016,&quot;name&quot;:&quot;Awais Aftab&quot;,&quot;handle&quot;:&quot;awaisaftab&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F595b3363-046e-4623-887b-84b0fabfe8e6_2499x2499.jpeg&quot;,&quot;bio&quot;:&quot;Psychiatrist with philosophical interests. My first book &#8220;Conversations in Critical Psychiatry&#8221; is out from Oxford University Press!&quot;,&quot;profile_set_up_at&quot;:&quot;2021-04-28T19:37:43.610Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:1156652,&quot;user_id&quot;:18723016,&quot;publication_id&quot;:1201860,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:1201860,&quot;name&quot;:&quot;Psychiatry at the Margins&quot;,&quot;subdomain&quot;:&quot;awaisaftab&quot;,&quot;custom_domain&quot;:&quot;www.psychiatrymargins.com&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Exploring critical, philosophical, and scientific debates in psychiatric practice and the psy-sciences&quot;,&quot;logo_url&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/30f5d1be-a3e1-4571-9ac3-93672932c080_600x600.png&quot;,&quot;author_id&quot;:18723016,&quot;theme_var_background_pop&quot;:&quot;#9D6FFF&quot;,&quot;created_at&quot;:&quot;2022-11-19T17:23:28.187Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Awais Aftab&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}}],&quot;twitter_screen_name&quot;:&quot;awaisaftab&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100},{&quot;id&quot;:247615449,&quot;name&quot;:&quot;Sasha Gusev&quot;,&quot;handle&quot;:&quot;sashagusev&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b4804eb1-12db-4de5-9684-ced516e029c4_400x400.jpeg&quot;,&quot;bio&quot;:&quot;Statistical geneticist. Associate Professor of Medicine at Dana-Farber Cancer Institute / Harvard Medical School&quot;,&quot;profile_set_up_at&quot;:&quot;2024-06-19T14:09:36.914Z&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;primaryPublicationId&quot;:2719736,&quot;primaryPublicationName&quot;:&quot;The Infinitesimal&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://theinfinitesimal.substack.com&quot;,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://theinfinitesimal.substack.com/subscribe?&quot;}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://www.psychiatrymargins.com/p/a-critical-introduction-to-behavioral?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!grCP!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F30f5d1be-a3e1-4571-9ac3-93672932c080_600x600.png" loading="lazy"><span class="embedded-post-publication-name">Psychiatry at the Margins</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">A Critical Introduction to Behavioral Genetics: Q&amp;A with Sasha Gusev</div></div><div class="embedded-post-body">Sasha (Alexander) Gusev is a statistical geneticist and an Associate Professor of Medicine at Harvard Medical School and the Dana-Farber Cancer Institute. His work involves the development of statistical methods for making sense of disease mechanisms and heritability from Genome-Wide Association Studies. He blogs about the genetics of complex traits at&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 55 likes &#183; 17 comments &#183; Awais Aftab and Sasha Gusev</div></a></div><h3>4. Great reading ideas for 2025</h3><p>Aside from getting in shape, a reading plan seems to be one of the most common New Year&#8217;s resolutions. I enjoyed this list of <a href="https://jaredhenderson.substack.com/p/9-reading-ideas-for-2025">9 Reading Ideas for 2025</a> from Jared Henderson who writes <em><a href="https://jaredhenderson.substack.com">Commonplace Philosophy</a></em>. Many of these ideas can be applied to scientific papers as well. For example: join a book (or journal) club, pick an author (or lab) and read the complete works, pick a topic and really master it. I like to read in stacks, such a bunch of books on the scientific revolution or general relativity. I&#8217;m trying to do the same thing with papers now, rather than only haphazardly reading whatever just came out in <em>Cell</em> or <em>bioRxiv</em>.</p><p>How do you master a topic? There is more to it than just reading (take notes, organize the information in your mind, write about it, even if just for yourself). But to narrow the question to just reading, how many papers should you read to master a topic? The advice from my director of graduate studies to those of us prepping for our qualifying exams was good: Identify 20 critical papers and know them in-depth. Along with that, read more quickly another ~100 other papers in the field. The process of identifying those 120 papers will already be a great start towards deeply understanding the field.</p><p>The final piece of advice from Henderson is to <em>actually read the books you buy</em>. (I admit I have a problem.) The same could be said for dozens or hundreds of the browser tabs many of us have open: just read the paper &#8212; or close the tab if you decide it&#8217;s not worth the time.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/this-weeks-finds-in-genomics-and-4bd?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/this-weeks-finds-in-genomics-and-4bd?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h3>5. August Weissmann is underrated in the history of biology</h3><p>On that last piece of reading advice, sitting on my shelf is a <a href="https://www.hup.harvard.edu/books/9780674736894">door-stop biography</a> of the pioneering 19th century biologist August Weissmann. I&#8217;m finally going to read it this year. When people think about 19th century biology, Darwin gets an overwhelmingly disproportionate share of the attention, followed by Mendel as a distant second. Weissmann deserved much more, especially given the major influence of genetics and developmental biology on today&#8217;s biological sciences. Weissman developed important ideas about heredity and development well before the 20th century rediscovery of Mendelian genetics. He was also a superb experimentalist.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DD8a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DD8a!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic 424w, https://substackcdn.com/image/fetch/$s_!DD8a!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic 848w, https://substackcdn.com/image/fetch/$s_!DD8a!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic 1272w, https://substackcdn.com/image/fetch/$s_!DD8a!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DD8a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic" width="377" height="513.305114638448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:772,&quot;width&quot;:567,&quot;resizeWidth&quot;:377,&quot;bytes&quot;:73411,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DD8a!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic 424w, https://substackcdn.com/image/fetch/$s_!DD8a!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic 848w, https://substackcdn.com/image/fetch/$s_!DD8a!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic 1272w, https://substackcdn.com/image/fetch/$s_!DD8a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd440ceba-fad8-4df8-b2f6-3b0ee9122ac5_567x772.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here is the book summary from Harvard University Press:</p><blockquote><p>The evolutionist Ernst Mayr considered August Weismann &#8220;one of the great biologists of all time.&#8221; Yet the man who formulated the germ plasm theory&#8212;that inheritance is transmitted solely through the nuclei of the egg and sperm cells&#8212;has not received an in-depth historical examination. <em>August Weismann</em>reintroduces readers to a towering figure in the life sciences. In this first full-length biography, Frederick Churchill situates Weismann in the swirling intellectual currents of his era and demonstrates how his work paved the way for the modern synthesis of genetics and evolution in the twentieth century.</p><p>In 1859 Darwin&#8217;s tantalizing new idea stirred up a great deal of activity and turmoil in the scientific world, to a large extent because the underlying biological mechanisms of evolution through natural selection had not yet been worked out. Weismann&#8217;s achievement was to unite natural history, embryology, and cell biology under the capacious dome of evolutionary theory. In his major work on the germ plasm (1892), which established the material basis of heredity in the &#8220;germ cells,&#8221; Weismann delivered a crushing blow to Lamarck&#8217;s concept of the inheritance of acquired traits.</p><p>In this deeply researched biography, Churchill explains the development of Weismann&#8217;s pioneering work based on cytology and embryology and opens up an expanded history of biology from 1859 to 1914. <em>August Weismann </em>is sure to become the definitive account of an extraordinary life and career.</p></blockquote><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Technologies That Made The Human Genome Project Possible]]></title><description><![CDATA[An oral history of key, early technological advances made in St. Louis that first "instantiated" genomics.]]></description><link>https://www.thisgenomiclife.org/p/technologies-that-made-the-human</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/technologies-that-made-the-human</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Tue, 31 Dec 2024 00:34:54 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!4VUg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Note: This post exceeds the email length limit for Substack. To read the full transcript of the talk presented below, click through to the post on the web. The highlights are presented first.</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4VUg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4VUg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic 424w, https://substackcdn.com/image/fetch/$s_!4VUg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic 848w, https://substackcdn.com/image/fetch/$s_!4VUg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic 1272w, https://substackcdn.com/image/fetch/$s_!4VUg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4VUg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic" width="460" height="309" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:309,&quot;width&quot;:460,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:39402,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4VUg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic 424w, https://substackcdn.com/image/fetch/$s_!4VUg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic 848w, https://substackcdn.com/image/fetch/$s_!4VUg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic 1272w, https://substackcdn.com/image/fetch/$s_!4VUg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F965b46f0-ec49-4125-b52b-e13480e8ecba_460x309.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The McDonnell Sciences Building at Washington University in St. Louis</figcaption></figure></div><p>Last spring, my colleagues and I <a href="https://genetics.wustl.edu/recorded-sessions-of-the-gary-d-stormo-symposium-on-computational-biology/">organized a symposium</a> in honor of our colleague Gary Stormo, a pioneer of computational biology who (nominally) retired last year. (He still has some papers in the works.) The outstanding closing talk was given by Mark Johnston, former chair of the Washington University Genetics Department and now emeritus chair of the Department of Biochemistry and Genetics at the University of Colorado School of Medicine. In his talk, Mark made a bold claim: that our Genetics Department here in St. Louis was the &#8220;founder&#8221; of genomics in its modern sense. Mark argues that it was here &#8212; specifically on the eighth floor of the McDonnell Sciences building where the Genetics Department was housed&#8212; that genomics was first instantiated.</p><p>Those from other institutions who are less partisan than I am may dispute this claim, but no matter where you come down on it, Mark Johnston&#8217;s talk is a fascinating oral history of the technological advances that were needed to make genome sequencing possible. (Read Mark&#8217;s talk and you&#8217;ll learn about the early St. Louis careers of several important figures who made Seattle&#8217;s University of Washington a genomics powerhouse. Our department also trained the current director of the National Human Genome Research Institute.) Imagine a scientist in the early 1980&#8217;s considering the daunting prospect of sequencing the entire human genome. Sequencing very short segments of DNA was a tedious process that involved manually examining bands on a polyacrylamide gel. Sequencing hundreds of base pairs was daunting, and the technology to sequence billions of base pairs did not exist.</p><p>Today, I can submit my sample to our sequencing core and get back a few hundred billion bases of sequencing in a couple of days. This capacity is, and will continue to be transformative, but how did it originate? Mark Johnston covers some of the critical early developments that anyone interested in genomics should learn about. Below I&#8217;ve highlighted the key landmark papers/technologies that Mark presents in his talk. If you read these dozen or so papers, you will have an excellent understanding of the key technologies that made the Human Genome Project possible. You can watch the video and read over the annotated and lightly edited transcript below. Mark was the person who drew me to Washington University  few years after the draft human genome sequence was published. In the talk below you can hear his wonderfully dry sense of humor, his intelligence, and his very clear sense of the history of this field. It&#8217;s an inspiring story </p><h3>Landmark 1: Restriction fragment clone mapping</h3><p>Genomes aren&#8217;t sequenced in order, from the start to end of individual chromosomes. Your sequencing data comes in shorter fragments. To sequence a genome, you need a map. Work by Maynard Olson at Washington University in St. Louis on <em>S. cerevisiae</em>, and complementary work at the MRC in Cambridge mapping the genome of <em>C. elegans</em> showed to create these maps.</p><p><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC386815/">Random-clone strategy for genomic restriction mapping in yeast</a><strong>, </strong>Olson, <em>et al.</em>,<strong> </strong><em>PNAS, </em>1986 Oct; 3(20):7826-30.</p><p><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC386814/">Toward a physical map of the genome of the nematode Caenorhabditis elegans</a>, Coulson, <em>et al.</em>, <em>PNAS</em> 1986 Oct; 83(20): 7821&#8211;7825.</p><h3>Landmark 2: Yeast Artificial Chromosomes</h3><p>Restriction fragment clones are good for relatively short range sequencing, but they aren&#8217;t capable of mapping sequences on the scale of millions of base pairs. Larger clones for mapping were made possible by the development of yeast artificial chromosomes (YACs) in Maynard Olson&#8217;s lab.</p><p><a href="https://pubmed.ncbi.nlm.nih.gov/3033825/">Cloning of large segments of exogenous DNA into yeast by means of artificial chromosome vectors</a>, Burke, Carle, and Olson, <em>Science</em> May 15;236(4803):806-12.</p><h3>Landmark 3: Sequence-tagged Sites</h3><p>Eric Green, the current director of the National Human Genome Institute at the NIH, did critical work with Maynard Olson to develop a clever mapping method called sequence-tagged sites. The technique involves using PCR primers to identify landmarks in cloned fragments.</p><p><a href="https://pubmed.ncbi.nlm.nih.gov/2781285/">A common language for physical mapping of the human genome</a>, Olson, Hood, Cantor, and Botstein, <em>Science</em> 1989 Sep 29;245(4925):1434-5.</p><p><a href="https://pubmed.ncbi.nlm.nih.gov/2218515/">Chromosomal region of the cystic fibrosis gene in yeast artificial chromosomes: a model for human genome mapping</a>, Green and Olson, <em>Science</em> 1990 Oct 5;250(4977):94-8.</p><h3>Landmark 4: Automated 4-color sequencing</h3><p>Automating DNA sequencing was a critical step to scale up capacity. With automated methods to read sequencing output, you also needed computational methods to assess the quality of that output. Phil Green developed a prototype of these methods (notably PHRED) here at Washington University, before he moved to the University of Washington in Seattle, along with a number of other former member of the WashU Genetics Department.</p><p><a href="https://pubmed.ncbi.nlm.nih.gov/3713851/">Fluorescence detection in automated DNA sequence analysis</a>, Smith, <em>et al</em>., <em>Nature</em> 1986 Jun;321(6071):674-9.</p><p><a href="https://pubmed.ncbi.nlm.nih.gov/9521922/">Base-calling of automated sequencer traces using phred. II. Error probabilities</a>, Ewing and Green, <em>Genome Res.</em> 1998 Mar;8(3):186-94.</p><p><a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC331543/">An improved method for photofootprinting yeast genes in vivo using Taq polymerase</a>, Axelrod and Majors, <em>Nucleic Acids Res.</em> 1989 Jan 11;17(1):171-83.</p><h3> Yeast genome sequenced</h3><p>As Mark Johnston explains, it was necessary to sequence the yeast genome because, if you were going to use YACs in yeast as your physical vector for <em>C. elegans</em> or human genome sequencing, you needed to identify the yeast sequences so that you could throw then away. The yeast genome was the first sequenced eukaryotic genome.</p><p><a href="https://pubmed.ncbi.nlm.nih.gov/8091229/">Complete nucleotide sequence of Saccharomyces cerevisiae chromosome VIII</a>, Johnston, <em>et al</em>., <em>Science </em>1994 Sep 30;265(5181):2077-82.</p><p><a href="https://pubmed.ncbi.nlm.nih.gov/8849441/">Life with 6000 genes</a>, Goffeau, <em>et al.</em>, <em>Science</em> 1996 Oct 25;274(5287):546, 563-7.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><p>Watch Mark&#8217;s talk below:</p><div id="youtube2-4KBdG2Wq5m4" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;4KBdG2Wq5m4&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/4KBdG2Wq5m4?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h4>Transcript of &#8220;Genomics Instantiated: A View from the Front Row&#8221;, Mark Johnston, May 21, 2023, Washington University in St. Louis Gary D. Stormo Symposium</h4><p>My talk today really has nothing to do with Gary. That's because I don't know any computational biology and all the other speakers before me have amply phrased Gary's accomplishments and his contributions to our field. So I won't do that. I'll just preface my remarks by saying that I know that Gary was well known from the very beginning of the field. I was a graduate student in 1978. We were doing some sequencing and we had a little bit of sequence you know 100 bases or something and we didn't know how to analyze it. And I remember somebody telling me, I can't remember who, but I clearly remember them saying, &#8220;Oh you should talk to this guy Stormo in Colorado. He knows how to analyze sequence.&#8221; I never found his email address so I wasn't able to get in touch with him.</p><p>What I want to do today is to give you a history of why Washington University and in particular the Genetics Department here was really the founder of the field of Genomics. So my talk is &#8220;Genomics Instantiated.&#8221; Now that word &#8220;instantiated&#8221;, academics love that word. I've seen it all over the place. I've never used it and I'm an academic and I've always felt bad that I've never been able to never had an opportunity to use the word instantiated. So I want to thank the organizers for inviting me to give me the chance to actually use this word for the very first time now. Of course I had to look it up because I don't really know what it means. I had to look it up. And I looked up in the dictionary: &#8220;Instantiate -&nbsp; to represent an abstraction by a concrete instance.&#8221;</p><p>What I'm going to present to you today is my contention that the first instantiation of genomics happened right here up on the eighth floor of McDonnell Sciences building. I'm going to tell you the story of how that came to be and how WashU came to be known as a DNA sequencing Power, and my view from the front row. I had the privilege to observe all of this. I saw the whole story from the very front row, so I'll tell you my view of what of how it all happened.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/technologies-that-made-the-human?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/technologies-that-made-the-human?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p>So back then, this is the &#8216;80s we're talking about, the goal was to map genomes. And so you have your genome here and the goal is to get a single-nucleotide resolution map of the genome. Well, back in the 80s of course you couldn't do that directly. You really can't quite even do it now directly. What you need is to map the genome first before you get it down to single-nucleotide resolution. You have to cut the genome up into smaller pieces, figure out where they lie in the genome, and then you can get your single-nucleotide resolution map. And it was those methods that you need to map the genome.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Vk-n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Vk-n!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic 424w, https://substackcdn.com/image/fetch/$s_!Vk-n!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic 848w, https://substackcdn.com/image/fetch/$s_!Vk-n!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic 1272w, https://substackcdn.com/image/fetch/$s_!Vk-n!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Vk-n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic" width="347" height="503.65914221218964" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:643,&quot;width&quot;:443,&quot;resizeWidth&quot;:347,&quot;bytes&quot;:23694,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Vk-n!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic 424w, https://substackcdn.com/image/fetch/$s_!Vk-n!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic 848w, https://substackcdn.com/image/fetch/$s_!Vk-n!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic 1272w, https://substackcdn.com/image/fetch/$s_!Vk-n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F728163ec-935a-4abc-8c85-6be8941fd4a1_443x643.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>And how to do that was really instantiated here in the Department of Genetics by this guy Maynard Olson, a faculty member who arrived here I think in 1978. And Maynard was really the first one to do genomics, the first one to study an organism on the whole genome scale. And what Maynard did was to to map clones of the yeast genome. He was the first one, along with John Sulston, who you'll hear about in just a minute here, to lay out the need, plus the mechanism of the approach for <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC386815/">how to map clones to genomes</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wcCm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wcCm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic 424w, https://substackcdn.com/image/fetch/$s_!wcCm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic 848w, https://substackcdn.com/image/fetch/$s_!wcCm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic 1272w, https://substackcdn.com/image/fetch/$s_!wcCm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wcCm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic" width="604" height="186.86803519061584" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:211,&quot;width&quot;:682,&quot;resizeWidth&quot;:604,&quot;bytes&quot;:27823,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wcCm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic 424w, https://substackcdn.com/image/fetch/$s_!wcCm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic 848w, https://substackcdn.com/image/fetch/$s_!wcCm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic 1272w, https://substackcdn.com/image/fetch/$s_!wcCm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c964e5d-20fe-4a25-973a-a0a4425c39d4_682x211.heic 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>What Maynard did was simply do a restriction site digest of a large number of clones and measure the fragment sizes. So here's a bunch of clones of the yeast genome cut with EcoRI and HindIII, run out on a gel to size the fragments. And you just count the sizes of the fragments. If you do that enough times to enough clones, you can identify clones that overlap. They share fragments of the same lengths, and so those clones must overlap. So if you're clever enough to be able to run gels that can accurately size DNA fragments, and then you're clever enough to write algorithms that can handle the data and put it all together, you end up with something that looks like this a clone map of the genome, where the horizontal lines are individual clones, the hash marks on the clones are EcoRI and HindIII sites. And so you can generate a series of clones that map physically map the genome, and Maynard was the first guy to do this.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3Xbu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3Xbu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic 424w, https://substackcdn.com/image/fetch/$s_!3Xbu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic 848w, https://substackcdn.com/image/fetch/$s_!3Xbu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic 1272w, https://substackcdn.com/image/fetch/$s_!3Xbu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3Xbu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic" width="494" height="528.1025641025641" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1084,&quot;width&quot;:1014,&quot;resizeWidth&quot;:494,&quot;bytes&quot;:107490,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3Xbu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic 424w, https://substackcdn.com/image/fetch/$s_!3Xbu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic 848w, https://substackcdn.com/image/fetch/$s_!3Xbu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic 1272w, https://substackcdn.com/image/fetch/$s_!3Xbu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ea6d730-89a3-4ecb-8973-430dfc0fdc9d_1014x1084.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Physically mapping the yeast genome. Figure 4 from Olson, <em>et al.</em>,<strong> </strong><em>PNAS, </em>1986 Oct; 3(20):7826-30.</figcaption></figure></div><p>And he was doing it at a time when nobody could understand why you'd want to do that. I remember a very well-known scientist, a very smart guy who said to me who the hell cares where all the EcoRI and HindIII sites are in the yeast genome, I don't really care. Well, the reason you care is because you need a map of a genome to get a sequence of a genome, and all that follows from that.</p><p>Okay, so Maynard was the first one to map genomes. Maynard was working at this level, in the range of clones of about tens of kb in phage vectors and plasmid vectors and later on in cosmid vectors. He wasn't working at the larger scale of thousands and hundreds to thousands of kilobases of DNA. So now comes the key event that really led to WashU having a <a href="https://genome.wustl.edu">Genome Sequencing Center</a>. You all know that it's much easier to put together a puzzle with a few number of pieces than a puzzle with lots of pieces, right? A thousand-piece puzzle is much harder to put together than a hundred-piece puzzle, and Maynard was working in this range, and then comes the key event. </p><p>Bob Waterston, Maynard's colleague and my colleague up on the eighth floor of McDonnell Sciences Building, goes off in the mid 80s, &#8216;85 I think, &#8216;86, to do a sabbatical with John Sulston. And Bob's over there working on his muscle genes in  John's lab, watching John map the <em>C. elegans</em> genome. John was doing the same thing Maynard was doing, trying to get a map of the <em>C. elegans </em>genome, eventually for the purpose of sequencing it. And John was also mired in this area of small fragments. He had 750 contigs and there were 750 gaps in the clone map, when he should have had seven. And he just couldn't get past those 750 gaps in the DNA sequence. Bob is watching John be frustrated with that. </p><p>Bob comes back to St Louis and he talks to these two guys, David Burke and George Carl. They were graduate students in Maynard's lab, and they told Bob about work they were doing to generate yeast artificial chromosomes. These were vectors that would take large pieces of DNA, and Bob learned about this before it was published. Bob goes back to John Sulston&#8217;s Lab in Cambridge England and begins to work with John on closing those 750 gaps in the genome sequence. And that's really what got Bob into DNA sequencing, which is what brought a genome sequencing center with hundreds of millions of dollars in grant income to Washington University. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-BK5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-BK5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic 424w, https://substackcdn.com/image/fetch/$s_!-BK5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic 848w, https://substackcdn.com/image/fetch/$s_!-BK5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic 1272w, https://substackcdn.com/image/fetch/$s_!-BK5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-BK5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic" width="632" height="220.5054945054945" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:508,&quot;width&quot;:1456,&quot;resizeWidth&quot;:632,&quot;bytes&quot;:63075,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-BK5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic 424w, https://substackcdn.com/image/fetch/$s_!-BK5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic 848w, https://substackcdn.com/image/fetch/$s_!-BK5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic 1272w, https://substackcdn.com/image/fetch/$s_!-BK5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac188b3f-7d87-4e69-b72a-346d7e0d6d68_1764x616.heic 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Burke, Carle, and Olson, <em>Science</em> May 15;236(4803):806-12</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-9jH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-9jH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic 424w, https://substackcdn.com/image/fetch/$s_!-9jH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic 848w, https://substackcdn.com/image/fetch/$s_!-9jH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic 1272w, https://substackcdn.com/image/fetch/$s_!-9jH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-9jH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic" width="554" height="477.96078431372547" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:792,&quot;width&quot;:918,&quot;resizeWidth&quot;:554,&quot;bytes&quot;:61010,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-9jH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic 424w, https://substackcdn.com/image/fetch/$s_!-9jH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic 848w, https://substackcdn.com/image/fetch/$s_!-9jH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic 1272w, https://substackcdn.com/image/fetch/$s_!-9jH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b426840-1982-43b6-a5b4-0a52af9849c5_918x792.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Strategy for yeast artificial chromosomes. Fig. 1 from Burke, Carle, and Olson.</figcaption></figure></div><p>Okay so there's another landmark [to talk about]. We have clone mapping from Maynard and John Sulston, we have yeast artificial chromosomes from David Burke and George Carl, again working in Maynard&#8217;s lab, and there's a third major landmark in genome mapping that also came out of Maynard&#8217;s lab called sequence-tagged sites. Maynard, in 1989, publishes <a href="https://pubmed.ncbi.nlm.nih.gov/2781285/">this paper</a>, with a few other people to give it credence I guess, &#8220;A common language for physical mapping of the human genome.&#8221;  </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!00Tq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!00Tq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic 424w, https://substackcdn.com/image/fetch/$s_!00Tq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic 848w, https://substackcdn.com/image/fetch/$s_!00Tq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic 1272w, https://substackcdn.com/image/fetch/$s_!00Tq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!00Tq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic" width="574" height="401.8" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:658,&quot;width&quot;:940,&quot;resizeWidth&quot;:574,&quot;bytes&quot;:45219,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!00Tq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic 424w, https://substackcdn.com/image/fetch/$s_!00Tq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic 848w, https://substackcdn.com/image/fetch/$s_!00Tq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic 1272w, https://substackcdn.com/image/fetch/$s_!00Tq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f1646af-e041-470c-a1ff-b1e93db78539_940x658.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Olson, <em>et al.</em>, <em>Science</em> 1989 Sep 29;245(4925):1434-5</figcaption></figure></div><p>Sequence-tagged sites are simply PCR priming sites that will yield a PCR product. So you simply get random sequence from a bunch of clones, design primers, and if those primers give a product, amplify a sequence, then that means the clone that was the template has those sequences in it. So it simply sequenced landmarks of about 100 to a thousand base pairs. Sequence-tagged sites, really a brilliant idea, quite simple, which as most brilliant ideas are, they're brilliant after you know what the idea is.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pfCc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pfCc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic 424w, https://substackcdn.com/image/fetch/$s_!pfCc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic 848w, https://substackcdn.com/image/fetch/$s_!pfCc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic 1272w, https://substackcdn.com/image/fetch/$s_!pfCc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pfCc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic" width="522" height="388.15384615384613" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:522,&quot;width&quot;:702,&quot;resizeWidth&quot;:522,&quot;bytes&quot;:39437,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pfCc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic 424w, https://substackcdn.com/image/fetch/$s_!pfCc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic 848w, https://substackcdn.com/image/fetch/$s_!pfCc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic 1272w, https://substackcdn.com/image/fetch/$s_!pfCc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59db0aa8-0a24-49e7-8364-e2aca01106d3_702x522.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sequence-tagged sites. Fig. 1A from Green and Olson, <em>Science</em> 1990 Oct 5;250(4977):94-8.</figcaption></figure></div><p>The guy who put sequence-tagged sites into practice, who instantiated sequence-tagged sites, was this guy Eric Green, a postdoc with Maynard. Eric was a student here, an M.D./P.h.D. student, got his degree in immunology, and decided he wanted to go into genomics and join Maynard's lab. And he really put into practice Maynard's idea of sequence tagged-sites and ended up generating, by 1990, <a href="https://pubmed.ncbi.nlm.nih.gov/2218515/">a map</a> of a region of the human genome. It was a model for how you map the human genome with sequence-tagged sites. Here's all the clones [image below], the horizontal lines, and now those dots on the clones are sequence-tagged sites, PCR priming sites that are present in those clones. You generate a map of the genome that way. Eric went on to become the second director of The National Human Genome Research Institute where he has been the director now for 15 years.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5_us!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5_us!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic 424w, https://substackcdn.com/image/fetch/$s_!5_us!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic 848w, https://substackcdn.com/image/fetch/$s_!5_us!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic 1272w, https://substackcdn.com/image/fetch/$s_!5_us!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5_us!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic" width="588" height="451.51532846715327" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1052,&quot;width&quot;:1370,&quot;resizeWidth&quot;:588,&quot;bytes&quot;:102735,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5_us!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic 424w, https://substackcdn.com/image/fetch/$s_!5_us!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic 848w, https://substackcdn.com/image/fetch/$s_!5_us!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic 1272w, https://substackcdn.com/image/fetch/$s_!5_us!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F062b7e9a-431b-4a5e-b973-67e33ba74216_1370x1052.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Fig. 5 from Green and Olson, <em>Science </em>1990 Oct 5;250(4977):94-8.</figcaption></figure></div><p>So genomics was born in the [Washington University] Genetics Department, was extended by the Sequencing Center in the Genetics Department, and then we exported it to the National Human Genome Research Institute. Mr. McDonnell [James S, McDonnell] would have been proud that his money invested in the Genetics Department really came to fruition.</p><p>Okay, so now we got a map, and now we're ready to do the sequencing. You all know how this works, right, di-deoxynucleotides get incorporated, cause chain termination,  you generate a family of molecules that have each stopped at a nucleotide, you separate those by their size on a gel and you read the sequence off. Now this was how it started for <em>C. elegans</em> sequencing. I remember them developing films and reading off the sequence. I remember you know fishing gels out of the fixer and reading the sequence before they were even dry, because you  couldn't wait to to see your exciting sequence come off. Now of course you're not going to be able to sequence a genome that way. You&#8217;ve got to have some automated way to do it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!76sW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!76sW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic 424w, https://substackcdn.com/image/fetch/$s_!76sW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic 848w, https://substackcdn.com/image/fetch/$s_!76sW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic 1272w, https://substackcdn.com/image/fetch/$s_!76sW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!76sW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic" width="186" height="373.10059171597635" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:678,&quot;width&quot;:338,&quot;resizeWidth&quot;:186,&quot;bytes&quot;:20162,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!76sW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic 424w, https://substackcdn.com/image/fetch/$s_!76sW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic 848w, https://substackcdn.com/image/fetch/$s_!76sW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic 1272w, https://substackcdn.com/image/fetch/$s_!76sW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1b90cdf-79e8-4600-9801-ef55cc7b0bc8_338x678.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sequencing gel, image by John Schmidt via <a href="https://commons.wikimedia.org/wiki/File:Sequencing.jpg">Wikimedia Commons</a>.</figcaption></figure></div><p>So a great leap forward came with the development of <a href="https://pubmed.ncbi.nlm.nih.gov/3713851/">fluorescently-tagged DNA</a> for sequencing where you put on your di-deoxys some dyes that report a color, and now you can load all of those reactions into one lane and simply read off the sequence from the color of each of those bands. So that was a major advance that allowed automation and sort of portended that the sequencing the human genome was going to be possible. You get these kinds of kinds of gels where you read off the sequence and here's really what the readout was. These colored peaks are the peaks of those bands coming off the gel, and you just simply read the sequence off from the color.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3IBr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3IBr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic 424w, https://substackcdn.com/image/fetch/$s_!3IBr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic 848w, https://substackcdn.com/image/fetch/$s_!3IBr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic 1272w, https://substackcdn.com/image/fetch/$s_!3IBr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3IBr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic" width="588" height="356.1923076923077" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:882,&quot;width&quot;:1456,&quot;resizeWidth&quot;:588,&quot;bytes&quot;:231915,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3IBr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic 424w, https://substackcdn.com/image/fetch/$s_!3IBr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic 848w, https://substackcdn.com/image/fetch/$s_!3IBr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic 1272w, https://substackcdn.com/image/fetch/$s_!3IBr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff26c895b-e78b-4e0c-9251-e816b2218c62_2152x1304.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sanger sequencing read, image from M. White.</figcaption></figure></div><p>But the problem is, where do you start your sequence read. Down here it's not going to be a very good sequence, it gets pretty good in here,  but then the sequence starts degrading, so where do you stop reading your sequence. And here's where a major advance came that really enabled the human genome sequencing and it also comes from the Department of Genetics on the eight floor of McDonnell Sciences building. That is the development of quality statistics by Phil Green who was a faculty member in Genetics.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!s-bF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!s-bF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic 424w, https://substackcdn.com/image/fetch/$s_!s-bF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic 848w, https://substackcdn.com/image/fetch/$s_!s-bF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic 1272w, https://substackcdn.com/image/fetch/$s_!s-bF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!s-bF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic" width="284" height="485.3498452012384" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1104,&quot;width&quot;:646,&quot;resizeWidth&quot;:284,&quot;bytes&quot;:58416,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!s-bF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic 424w, https://substackcdn.com/image/fetch/$s_!s-bF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic 848w, https://substackcdn.com/image/fetch/$s_!s-bF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic 1272w, https://substackcdn.com/image/fetch/$s_!s-bF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d7adb1-2c40-4e32-a846-46e89e73844e_646x1104.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here's Phil, and what Phil did is <a href="https://genome.cshlp.org/content/8/3/186.long">develop algorithms</a> that would read those peaks, those colored peaks, and put a quality score on them. So his algorithm could read the peaks and tell you how sure of each base call you are. So a score of 10 means that there was a one in 10 probability the base was called wrong, so you're 90% certain that base is called right. A score of 50 said that there was a one in 100,000 chance that you were wrong so you're most certain of that of that base call. And so that enabled you to turn these peaks into scores which enabled automation of the sequence reading, and that was a major advance in fluorescent sequencing of the human genome. [<em>Note: Phil Green moved from Washington University in St. Louis to the University of Washington in 1994, <a href="https://pubmed.ncbi.nlm.nih.gov/15383656/">bringing with him</a> the protoypes of PHRED and PHRAP, initially developed at WashU, and which he later published at the University of Washington.</em>]</p><p>There were other advances this is the slide I got from Maynard. These are Maynard's landmarks in the evolution of four-color sequencing. There were advances in polyacrylamide gel technology, better dyes that gave you more sensitive detection , DNA polymerases that would accept better the dye-labeled nucleotides. And there's one more advance that was critical that, again also came from Washington University. This is one not from the McDonnell Sciences building, but from the first floor of the North building in the biochemistry department, and that was the development of <a href="https://pubmed.ncbi.nlm.nih.gov/2643080/">cycle sequencing</a>: using PCR technology to amplify your sequencing reaction so you can sequence small amounts of DNA. And that was done by our late colleague John Major and his student Jeff Axelrod.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yIZj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yIZj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic 424w, https://substackcdn.com/image/fetch/$s_!yIZj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic 848w, https://substackcdn.com/image/fetch/$s_!yIZj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic 1272w, https://substackcdn.com/image/fetch/$s_!yIZj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yIZj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic" width="262" height="360.25" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:704,&quot;width&quot;:512,&quot;resizeWidth&quot;:262,&quot;bytes&quot;:49850,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yIZj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic 424w, https://substackcdn.com/image/fetch/$s_!yIZj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic 848w, https://substackcdn.com/image/fetch/$s_!yIZj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic 1272w, https://substackcdn.com/image/fetch/$s_!yIZj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ebc8668-76b8-48ac-b6b4-7ade38a539a1_512x704.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">John Majors</figcaption></figure></div><p>So here's John, and what Jeff and John did was they use multiple rounds of anealing and extension with Taq polymerase to detect sequences in the genome. You do a one-sided PCR reaction, essentially a linear amplification of the reaction, and John pointed out that their primary extension assay could be used to determine DNA sequence directly from the yeast genome. That was the first instantiation of cycle sequencing, which was another major advance that enabled the sequencing of the of the human genome.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RQ_V!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RQ_V!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic 424w, https://substackcdn.com/image/fetch/$s_!RQ_V!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic 848w, https://substackcdn.com/image/fetch/$s_!RQ_V!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic 1272w, https://substackcdn.com/image/fetch/$s_!RQ_V!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RQ_V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic" width="600" height="264.9725274725275" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:643,&quot;width&quot;:1456,&quot;resizeWidth&quot;:600,&quot;bytes&quot;:63178,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RQ_V!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic 424w, https://substackcdn.com/image/fetch/$s_!RQ_V!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic 848w, https://substackcdn.com/image/fetch/$s_!RQ_V!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic 1272w, https://substackcdn.com/image/fetch/$s_!RQ_V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b857cf1-426f-4109-b641-36bfc2404e04_1558x688.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>So mapping a genome happened here, many other advances happened here at Washington University. So why am I up here telling you the story? It's all about Maynard and Bob and Phil. Right, why am I here? Well, I had a front row seat because I was peripherally involved in the project. So when Bob set out to sequence the worm genome, he and John decided they needed to get the sequence of the yeast genome also. Why? Well, they thought that they were going to be sequencing <em>C. elegans</em> DNA cloned in YACs. And if you isolate YAC DNA from yeast, you're going to necessarily be contaminated with yeast DNA. So you got to know the yeast sequence so you can throw it out.</p><p> So they wanted to sequence the yeast genome and they recruited me to be in charge of that of that project. So this was our <a href="https://pubmed.ncbi.nlm.nih.gov/8091229/">first paper</a> on this in 1994, we sequenced one of the yeast chromosomes. This was the second whole chromosome sequenced and that's why it was published in a relatively high, in a sensibly high-profile Journal. We did it in quite quick order because we had Rick Wilson&#8217;s and Bob Waterston&#8217;s Sequencing Center that enabled us to do that. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_-EZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_-EZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic 424w, https://substackcdn.com/image/fetch/$s_!_-EZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic 848w, https://substackcdn.com/image/fetch/$s_!_-EZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic 1272w, https://substackcdn.com/image/fetch/$s_!_-EZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_-EZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic" width="478" height="313.63787375415285" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:790,&quot;width&quot;:1204,&quot;resizeWidth&quot;:478,&quot;bytes&quot;:174633,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_-EZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic 424w, https://substackcdn.com/image/fetch/$s_!_-EZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic 848w, https://substackcdn.com/image/fetch/$s_!_-EZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic 1272w, https://substackcdn.com/image/fetch/$s_!_-EZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6e7d600-e628-461b-b879-8ce9ae985496_1204x790.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Johnston, et al., <em>Science </em>1994 Sep 30;265(5181):2077-82</figcaption></figure></div><p>Now what does it mean to manage a project like that? So my role to manage this project was to rely on Linda Riles. Linda was Maynard's technician who played a big role in his mapping of the yeast genome. So Linda was the caretaker of all the clones of the yeast genome. Here's Linda in front of the maps of the yeast genome. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rnss!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rnss!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic 424w, https://substackcdn.com/image/fetch/$s_!rnss!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic 848w, https://substackcdn.com/image/fetch/$s_!rnss!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic 1272w, https://substackcdn.com/image/fetch/$s_!rnss!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rnss!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic" width="398" height="296.6488372093023" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:641,&quot;width&quot;:860,&quot;resizeWidth&quot;:398,&quot;bytes&quot;:45202,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rnss!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic 424w, https://substackcdn.com/image/fetch/$s_!rnss!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic 848w, https://substackcdn.com/image/fetch/$s_!rnss!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic 1272w, https://substackcdn.com/image/fetch/$s_!rnss!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a2ddb80-171b-4ccc-86d2-ec7bdef24ac2_860x641.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>So my role was to go tell Linda what chromosome we were going to sequence next Linda would go in and figure out what clones span the chromosome. She'd give them to me, I'd walk them the three blocks over to the 4444 building, hand them to Lucinda [Fulton], and they'd get sequenced. The sequence would come in and I had the privilege of sitting in front of a computer and annotating it, which was really fantastic and I got all the credit. It's great work if you can get it, highly recommended.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/p/technologies-that-made-the-human?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/p/technologies-that-made-the-human?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p> Then just a little while later we collaborated with a worldwide Consortium of sequencers &#8212; back then sequencing was hard and so lots of people were involved &#8212; to produce the sequence of the <a href="https://pubmed.ncbi.nlm.nih.gov/8849441/">first eukaryotic genome</a>, yeast, about 12 megabases in size. This was an interesting sociological and political science experiment, which we can talk about later, maybe. </p><p>Then soon thereafter, Bob and John's Sequencing Center <a href="https://pubmed.ncbi.nlm.nih.gov/9851916/">sequenced a real genome</a>, you know, a large genome [<em>C. elegans</em>]. About 100 megabases is about roughly 10 times the size of of yeast. The first multi-cellular organism whose genome  was sequenced. And then of course you know that they sequenced the human genome. Here they are in 2000 celebrating the completion of the yeast genome. Here's Bob Waterston and Rick Wilson at the White House, with a few other minor players in the process, celebrating what they called the completion of the human genome sequence. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!h_VW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!h_VW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic 424w, https://substackcdn.com/image/fetch/$s_!h_VW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic 848w, https://substackcdn.com/image/fetch/$s_!h_VW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic 1272w, https://substackcdn.com/image/fetch/$s_!h_VW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!h_VW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic" width="380" height="298" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:298,&quot;width&quot;:380,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:16727,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!h_VW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic 424w, https://substackcdn.com/image/fetch/$s_!h_VW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic 848w, https://substackcdn.com/image/fetch/$s_!h_VW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic 1272w, https://substackcdn.com/image/fetch/$s_!h_VW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45811c3e-93b2-4d2c-b816-926b2fc85e20_380x298.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">L to R: Jim Watson, Eric Lander, Richard Gibbs, Bob Waterston, and Rick Wilson at the White House.</figcaption></figure></div><p>So really this guy [Bob Waterston], was I think one of the moral compasses of the project, for data release, for appropriate allocation of the resources of the project. He really, I think, does not get all of the credit that he deserves for his leadership role, his quiet leadership role in that project. Here is Bob at a press conference with Craig talking about talking about the sequence. </p><p>That's how genomics was instantiated in the field, how genomics was invented, I claim here, at Washington University on the eighth floor of the McDonnell Sciences building by Maynard, implemented extraordinarily well by Bob, and advanced by people like Phil Green. So we should all be proud to have been part of Washington University, and I had the great privilege of having a front row seat for all of that which I've just told you about.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[This week's finds in genomics and beyond]]></title><description><![CDATA[Some genomics and general science reading for your pre-holiday weekend]]></description><link>https://www.thisgenomiclife.org/p/this-weeks-finds-in-genomics-and</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/this-weeks-finds-in-genomics-and</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Fri, 20 Dec 2024 23:45:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-aQA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Here are five items that caught my eye this week:</p><h2>1. Genomic foundation models aren&#8217;t there yet</h2><p>One question that comes up in recent discussions about AI is what happens when the big large language models run out of training data? There are concerns that progress is starting to slow since companies like Google and OpenAI have already used up just about <a href="https://www.nytimes.com/2024/12/19/technology/artificial-intelligence-data-openai-google.html?unlocked_article_code=1.i04.if3V.OcfFRMEDSxOD&amp;smid=url-share">all of the available text</a> on the internet.</p><p>There is a parallel concern about large DNA language models, which haven&#8217;t made as much progress as one might have expected. Two recent papers benchmark these models and find that, despite the fact that they are the biggest AI models in genomics, they don&#8217;t compare favorably to humble CNNs trained on smaller datasets for specific tasks. </p><p>In one paper, out in <em><a href="https://www.nature.com/articles/s41592-024-02523-z">Nature Methods</a></em>, the team at <a href="https://www.instadeep.com">InstaDeep</a> trained transformer models on different datasets and then benchmarked their performance on a variety of tasks. In the paper, they give a glass-half-full interpretation of results that show models with 2.5 billion parameters, trained on hundreds of thousands of genomes, often don&#8217;t do worse on specific tasks than smaller specialized models. They generally don&#8217;t do better either. For example, a 2.5 billion parameter model trained and then fine-tuned to predict activities of regulatory sequences in a massively parallel reporter gene assay didn&#8217;t do any better than a small CNN:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-aQA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-aQA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic 424w, https://substackcdn.com/image/fetch/$s_!-aQA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic 848w, https://substackcdn.com/image/fetch/$s_!-aQA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic 1272w, https://substackcdn.com/image/fetch/$s_!-aQA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-aQA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic" width="490" height="338.24626865671644" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/edb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:740,&quot;width&quot;:1072,&quot;resizeWidth&quot;:490,&quot;bytes&quot;:40441,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-aQA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic 424w, https://substackcdn.com/image/fetch/$s_!-aQA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic 848w, https://substackcdn.com/image/fetch/$s_!-aQA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic 1272w, https://substackcdn.com/image/fetch/$s_!-aQA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb75b8e-8051-4de7-bdd7-4c6a02c36806_1072x740.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Fig. 2e from Dalla-Torre, et al., <em>Nature Methods</em> DOI https://doi.org/10.1038/s41592-024-02523-z</figcaption></figure></div><p>Despite these disappointing results, this a well-done study that presents a clear, systematic evaluation of these models. </p><p>A second DNA language model benchmarking paper was <a href="https://arxiv.org/abs/2412.05430">just posted</a> on arXiv by Anshul Kundaje and his lab, at Stanford. This is also a nicely-executed study that includes a worthwile discussion of the state of the field. Their top-line conclusion is that <em><strong>&#8220;current annotation-agnostic DNALMs exhibit inconsistent performance and do not offer compelling gains over alternative baseline models for most tasks, despite requiring significantly more computational resources.&#8221; </strong></em>So yeah, we&#8217;re not there yet.</p><p>I&#8217;ll highlight a critical point from their discussion that, at least in the circles I move in, is one that is increasingly recognized as a challenge: the genome itself isn&#8217;t necessarily the best training data for the models we want:</p><blockquote><p>Although DNALMs successfully discriminate regulatory DNA from background sequences, they appear to learn incomplete repertoires of regulatory sequence features. This limitation likely stems from the sparsity and the uneven distribution of regulatory features; regulatory elements constitute only 10-20% of the human genome, and certain classes of regulatory features occur at substantially different frequencies.</p></blockquote><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h2>2. A tool to measure the activity of a cell&#8217;s regulatory factors</h2><p>Transcription factors are the key regulators of a cell&#8217;s identity, since they direct the expression of the right repertoire of cell type-specific genes. To know which transcription factors are active in a cell type people typically look at RNA-seq data, but this only tells you something about the amount of transcription factor present in a cell and not its level of functional activity.</p><p>Bas van Steensel&#8217;s lab, at the Netherlands Cancer Institute, has a <a href="https://www.cell.com/cell-systems/fulltext/S2405-4712(24)00312-0">paper</a> out in <em>Cell Systems</em> that describes how to use multiplexed reporter gene libraries as a tool to read out transcription factor activity in a cell. They came up with a scheme to design optimal sensors of transcription factor activity, and because the libraries are multiplexed, you can read out the activities of dozens of factors. </p><h2>3. Everything you ever wanted to know about commercial single cell technologies.</h2><p>One of my plans over the holiday break is to get up to speed on some areas I should know better than I do, including the latest single-cell omics technologies. Fortunately, the latest issue of <em>Cell Genomics</em> has a comprehensive, <a href="https://www.cell.com/cell-genomics/fulltext/S2666-979X(24)00352-5">open access review</a> of single-cell genomic and epigenomic assay technologies. Even better, this is merely the introduction to what they call a &#8220;living review&#8221;: a Substack page at <a href="https://sctrends.org/">sctrends.org</a> discussing all things related to single-cell and spatial omics technologies. (Noted: find a way to promote your Substack by writing a paper about it&#8230;)</p><h2>4. The laws underlying the physics of everyday life are completely understood</h2><p>In 2010, the physicist Sean Carroll wrote what is probably one of my favorite <a href="https://www.preposterousuniverse.com/blog/2010/09/23/the-laws-underlying-the-physics-of-everyday-life-are-completely-understood/">blog posts</a> of all time, in which he claimed that &#8220;the laws underlying the physics of everyday life are completely understood.&#8221; He said:</p><blockquote><p>A hundred years ago it would have been easy to ask a basic question to which physics couldn&#8217;t provide a satisfying answer. &#8220;What keeps this table from collapsing?&#8221; &#8220;Why are there different elements?&#8221; &#8220;What kind of signal travels from the brain to your muscles?&#8221; But now we understand all that stuff. (Again, not the detailed way in which everything plays out, but the underlying principles.) Fifty years ago we more or less had it figured out, depending on how picky you want to be about the nuclear forces. But there&#8217;s no question that the human goal of figuring out the basic rules by which the easily observable world works was one that was achieved once and for all in the twentieth century.</p></blockquote><p>This claim caused a little controversy, but I like it because it pushes back against a view of science that people may have picked up from Thomas Kuhn (or Larry Laudan for a deeper cut), namely that scientific paradigms are always going to change and that today&#8217;s theories are almost certainly going to be discarded by scientists of the future. I don&#8217;t think that is uniformly true. In biology we&#8217;ve figured out the basics of mammalian sexual reproduction (two haploid gametes fuse to form a diploid zygote). We have a good understanding of the structure of DNA and why that structure enables DNA to act as a template for transcription and its own replication. Not everything in science is going to be obsolete.</p><p>I can&#8217;t speak to the physics, but Carroll wrote an interesting <a href="https://arxiv.org/pdf/2101.07884">paper</a> laying out the details of his claim. (It was posted in 2021 but I didn&#8217;t see it until this week.) If you&#8217;re interested in this discussion, it&#8217;s worth at least skim, even if you don&#8217;t have a background in quantum mechanics.</p><h2>5. Are we living in a simulation?</h2><p>Speaking of philosophy and science, here&#8217;s one more interesting <a href="https://onlinelibrary.wiley.com/doi/epdf/10.1111/phpr.13124">paper</a>, this one by the philosopher Peter Godfrey-Smith, who is one of the most interesting philosophers of biology working today. He takes on David Chalmer&#8217;s <a href="https://wwnorton.com/books/9780393635805">best-selling book</a> <em>Reality+</em> and gets into questions like the substrate independence of minds.</p><h2>6.  Three history of science books that directly address philosophy</h2><p>It&#8217;s getting a little late to make holiday book recommendations, but here are three I enjoyed, one well known, and two less familiar. I enjoy books that present a careful history of some aspect of science, and then use that history to directly address big questions in philosophy of science. </p><p><strong>David Wotton, </strong><em><strong><a href="http://www.inventionofscience.com/">The Invention of Science</a>. </strong></em>This book got quite a bit of attention when it came out in 2015. I&#8217;ve read more than a dozen books on the Scientific Revolution and this is easily my favorite. Wootton argues that science was not a gradual evolution from earlier natural philosophy, but was instead a specific invention of the long 17th century. He makes his case by analyzing the emergence of language of science. For example, when did people start using the words <em>discovery</em>, <em>experiment</em>, and <em>fact</em>, and in what context were they using them? This proves to be an extremely effective way at understanding what the scientists of the 17th century thought they were up to. (They didn&#8217;t use the word &#8216;scientist&#8217;, but they used other words that captured the same idea.)</p><p><strong>Stephen Brush,</strong><em><strong> <a href="https://global.oup.com/academic/product/making-20th-century-science-9780199978151?cc=us&amp;lang=en&amp;">Making 20th Century Science</a></strong></em><strong>. </strong>Historian Stephen Brush asks, what is more convincing to scientists, a successful new prediction or an explanation of some long-standing anomaly? To answer this question, Brush examines the history of how scientific consensus was achieved for major theories developed in the 20th century. He traverses quantum mechanics, organic chemistry, general relativity, the Big Bang, the hereditary role of chromosomes, and the Modern Synthesis of evolution and genetics. You may have thought that Eddington&#8217;s famous 1919 eclipse expedition to test Einstein&#8217;s theory of general relativity is a classic case of a successful new prediction persuading scientists, but Brush explains that it took nearly five more years of careful measurements before most of the physics community came around. In general Brush&#8217;s answer is that both prediction and explanation can be persuasive in different contexts, but there is more to it. To get the full, nuanced answer, read the book.</p><p><strong>Elof Axel Carlson, </strong><em><strong><a href="https://www.cshlpress.com/default.tpl?action=full&amp;--eqskudatarq=1410">How Scientific Progress Occurs</a></strong></em><strong>. </strong>This concise book reviews the history of cell and molecular biology, with the aim of contrasting scientific progress in biology with Thomas Kuhn&#8217;s model of paradigm shifts. Carlson argues that biology is much more incremental and, as far as he can see, basically never involves paradigm shifts. It&#8217;s a provocative book with some great history - history that really anyone working in the biomedical sciences should know.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p></p><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[The Origin of the False Claim That Vaccines Cause Autism Is Worse Than You Think]]></title><description><![CDATA[Did the editors of The Lancet in 1998 not understand how science works?]]></description><link>https://www.thisgenomiclife.org/p/the-origins-of-the-false-claim-that</link><guid isPermaLink="false">https://www.thisgenomiclife.org/p/the-origins-of-the-false-claim-that</guid><dc:creator><![CDATA[Mike White]]></dc:creator><pubDate>Wed, 18 Dec 2024 19:06:21 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!1HcU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1HcU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1HcU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic 424w, https://substackcdn.com/image/fetch/$s_!1HcU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic 848w, https://substackcdn.com/image/fetch/$s_!1HcU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic 1272w, https://substackcdn.com/image/fetch/$s_!1HcU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1HcU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic" width="458" height="649.9760479041917" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1896,&quot;width&quot;:1336,&quot;resizeWidth&quot;:458,&quot;bytes&quot;:435006,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1HcU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic 424w, https://substackcdn.com/image/fetch/$s_!1HcU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic 848w, https://substackcdn.com/image/fetch/$s_!1HcU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic 1272w, https://substackcdn.com/image/fetch/$s_!1HcU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79418dc5-9f9a-4e41-b580-ba2109d055e7_1336x1896.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Unfortunately Robert F. Kennedy Jr. is now a regular presence in the national news, and it means that we are going to suffer through yet another cycle of public debate over the <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC6768751/">false claims</a> that autism has been linked to vaccines. Responsible journalists, medical institutions, and public figures will yet again need to remind everyone that a there is no link between the MMR vaccine and autism, that the issue has been <a href="https://pubmed.ncbi.nlm.nih.gov/12421889/">thoroughly investigated</a> in <a href="https://pubmed.ncbi.nlm.nih.gov/25898051/">large studies</a> and meta-analyses, whose results are <a href="https://nap.nationalacademies.org/catalog/13164/adverse-effects-of-vaccines-evidence-and-causality">publicly available</a>. It is good to cite this science, but it can lead people who are inclined to be skeptical of vaccines to feel like this is a complex debate with large datasets and difficult statistics that once could parse in different ways. </p><p>I want to try something different here. In this post, we&#8217;ll go back to the original source of the claim that the MMR vaccine might cause autism, the notorious, now-retracted <a href="https://pubmed.ncbi.nlm.nih.gov/9500320/">1998 paper</a> published by Wakefield, <em>et al</em>. in <em>The Lancet</em>. What people typically know about this paper, if they are aware of it, is that it was retracted and Wakefield and his colleagues were accused of fraud and ethics violations. (They had undisclosed financial interests, they didn&#8217;t get the necessary consent for conducting invasive examinations of their child subjects, and they lied about how they assembled their study population. See <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC3136032/">this summary</a>.)</p><p>But you don&#8217;t need to know any of that to see that the paper, taken at face value, is obviously terrible. It&#8217;s study design (and I&#8217;m being generous using the word &#8220;design&#8221;) has problems can be easily understood by anyone who has even a vague recollection of high school science. When you read this paper, you can understand why a whole movement for &#8220;evidence-based medicine&#8221; had to be a thing, because the 1990&#8217;s <em>Lancet</em> editors clearly didn&#8217;t understand how medical evidence works.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.thisgenomiclife.org/subscribe?"><span>Subscribe now</span></a></p><h3>Problem 1: There is no statistical power in 12 kids who happen to show up at your clinic</h3><p>The 1998 Wakefield paper is an &#8220;early report&#8221; that describes an assessment of twelve children who visited a UK gastroenterology clinic. Their ages ranged from 3-11, and, according to the paper, they had &#8220;a history of normal development followed by loss of acquired skills, including language, together with diarrhoea and abdominal pain.&#8221; You don&#8217;t need any more information than this to understand that a study like this has zero power to find any association between autism and the MMR vaccine. Twelve kids who show up at your clinic, among a population of tens of millions of children who had received the MMR vaccine, will tell you nothing about MMR and autism.</p><h3>Problem 2: The association of autism with vaccination was a parent report and not a statistical finding</h3><p>Nine of the twelve children in the study were diagnosed with autism, though the paper does not describe how the diagnosis was made. Three others were ambiguous cases. The main evidence used to claim a link with vaccination, I kid you not, is this:</p><blockquote><p>In eight children, the onset of behavioural problems had been linked, either by the parents or by the child's physician, with measles, mumps, and rubella vaccination. </p></blockquote><p>That is literally the entire basis for suggesting a link between vaccines and autism in this paper - the parents or the child&#8217;s physician thought the vaccine caused their child&#8217;s condition. There is no statistical analysis, no clear histological or other unambiguous evidence of vaccine effects. The children were all given colonoscopies, and the study reports a miscellany of findings, each of which only occurred in some of the children. Wakefield had earlier <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC6768751/">explored the idea</a> that measles and the MMR vaccine might be associated with the development of irritable bowel disease (IBD). This idea morphed into his belief that vaccines predispose children to autism via gut pathology. Hence the colonoscopies for his patients.</p><p>There is a table in the paper titled &#8220;Neuropsychiatric diagnosis&#8221; whose columns show the behavioral diagnosis for each subject, the &#8220;exposure identified by parents or doctor&#8221;, the interval from exposure to first symptoms (usually a few days to a week), etc. This table does not present evidence for anything. It merely lists a small number children who happened to show up at a GI clinic, who have autism and who supposedly received the MMR vaccine within a week of the first symptoms (<em>according to the parents, with no other documentation).</em></p><h3>Problem 3: No controls</h3><p>When you want to find an association between some cause and a disease, you need to include disease cases <em>and</em> healthy controls. If, for example, you wanted to claim that <a href="https://xkcd.com/882/">green jelly beans</a> are associated with a risk of cancer, your analysis would need to include control subjects who didn&#8217;t get cancer, so that you could check whether the rate of green jelly bean consumption differed among the groups. Alternatively, you could follow two groups, one that ate green jelly beans and one that did not, and ask whether there was a difference in cancer incidence between the groups. The Wakefield study has no real controls. There is no work-up of healthy children presented or an analysis of children who were not vaccinated. </p><p>Figure 1 does show a plot of urinary methylmalonic-acid in 8 of the 12 patients versus 14 &#8220;age-matched controls.&#8221; There is no description of where these control patients came from or anything else about them. There is a p value (p = 0.003), but how it was calculated is not described. And methylmalonic-acid says nothing about autism or vaccines; it is a potential indicator of vitamin B-12 deficiency.</p><h3>How could anyone consider this science?</h3><p>There are many other problems in this short paper we could discuss, but by now it should be clear: this work, taken at face value, ignoring the later evidence of fraud, couldn&#8217;t possibly say <em>anything</em> about a link between vaccines and autism. It&#8217;s a description of twelve children who were vaccinated (like millions of other children the same age), who had autism, and whose parents thought the vaccine caused their autism. This is not science. </p><p>It&#8217;s true that the authors write that &#8220;We did not prove an association between measles, mumps, and rubella vaccine and the syndrome described.&#8221; But this leaves out the fact that they didn&#8217;t even <em>investigate</em> an association between the MMR vaccine and autism. And yet, in the very next paragraph, the authors write that &#8220;If there is a causal link between measles, mumps, and rubella vaccine and this syndrome, a rising incidence might be anticipated after the introduction of this vaccine in the UK in 1988.&#8221; It was this idea that caught fire in the media and, nearly three decades later, has left us us with a nominee to lead the U.S. Department of Health and Human Services who goes on television to tell people that vaccines cause autism.</p><p>The most perplexing question is how any serious person at <em>The Lancet</em> thought a manuscript like this met the standards for publication in their journal. Medical science, like all science, is an astonishingly effective generator of reliable knowledge <em>when</em> it is based on reliable methods. This paper flagrantly ignores those methods. It is worth noting that in the year after the Wakefield paper <em>The Lancet </em>published a <a href="https://pubmed.ncbi.nlm.nih.gov/10376617/">real investigation</a> with nearly 500 subjects and found no association between the MMR vaccine and autism, no evidence of a distinct increase in incidence after the MMR vaccine was introduced, and no relationship between the timing of vaccination and the age of autism diagnosis. But the damage had been done.</p><p>A quick read of the Wakefield paper is enough for almost anyone to understand that false claims about vaccines and autism did not arise from some suggestive or worrying early study that was later clarified. The tragedy is that there was never any basis whatsoever for believing the claim.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.thisgenomiclife.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading This Genomic Life! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item></channel></rss>