Through engagements with both companies, we not only toured many of the company's biotech instruments, enriching our biological background knowledge,
but also gained a deeper understanding of the role of synthetic biology in biomedicine and life health. We learned that AI has already played a
significant role in new drug development. What;'s more, both companies have utilized AI to optimize wet lab processes, making substantial contributions
significant role in new drug development. What's more, both companies have utilized AI to optimize wet lab processes, making substantial contributions
to clinical diagnosis, treatment, and drug development.
</p>
<p>
...
...
@@ -268,38 +269,45 @@
Additionally, during the exchange, we also acknowledged the critical significance of "safety" in the new drug development process. In our project,
we placed significant emphasis on incorporating measures to <strongstyle="color: #8a61ad;">ensure the security of data sources and the integrity of algorithmic output sequences</strong>.
<h5style="width:50%;font-weight: bold;margin-bottom: 12px;">(Chinese Academy of Sciences, Assoc. Prof.)</h5>
</div>
<p>
On April 19th, we had the privilege of inviting Yinglu Cui from the research group of Professor Bian Wu at the Chinese Academy of Sciences to visit Tsinghua University. We held a group
seminar where we engaged in extensive discussions regarding protein design and enzyme engineering.
<h5style="width:50%;font-weight: bold;margin-bottom: 12px;">(Chinese Academy of Sciences, Assoc. Prof.)</h5>
</div>
<p>
On April 19th, we had the privilege of inviting Yinglu Cui from the research group of Professor Bian Wu at the Chinese Academy of Sciences to visit Tsinghua University. We held a group
seminar where we engaged in extensive discussions regarding protein design and enzyme engineering.
Professor Cui introduced to us the research undertaken by their group in the field of "computational-enzyme design." We reviewed the history of protein optimization and design,
highlighting areas such as rational design, semi-rational design, directed evolution, etc. She outlined the various breakthroughs and ongoing challenges in the transition to the
current phase of computational design. Further, she provided insights into the prospects of applying computer-designed non-natural enzymes in the field of biomedicine.
</p>
<p>
Our team presented our research and algorithm designs in protein optimization, sharing some of the algorithmic models we've experimented with. Professor Cui shared valuable insights
from their experience in this field, pointing out that the one-hot encoding we've been using lacks universality. Instead, she suggested we should include pre-encoding work and emphasized
the importance of assessing the generalization capability when researching model algorithms in the protein design realm. She noted that some current models, such as DLKcat, suffer from
from her lab's experience in this field, pointing out that the one-hot encoding we've been using lacks universality. Instead, she suggested we should include <strongstyle="color: #8a61ad;">pre-encoding work </strong>and emphasized
the importance of assessing the <strongstyle="color: #8a61ad;">generalization capability</strong> when researching model algorithms in the protein design realm. She noted that some current models, such as DLKcat, suffer from
weak generalization abilities. In response to Professor Cui's suggestions, we systematically worked on improvements, addressing each point.
<strongstyle="color: #8a61ad;">The role:</strong> Professor Xie has extensive experience in both academia and industry, providing us with guidance on the
direction of our research topics and technical approach.
</p>
<p>
<strongstyle="color: #8a61ad;">Profile:</strong> Professor Xie Zhen is a professor at the Center for Synthetic and Systems Biology at Tsinghua University,
as well as the founder and chief scientist of Syngentech.
<strongstyle="color: #8a61ad;">The role:</strong> Professor Xie has extensive experience in both academia and industry, providing us with guidance on the
direction of our research topics and technical approach.
</p>
<p>
<strongstyle="color: #8a61ad;">Profile:</strong> Professor Xie Zhen is a professor at the Center for Synthetic and Systems Biology at Tsinghua University,
as well as the founder and chief scientist of Syngentech.
After communicating with several protein design and optimization companies, we have transitioned our focus from protein optimization design to antibody optimization design. However,
we haven't yet identified a specific task. Before establishing a concrete plan, we learned about recent breakthroughs in using artificial intelligence for de novo antibody design and
...
...
@@ -309,32 +317,46 @@
<p>
We further investigated antibody humanization and found that traditional humanization techniques primarily involve a series of biological tools and simulation computations implemented
on computers, whereas machine learning deep learning methods are less used in the field, and are mainly employed for binary classification and humanization scoring of humanized antibodies.
We aim to delve deeper into using machine learning and deep learning tools to address challenges in this domain, incorporating considerations for antibody structures to outline a preliminary
We aim to delve deeper into using machine learning and deep learning tools to address challenges in this domain, incorporating considerations for antibody structures. When we propose a preliminary
technical route. We once again communicated with Professor Xie, who acknowledged our idea and expressed anticipation for us to construct a comprehensive automated tool and introduce innovative aspects.
<strongstyle="color: #8a61ad;">The role:</strong> Ziting Zhang provided us with an in-depth introduction to immunological knowledge related to antibodies. As a researcher in the field of immunology,
she expressed that our project holds significant meaning.
</p>
<p>
<strongstyle="color: #8a61ad;">Profile:</strong> Ziting Zhang is a Ph.D. student in Bioinformatics at Tsinghua University, and she possesses unique insights in immunology.
Moreover, Zhang Ziting was a member of the Tsinghua-A team in 2020.
<strongstyle="color: #8a61ad;">The role:</strong> Ziting Zhang provided us with an in-depth introduction to immunological knowledge related to antibodies. As a researcher in the field of immunology,
she expressed that our project holds significant meaning.
</p>
<p>
<strongstyle="color: #8a61ad;">Profile:</strong> Ziting Zhang is a Ph.D. student in Bioinformatics at Tsinghua University, and she possesses unique insights in immunology.
Moreover, Zhang Ziting was a member of the Tsinghua-A team in 2020.
Ziting Zhang also pursued her undergraduate studies in the Department of Automation at Tsinghua University. Currently, her research interests converge at the intersection of bioinformatics.
Through exchanges with her, we gained a deeper understanding of immunological knowledge related to antibodies and how to leverage the advantages of our team's background in information.
She believes that our team's project this year holds considerable meaningand looks forward to the richness and experience of the Tsinghua-A team participating in the iGEM competition again after three years.
<strongstyle="color: #8a61ad;">The role:</strong> Xiangzhe Kong provided us with a lot of advice on CDR algorithm design, which helped us to understand the current situation
in the field of antibody CDR design in the international arena.
</p>
<p>
<strongstyle="color: #8a61ad;">Profile:</strong> Xiangzhe Kong is a Ph.D. student at the Department of Computer Science, Tsinghua University, and the author of "Conditional Antibody Design as 3D Equivariant Graph Translation"
(ICLR 2023 Outstanding Paper Honorable Mention). He possesses extensive experience in computational de novo antibody design, especially CDR design.
<strongstyle="color: #8a61ad;">The role:</strong> Xiangzhe Kong provided us with a lot of advice on CDR algorithm design, which helped us to understand the current situation
in the field of antibody CDR design in the international arena.
</p>
<p>
<strongstyle="color: #8a61ad;">Profile:</strong> Xiangzhe Kong is a Ph.D. student at the Department of Computer Science, Tsinghua University, and the author of "Conditional Antibody Design as 3D Equivariant Graph Translation"
(ICLR 2023 Outstanding Paper Honorable Mention). He possesses extensive experience in computational de novo antibody design, especially CDR design.
Antibodies are divided into constant and variable regions, within the variable region are the high-frequency mutated CDR regions (where the antigen comes into contact with the antibody), and the relatively conserved and
species-specific FR regions. As the antigen-antibody binding region, CDR has garnered significant attention in the academic field, and there are a lot of difficulties in the design of CDR region, especially CDR-H3 loop.
...
...
@@ -342,17 +364,24 @@
essential aspects of designing CDR regions from sequence to structure, and learned about some details of Equivariant Graph Neural Networks (EGNs). Through this exchange, we gained insights into the unique characteristics of
CDR regions and came to a full realization of the high variability of the CDR H3 loop. Based on this understanding, we made improvements to the structural scoring aspect of our project.
<strongstyle="color: #8a61ad;">The role:</strong> Wenbo Guo gave us valuable suggestions on our algorithms.
</p>
<p>
<strongstyle="color: #8a61ad;">Profile:</strong> Wenbo Guo is a postdoctoral fellow in the bioinformatics direction at Tsinghua University, and has worked on classifying and annotating rare cell types using
OCLR (One Class Logistic Regression) promoted by One Class classifier.
<strongstyle="color: #8a61ad;">The role:</strong> Wenbo Guo gave us valuable suggestions on our algorithms.
</p>
<p>
<strongstyle="color: #8a61ad;">Profile:</strong> Wenbo Guo is a postdoctoral fellow in the bioinformatics direction at Tsinghua University, and has worked on classifying and annotating rare cell types using
OCLR (One Class Logistic Regression) promoted by One Class classifier.
Since we apply One Class approach to scoring antibody sequences from rare species, we introduced our project and consulted with Dr. Guo about the key advantages of the one-class approach. Dr. Guo noted that the approach exhibits excellent scalability,
making it highly suitable for our application scenario. Additionally, we also sought advice from Dr. Zeyu Chen, who is also from the same research group. Dr. Chen recommended that we first experiment with methods such as SVM one class and IsolationForest,
making it highly suitable for our application scenario. Additionally, we also sought advice from Zeyu Chen, who is a Ph.D. student from the same research group. Dr. Chen recommended that we first experiment with methods such as SVM one class and IsolationForest,
which are commonly used for outlier detection.
</p>
<p>
...
...
@@ -360,6 +389,23 @@
project. However, we also realized that the aforementioned traditional machine learning algorithms could be repurposed for outlier detection and integrated into our project to optimize it. Following several rounds of comparative validation, we selected
Local Outlier Factor as the method for outlier detection. Additionally, we embarked on constructing our own deep one class algorithm to score antibody sequences.
When we were at CCiC, we had a preliminary understanding of PekingHSC's projects. Due to the geographical advantage, we visited Peking University's
main campus on July 24th and engaged in in-depth discussions with students from the School of Medicine at Peking University. We once again presented
...
...
@@ -435,6 +481,67 @@
related topics. Members of PekingHSC also introduced us to some laboratories at Peking University dedicated to antibody research. Both of our
teams assisted each other, gained significant insights, and forged lasting friendships.
</p>
<h2id="8"style="font-weight: bold;color:#573674">Synbio Plus AI Conference</h2>
<p>
We communicated with other teams about the applications of AI in synthetic biology, with a focus on introducing the algorithmic models and considerations
used in our project. Many teams utilized pre-trained models, and we shared some insights regarding generalization capabilities: emphasizing the alignment
between the data used during pre-training and the data employed in actual project is crucial for achieving better results.
</p>
<p>
In addition, we placed particular emphasis on our team's concern regarding the safety aspects of utilizing AI in new drug development, along with a series
of measures. We shared our insights derived from discussions with multiple companies. We believe that the current limitations of AI encompass: dataset quality,
interpretability, and security. Similar to the perspective of a member of SYSU-Software, we concur that sometimes data holds more significance than the algorithms
themselves. When applying AI technology to antibody design, limitations in data quality and quantity might constrain the potential risks associated with the designed
antibodies. Moreover, comprehending how AI models predict and optimize antibody sequences and characteristics is crucial for ensuring their reliability and safety.
However, prevailing deep learning models often possess intricate structures and parameters, making it difficult to elucidate the rationales behind their decisions,
which raises doubts about their reliability and trustworthiness. Hence, our team is also devoted to developing more interpretable models and algorithms, aiming to
provide explanations for the decision-making processes and outcomes of AI models in antibody design.
</p>
<p>
During the conference, we also delved into the ethical challenges AI faces, as well as the relevant legal regulations and academic standards. This prompted us to
engage in further reading about the legal regulations and academic standards related to AI after the discussions concluded. This, in turn, facilitated the refinement of our project.