合肥生活安徽新聞合肥交通合肥房產(chǎn)生活服務(wù)合肥教育合肥招聘合肥旅游文化藝術(shù)合肥美食合肥地圖合肥社保合肥醫(yī)院企業(yè)服務(wù)合肥法律

        代寫EMATM0050 DSMP MSc in Data Science

        時間:2024-04-21  來源:合肥網(wǎng)hfw.cc  作者:hfw.cc 我要糾錯



         University of Bristol MSc in Data Science; DSMP (Data Science Mini Project; EMATM0050)
        Predicting T-Cell Receptor Specificity
        T cells (T lymphocytes) are among the most important immune system cells with a vital role in adaptive immunity. T cells recognise cells in the body infected by viruses, bacteria or cells that have undergone cancer transformation. After recognising the infected or cancerous cells, T cells eliminate them from the body thereby preventing the spread of infection or cancer.
        T cells recognise their targets through their T Cell Receptors (TCRs) expressed on their cell membrane. A T Cell Receptor consists of an alpha and a beta subunit. The evolutionary arms race between pathogens and the immune system has resulted in a mechanism for generation of a huge number of unique TCRs: and this is essential for a proper immune response against infections and cancer. Although TCR genes are encoded in the genome, their diversity is massively enhanced in several ways: (i) each TCR is composed of a pair of proteins (either alpha + beta chains or gamma + delta chains); (ii) rather than being encoded as a single gene, the DNA encoding the variable region of each of these chains is formed by joining 3 or 4 different stretches of DNA (gene segments) in a process is called VDJ recombination. Each alpha subunit contains a single V and J segment and each beta subunit contains a single V, a D and a J segment. Diversity is provided by the fact that the genome encodes multiple V, D and J segment; (iii) The joining of these segments involves mechanisms which insert and delete nucleotides in a pseudorandom fashion, maximising diversity in the joining region (the CDR3), the region of the TCR chain which contacts the peptide antigen. (ref 1)
        T Cell Receptors (TCRs) constitute one of the most promising classes of emerging therapeutics. Whilst TCRs are amongst the most complex facets of immune biology, engineering of an optimum TCR can transform immunotherapies and personalised medicines. The TCR repertoire at any time point reflects on the person’s health and contains a memory of all past experiences. However, CRs are highly variable and their specificities aren’t easily predictable with traditional empirical methods.
        In this project you will analyse TCR repertoire from the VDJdb (link) and use machine learning to predict TCRs that will bind to specific epitopes.
         
         Tasks
        1. Data Download and Preprocessing
        1.1 Download the zip file from GitHub and focus on the VDJdb.txt file.
        1.2 Preprocess the dataset. Figure out what each column represents and keep
        columns that will help you complete the project.
        Predicting TCR specificity from sequence alone is the holy grail of immunotherapy. TCRs that are specific to the same target, often have very similar sequences, thereby TCR sequence – target patterns emerge in the data.
        A crude approach could be to represent amino acids of the TCR or key regions of it using one-hot representation.
        2. What are the limitations of this approach in downstream analysis? Could you describe a way to overcome them (Hint: Consider the CDR3 length distribution. We are looking for a high level description of the limitation and an approach that would overcome it. No algorithm development is required.)
        A common method to predict specificity from a sequence is described in Vujovic et.al. (1). It creates some kind of distance or similarity score matrix of TCR sequences and uses that representation to train models that can classify TCRs based on specificity (Fig 1.).
         
          3. Estimate a distance/similarity matrix representation of the data. Calculate these metrics for the alpha and the beta chains separately, then calculate these for the combined alpha and beta chains too. (Hint: TCRDist, GLIPH or GIANA can be used for this. Alternatively, you can define your own similarity metric.)
        4. Plot the TCRs in 2 dimensions and colour them based on specificity. Compare the plots for the alpha, the beta and the combined alpha-beta chains. Comment on your findings. (Hint: scikit-learn has a plethora of dimensionality reduction tools. Some examples are PCA, tSNE and UMAP.)
        5. Write code to cluster TCRs. How well do TCRs cluster based on specificity? Can you explain why they do/don’t?
        6. Write an algorithm that can predict antigen specificity from sequence. You can use any supervised/unsupervised algorithm to predict specificity. Comment on the performance of the model and reason why it performs good or bad. (Hint: Any reasonable modelling approach is fine. However, keep in mind that simpler models sometimes provide more insights regarding the underlying problem.)

         Bibliography/References
        1. Vujovic M, Degn KF, Marin FI, Schaap-Johansen AL, Chain B, Andresen TL, Kaplinsky J, Marcatili P. T cell receptor sequence clustering and antigen specificity. Comput Struct Biotechnol J (2020) 18:2166–21**. doi:10.1016/j.csbj.2020.06.041
        2. Mayer-Blackwell. TCR meta-clonotypes for biomarker discovery with tcrdist3: quantification of public, HLA- 2 restricted TCR biomarkers of SARS-CoV-2 infection. bioRxiv (2020) 1:75–94.
        3. Huang H, Wang C, Rubelt F, Scriba TJ, Davis MM. Analyzing the Mycobacterium tuberculosis immune response by T-cell receptor clustering with GLIPH2 and genome-wide antigen screening. Nat Biotechnol (2020) 38:1194–1202. doi:10.1038/s41587-020-0505-4
        4. Zhang H, Zhan X, Li B. GIANA allows computationally-efficient TCR clustering and multi-disease repertoire classification by isometric transformation. Nat Commun (2021) 12:1–11.doi:10.1038/s41467-02**25006-WX:codinghelp

        掃一掃在手機(jī)打開當(dāng)前頁
      1. 上一篇:學(xué)習(xí)英語必備的幾大教材!非常全面
      2. 下一篇:代做CS 7642 Reinforcement Learning and Decision
      3. 無相關(guān)信息
        合肥生活資訊

        合肥圖文信息
        出評 開團(tuán)工具
        出評 開團(tuán)工具
        挖掘機(jī)濾芯提升發(fā)動機(jī)性能
        挖掘機(jī)濾芯提升發(fā)動機(jī)性能
        戴納斯帝壁掛爐全國售后服務(wù)電話24小時官網(wǎng)400(全國服務(wù)熱線)
        戴納斯帝壁掛爐全國售后服務(wù)電話24小時官網(wǎng)
        菲斯曼壁掛爐全國統(tǒng)一400售后維修服務(wù)電話24小時服務(wù)熱線
        菲斯曼壁掛爐全國統(tǒng)一400售后維修服務(wù)電話2
        美的熱水器售后服務(wù)技術(shù)咨詢電話全國24小時客服熱線
        美的熱水器售后服務(wù)技術(shù)咨詢電話全國24小時
        海信羅馬假日洗衣機(jī)亮相AWE  復(fù)古美學(xué)與現(xiàn)代科技完美結(jié)合
        海信羅馬假日洗衣機(jī)亮相AWE 復(fù)古美學(xué)與現(xiàn)代
        合肥機(jī)場巴士4號線
        合肥機(jī)場巴士4號線
        合肥機(jī)場巴士3號線
        合肥機(jī)場巴士3號線
      4. 上海廠房出租 短信驗證碼 酒店vi設(shè)計

        主站蜘蛛池模板: 亚洲一区二区三区偷拍女厕| 亚洲V无码一区二区三区四区观看| 亚洲一区二区视频在线观看| 久久一区二区明星换脸| 九九无码人妻一区二区三区 | 久久久久成人精品一区二区| 国产成人一区在线不卡| 精品国产一区二区三区香蕉事| 人妻无码一区二区不卡无码av| 国产午夜精品一区理论片飘花| 亚洲片一区二区三区| 国产丝袜一区二区三区在线观看| 亚洲一区二区三区高清在线观看| 亚洲一区免费视频| 射精专区一区二区朝鲜| 另类免费视频一区二区在线观看| 国产成人精品无码一区二区三区| 福利在线一区二区| 国产精品亚洲一区二区三区| 在线视频一区二区日韩国产| 国产色欲AV一区二区三区| 大香伊蕉日本一区二区| 影院成人区精品一区二区婷婷丽春院影视| 亚洲国产精品一区二区久| 麻豆国产一区二区在线观看 | 中文字幕在线观看一区二区 | 成人精品视频一区二区| 成人精品视频一区二区| 激情久久av一区av二区av三区| 国模视频一区二区| 亚洲乱码国产一区网址| 国产一区二区精品久久凹凸| 亚洲福利视频一区| 国产一区二区三区乱码网站| 国产一区二区三区乱码在线观看 | av无码免费一区二区三区| 色婷婷av一区二区三区仙踪林 | 精品国产日产一区二区三区 | 国产精品电影一区二区三区| 精品不卡一区中文字幕| 亚洲国产综合精品一区在线播放|