Repost: What TCGA Sample IDs Mean
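People often ask what the sample IDs in data downloaded from TCGA actually mean, and whether the sample type can be read from the ID. The short answer is yes: the sample type can be determined from a TCGA sample ID.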
We use the sample sheet of the TCGA-CHOL dataset as an example. For how to download the sample sheet and a detailed walkthrough, see this video: ☞ [Downloading RNA-seq data from the new TCGA database](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649222068%26idx%3D1%26sn%3D8ebb13f8164a5cbaceb5688bb8661362%26chksm%3Df3d1a1c1c4a628d72a924c7b93fa6fc3e4a14542ae74e7bcdd6c202d673b9f1c33f5a9d556e3%26scene%3D21%23wechat_redirect)
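The resulting sample sheet is shown below. Open it in Excel and look at the last few columns. Comparing the Sample ID and Sample Type columns, it is easy to see that in this dataset IDs ending in -01A are Primary Tumor samples and IDs ending in -11A are Solid Tissue Normal samples.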
![Sample sheet](https://upload-images.jianshu.io/upload_images/24747866-8a2372fc4c9bdd97.png)
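The same comparison can be done without Excel. The following is a minimal Python sketch, not the author's method. It assumes the sample sheet is tab-separated, has columns named `Sample ID` and `Sample Type`, and holds one sample ID per row. The rows embedded in the script are made-up placeholders, not real TCGA-CHOL records, and `suffix_to_types` is a hypothetical helper name.

```python
import csv
import io
from collections import Counter

# Illustrative rows only: these barcodes are made up, and a real sample
# sheet has more columns than the two used here.
SAMPLE_SHEET_TSV = (
    "Sample ID\tSample Type\n"
    "TCGA-XX-0001-01A\tPrimary Tumor\n"
    "TCGA-XX-0001-11A\tSolid Tissue Normal\n"
    "TCGA-XX-0002-01A\tPrimary Tumor\n"
)


def suffix_to_types(handle):
    """Count how often each Sample ID suffix appears with each Sample Type."""
    counts = Counter()
    for row in csv.DictReader(handle, delimiter="\t"):
        # The suffix is everything after the last hyphen, e.g. "01A".
        suffix = row["Sample ID"].rsplit("-", 1)[-1]
        counts[(suffix, row["Sample Type"])] += 1
    return counts


pairs = suffix_to_types(io.StringIO(SAMPLE_SHEET_TSV))
for (suffix, sample_type), n in sorted(pairs.items()):
    print(f"-{suffix}\t{sample_type}\t{n}")
```

To run it on a downloaded file, replace `io.StringIO(SAMPLE_SHEET_TSV)` with an opened file handle. If each suffix appears with only one Sample Type, the suffix alone is enough to tell the sample types apart.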
This is indeed how it works. The official TCGA sample type code table
[https://gdc.cancer.gov/resources-tcga-users/tcga-code-tables/sample-type-codes](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=https%3A//gdc.cancer.gov/resources-tcga-users/tcga-code-tables/sample-type-codes)
lists which sample type each two-digit code in the suffix stands for.
![Sample type codes](https://upload-images.jianshu.io/upload_images/24747866-c6c563e03698cacf.png)
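As a rough sketch, the lookup can be written in a few lines of Python. The dictionary below holds only a handful of entries from the official table, with names as the table words them, which can differ slightly from the Sample Type column of the sample sheet. The grouping into tumor (01 to 09), normal (10 to 19), and control (20 to 29) follows the ranges given in the TCGA barcode documentation. The function names and the example barcodes are hypothetical.

```python
# A few entries from the TCGA sample type code table (not the full table).
SAMPLE_TYPE_CODES = {
    "01": "Primary Solid Tumor",
    "02": "Recurrent Solid Tumor",
    "06": "Metastatic",
    "10": "Blood Derived Normal",
    "11": "Solid Tissue Normal",
}


def sample_type_code(sample_id):
    """Return the two-digit code from an ID such as 'TCGA-XX-0001-01A'."""
    return sample_id.split("-")[3][:2]


def sample_type_name(sample_id):
    """Look up the sample type name, falling back to the raw code."""
    code = sample_type_code(sample_id)
    return SAMPLE_TYPE_CODES.get(code, f"code {code} (see the full table)")


def sample_group(sample_id):
    """Group a sample as tumor, normal, or control by its code range."""
    number = int(sample_type_code(sample_id))
    if 1 <= number <= 9:
        return "tumor"
    if 10 <= number <= 19:
        return "normal"
    if 20 <= number <= 29:
        return "control"
    return "unknown"


# Made-up barcodes, used only to exercise the functions.
for sample_id in ("TCGA-XX-0001-01A", "TCGA-XX-0001-11A"):
    print(sample_id, sample_type_name(sample_id), sample_group(sample_id))
```

Grouping by range rather than by exact code means that less common tumor codes still fall into the tumor group when splitting samples for a tumor versus normal comparison.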
So what does the letter A in -01A and -11A mean?
![Image](https://upload-images.jianshu.io/upload_images/24747866-8cc79ccd8e6eb1a1.png)
Another official TCGA document, on the TCGA barcode, gives the explanation.
[https://docs.gdc.cancer.gov/Encyclopedia/pages/TCGA_Barcode/](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=https%3A//docs.gdc.cancer.gov/Encyclopedia/pages/TCGA_Barcode/)
![TCGA barcode diagram](https://upload-images.jianshu.io/upload_images/24747866-42c4ade41cf4a948.png)
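As the diagram shows, several samples are sometimes taken from the same case, whether tumor samples or adjacent normal controls, and stored in separate vials. The letters A, B, C indicate the order of a sample within that sequence. The official documentation explains it as follows.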
![Official explanation of the vial letter](https://upload-images.jianshu.io/upload_images/24747866-a83f31fa83bef87d.png)
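To make the structure concrete, here is a small Python sketch that splits a sample ID into its parts, including the vial letter. The regular expression is an assumption about field widths (two characters for the tissue source site, four for the participant) rather than an official specification, and `SampleId` and `parse_sample_id` are hypothetical names. The long barcode `TCGA-02-0001-01C-01D-0182-01` is the example used in the barcode documentation; the short one is made up.

```python
import re
from typing import NamedTuple, Optional


class SampleId(NamedTuple):
    project: str
    tss: str             # tissue source site
    participant: str
    sample_type: str     # two-digit sample type code, e.g. "01"
    vial: Optional[str]  # order of the sample in a sequence, e.g. "A"


# Assumed layout: TCGA-<TSS>-<participant>-<sample type><vial>[-...]
SAMPLE_ID_RE = re.compile(
    r"^(?P<project>TCGA)-(?P<tss>[A-Z0-9]{2})-(?P<participant>[A-Z0-9]{4})"
    r"-(?P<sample_type>\d{2})(?P<vial>[A-Z])?"
)


def parse_sample_id(barcode):
    """Parse the leading sample portion of a TCGA barcode."""
    match = SAMPLE_ID_RE.match(barcode)
    if match is None:
        raise ValueError(f"not a recognizable TCGA sample ID: {barcode!r}")
    return SampleId(**match.groupdict())


print(parse_sample_id("TCGA-02-0001-01C-01D-0182-01"))
print(parse_sample_id("TCGA-XX-0001-11A"))
```

Because the pattern only anchors at the start, the same function accepts both the short sample IDs in the sample sheet and longer aliquot barcodes; the portion, analyte, plate, and center fields are simply ignored.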
By now you should have a better understanding of sample IDs in TCGA. If you are not yet familiar with TCGA, see the earlier posts from 生信交流平台.
We have previously covered the TCGA database in detail, from downloading and merging RNA-seq and miRNA-seq data, to downloading clinical data, to differential expression analysis.
☞ [Downloading RNA-seq data from the new TCGA database](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649222068%26idx%3D1%26sn%3D8ebb13f8164a5cbaceb5688bb8661362%26chksm%3Df3d1a1c1c4a628d72a924c7b93fa6fc3e4a14542ae74e7bcdd6c202d673b9f1c33f5a9d556e3%26scene%3D21%23wechat_redirect)
☞ [Downloading miRNA data from the new TCGA database](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649222215%26idx%3D1%26sn%3De5fa7c101ff7e6c50ce932cbb7d66b5e%26chksm%3Df3d1a632c4a62f2459d1d621d71be4d7b24c71749044700214825a60f11f54f4d5dc1e2154d7%26scene%3D21%23wechat_redirect)
☞ [Merging RNA-seq expression profiles from the new TCGA database with R code](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649221653%26idx%3D1%26sn%3D4aeada2253a160f73bda0d773597ff53%26chksm%3Df3d1a060c4a629764e3c630227175c921456d9bdfcb4e4676abaed2115f7ce282588ab19a33f%26scene%3D21%23wechat_redirect)
☞ [Merging RNA-seq expression profiles from the new TCGA database without code](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649221724%26idx%3D1%26sn%3Dd5d74ca9f93737826b9dbee363259c29%26chksm%3Df3d1a029c4a6293fc1e1fedec684818d3aa4de6a04a1e544c8505182b114bb9b1d60282a8982%26scene%3D21%26token%3D690441454%26lang%3Dzh_CN%23wechat_redirect)
☞ [Extracting mRNA or lncRNA expression matrices from TCGA](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649225187%26idx%3D1%26sn%3Dda9460075a025802a9889e36319e25d9%26chksm%3Df3d1bd96c4a6348048d2dfc7dd463f8619a1657dcb5b719f610ba18aa2cdb8e3f2552f47aec4%26scene%3D21%23wechat_redirect)
☞ [TCGA differential expression analysis with R code](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649212159%26idx%3D1%26sn%3D6cf4599c3e7a1b7a213c096ad6a43e7d%26chksm%3Df3d18e8ac4a6079cd15bf726cebf4270a115ee386bd97bc8513e8065c755c7594e5c0c793c0e%26scene%3D21%23wechat_redirect)
☞ [TCGA differential expression analysis without code](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649211897%26idx%3D1%26sn%3D2337e9692c91b096bba85a9c42613ff2%26chksm%3Df3d1898cc4a6009a9b4d73742e46fdab38f4c84f35f515deb9dce5fe0f2bec22887abdd098ad%26scene%3D21%23wechat_redirect)
From downloading somatic mutation data, to merging it into a MAF file, to drawing waterfall plots:
☞ [How to download somatic mutation data from the TCGA database](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649224023%26idx%3D1%26sn%3D23dcb5d1c3a0d2243156f6c783216b9c%26chksm%3Df3d1b922c4a630341dfba00cc1bb32fe07e8b7bc0758c08be4132063211f6b1bfc9505d9bfec%26scene%3D21%23wechat_redirect)
☞ [Video walkthrough: downloading mutation data from the TCGA database](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649224257%26idx%3D1%26sn%3D34cebb9d5ec5dc603d964b5efc872ead%26chksm%3Df3d1be34c4a637227a862f77310178a6f039414f88736897c471160da0f7da62ed4249ad62fa%26scene%3D21%23wechat_redirect)
☞ [Merging TCGA somatic mutation data with R code](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649224073%26idx%3D1%26sn%3D52e0f1ded36ad7521ecb733839bad02b%26chksm%3Df3d1b9fcc4a630ea46edc7a7947f5303693709af0644a03fcab65142823ba6a23d8abe21717f%26scene%3D21%23wechat_redirect)
☞ [Analyzing mutation data and drawing waterfall plots with the maftools package](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649224107%26idx%3D1%26sn%3D2f71693371fce7c6297ec3e2829d2cab%26chksm%3Df3d1b9dec4a630c8b3df86154c8921e206c432b3b09d47e34f663af4f145da3d5a9c4f7cb9fc%26scene%3D21%23wechat_redirect)
☞ [R in practice: reproducing a somatic mutation waterfall plot from an SCI paper with maftools](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649224469%26idx%3D1%26sn%3D8283f09e6f683674eaa1b9f9202d473a%26chksm%3Df3d1bf60c4a6367611108f14d0af1fd30965d52db12c3b2400fc5825658568256525b4a047a6%26scene%3D21%23wechat_redirect)
From downloading methylation data to merging the methylation level matrix:
☞ [How to download DNA methylation data from the TCGA database](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649224633%26idx%3D1%26sn%3D2d97027d05c867341e0930a6c5ab1287%26chksm%3Df3d1bfccc4a636da160c709dbeae2ff53cc7cf87d654adb5d37985f5c72897cfa9f7e18239e5%26scene%3D21%23wechat_redirect)
☞ [Merging DNA methylation data from the TCGA database with R code](https://links.jianshu.com/go?to=https://link.zhihu.com/?target=http%3A//mp.weixin.qq.com/s%3F__biz%3DMzI4ODE0NTE3OA%3D%3D%26mid%3D2649224687%26idx%3D1%26sn%3Dad0d705c1f8ff58a42359c1920ebc6a7%26chksm%3Df3d1bf9ac4a6368cfd2bdd8d1a1175901304be0aab0f3a5cf4bba1376ab5a5f94d212ded6b45%26scene%3D21%23wechat_redirect)
===========================
Source: Jianshu (简书)
Author: 生信交流平台
Original link: https://www.jianshu.com/p/addc91637ffe