20200102GO富集分析
第一类conserve pattern
1.在各个基因组都存在相同的IR
awk -F "_" '{print $1"_"$2}' ../converse/1_1 |sort|uniq|xargs -I {} grep {} ../../../../GhDt_Gr_GhAt_Ga_end_noScaffold |awk '{print $1"\n"$3}' |xargs -I {} grep {} ~/genome_data/Ghirsutum_genome_HAU_v1.1/Gh_Noscagenes_GO_V3.annot >1_1.go2.在各个基因组中都不存在IR的基因对
awk '{print $1"\n"$3}' ../converse/1_2|sort|uniq|xargs -I {} grep {} ~/genome_data/Ghirsutum_genome_HAU_v1.1/Gh_Noscagenes_GO_V3.annot >1_2.go3.在A基因组中存在保守的IR
awk -F "_" '{print $1}' ../converse/1_3 |sort|uniq|xargs -I {} grep {} ../../../../GhDt_Gr_GhAt_Ga_end_noScaffold |awk '{print $1"\n"$3}' |xargs -I {} grep {} ~/genome_data/Ghirsutum_genome_HAU_v1.1/Gh_Noscagenes_GO_V3.annot >1_3.go4.在D基因组中存在保守的IR
awk -F "_" '{print $1"_"$2}' ../converse/1_4 |sort|uniq|xargs -I {} grep {} ../../../../GhDt_Gr_GhAt_Ga_end_noScaffold | awk '{print $1"\n"$3}' | xargs -I {} grep {} ~/genome_data/Ghirsutum_genome_HAU_v1.1/Gh_Noscagenes_GO_V3.annot >1_4.go第二类IR在多倍化中发生丢失
1.只在Dt中发生IR丢失
2.只在At中发生IR丢失
3.在两个基因组中都发生了丢失
第三类IR在多倍化中获得新的IR事件
1.只在At中发生IR获得
2.只在Dt中发生IR获得
3.在两个基因组中都发生了获得
查看不同类别之间共有的GO号
partition splicing pattern
2020-01-07
两个亚基因组出现同样AS
同时存在AS
同时不存在AS
两个亚基因组有不同的AS
只在At中存在
只在Dt中存在
分别进行GO富集分析
Last updated