寻找motif

对于保守的剪切事件,去找对应的motif

保守的IR事件

提取内含子序列再加上左右各50bp的序列进行motif搜索

## 提取每个基因组的保守的IR对应的bed文件
awk -F "\t" '{print $1,$2-50,$3+50,$4,$5}' OFS="\t" ../converseBed/all_converse.bed|grep Ghir >At_Dt.fasta 
python ~/scripte/according_CDS_location_find_fasta_mesage.py  At_Dt.bed  ~/work/Alternative/data/Ghirsutum_genome_HAU_v1.0/Ghirsutum_genome_HAU_v1.0.fasta At_Dt.fasta

A2-At-D5

找出各个保守事件对应的基因序列不考虑正负链的关系

## At的序列
cat ../converseBed/At_D5_A2_converse.bed |awk -F "\t" '{print $1,$2,$3,$5}' OFS="\t"|grep "Ghir" >At.bed
~/software/bedtools2-2.29.0/bin/fastaFromBed -fi ~/work/Alternative/data/Ghirsutum_genome_HAU_v1.0/Ghirsutum_genome_HAU_v1.0.fasta -name -bed At.bed  -fo At.fasta 
## D5的序列
cat ../converseBed/At_D5_A2_converse.bed |awk -F "\t" '{print $1,$2,$3,$5}' OFS="\t"|grep "Gor" >D5.bed
~/software/bedtools2-2.29.0/bin/fastaFromBed -fi ~/work/Alternative/data/Gr_genome/Graimondii_221_v2.0.fa -name -bed D5.bed -fo D5.fasta
## A2的序列
cat ../converseBed/At_D5_A2_converse.bed |awk -F "\t" '{print $1,$2,$3,$5}' OFS="\t"|grep "evm" >A2.bed
~/software/bedtools2-2.29.0/bin/fastaFromBed -fi ~/work/Alternative/data/Ga_genome/G.arboreum.Chr.v1.0.fasta -name -bed A2.bed -fo A2.fasta

Last updated