KEGG sequence format

From BioPerl

Description

This file format can be parsed by the Bio::SeqIO system using the Bio::SeqIO::kegg module.

This is the KEGG "flat" format. KEGG also distributes its database in an XML format.

Example

ENTRY       10768             CDS       H.sapiens
NAME        AHCYL1
DEFINITION  S-adenosylhomocysteine hydrolase-like 1 [EC:3.3.1.1]
ORTHOLOG    KO: K01251  adenosylhomocysteinase
CLASS       Metabolism; Amino Acid Metabolism; Methionine metabolism
           [PATH:hsa00271]
           Metabolism; Metabolism of Other Amino Acids; Selenoamino acid
           metabolism [PATH:hsa00450]
POSITION    1:join(26813..26932,50794..50905,52974..53117,54122..54222,54657..
           54759,56523..56617,57185..57291,58104..58220,58427..58490,59255..
           59343,59706..59776,60133..60227,60312..60410,60811..60879,61308..
           61386,62491..62611,63434..63440)
DBLINKS     LocusLink: 10768
           GDB: 9958257
           NCBI: 21361647
           SP: O43865
CODON_USAGE       T               C               A               G
         T   7   8   1  10   8  10   6   2   7  11   1   0  11   8   0   8
         C   3  10   2  16   3   8   4   3   2   6   6  17   3   6   7   3
         A   9  15   3  14   6  11  15   1  11  11  14  24   2   8   2   4
         G  10   9   7  17  18  11  11   3  21  11  10  23   5  11   6  11
AASEQ       530
           MSMPDAMPLPGVGEELKQAKEIEDAEKYSFMATVTKAPKKQIQFADDMQEFTKFPTKTGR
           RSLSRSISQSSTDSYSSAASYTDSSDDEVSPREKQQTNSKGSSNFCVKNIKQAEFGRREI
           EIAEQDMSALISLRKRAQGEKPLAGAKIVGCTHITAQTAVLIETLCALGAQCRWSACNIY
           STQNEVAAALAEAGVAVFAWKGESEDDFWWCIDRCVNMDGWQANMILDDGGDLTHWVYKK
           YPNVFKKIRGIVEESVTGVHRLYQLSKAGKLCVPAMNVNDSVTKQKFDNLYCCRESILDG
           LKRTTDVMFGGKQVVVCGYGEVGKGCCAALKALGAIVYITEIDPICALQACMDGFRVVKL
           NEVIRQVDVVITCTGNKNVVTREHLDRMKNSCIVCNMGHSNTEIDVTSLRTPELTWERVR
           SQVDHVIWPDGKRVVLLAEGRLLNLSCSTVPTFVLSITATTQALALIELYNAPEGRYKQD
           VYLLPKKMDEYVASLHLPSFDAHLTELTDDQAKYLGLNKNGPFKPNYYRY
NTSEQ       1593
           atgtcgatgcctgacgcgatgccgctgcccggggtcggggaggagctgaagcaggccaag
           gagatcgaggacgccgagaagtactccttcatggccaccgtcaccaaggcgcccaagaag
           caaatccagtttgctgatgacatgcaggagttcaccaaattccccaccaaaactggccga
           agatctttgtctcgctcgatctcacagtcctccactgacagctacagttcagctgcatcc
           tacacagatagctctgatgatgaggtttctccccgagagaagcagcaaaccaactccaag
           ggcagcagcaatttctgtgtgaagaacatcaagcaggcagaatttggacgccgggagatt
           gagattgcagagcaagacatgtctgctctgatttcactcaggaaacgtgctcagggggag
           aagcccttggctggtgctaaaatagtgggctgtacacacatcacagcccagacagcggtg
           ttgattgagacactctgtgccctgggggctcagtgccgctggtctgcttgtaacatctac
           tcaactcagaatgaagtagctgcagcactggctgaggctggagttgcagtgttcgcttgg
           aagggcgagtcagaagatgacttctggtggtgtattgaccgctgtgtgaacatggatggg
           tggcaggccaacatgatcctggatgatgggggagacttaacccactgggtttataagaag
           tatccaaacgtgtttaagaagatccgaggcattgtggaagagagcgtgactggtgttcac
           aggctgtatcagctctccaaagctgggaagctctgtgttccggccatgaacgtcaatgat
           tctgttaccaaacagaagtttgataacttgtactgctgccgagaatccattttggatggc
           ctgaagaggaccacagatgtgatgtttggtgggaaacaagtggtggtgtgtggctatggt
           gaggtaggcaagggctgctgtgctgctctcaaagctcttggagcaattgtctacattacc
           gaaatcgaccccatctgtgctctgcaggcctgcatggatgggttcagggtggtaaagcta
           aatgaagtcatccggcaagtcgatgtcgtaataacttgcacaggaaataagaatgtagtg
           acacgggagcacttggatcgcatgaaaaacagttgtatcgtatgcaatatgggccactcc
           aacacagaaatcgatgtgaccagcctccgcactccggagctgacgtgggagcgagtacgt
           tctcaggtggaccatgtcatctggccagatggcaaacgagttgtcctcctggcagagggt
           cgtctactcaatttgagctgctccacagttcccacctttgttctgtccatcacagccaca
           acacaggctttggcactgatagaactctataatgcacccgaggggcgatacaagcaggat
           gtgtacttgcttcctaagaaaatggatgaatacgttgccagcttgcatctgccatcattt
           gatgcccaccttacagagctgacagatgaccaagcaaaatatctgggactcaacaaaaat
           gggccattcaaacctaattattacagatactaa
///
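
The layout of the record above can be illustrated with a short sketch. This is not the Bio::SeqIO::kegg implementation, only a minimal Python reading of the structure visible in the example, under these assumptions: a field starts with a keyword at the beginning of a line, indented lines continue the previous field, and "///" ends the record. The function names parse_kegg_record and sequence and the SAMPLE text (an abbreviated copy of the entry above, with both sequences truncated) are hypothetical and chosen for illustration.

```python
# Minimal sketch of reading one KEGG flat-format record.
# Assumptions (not taken from Bio::SeqIO::kegg): keyword at line start,
# indented lines are continuations, "///" terminates the record.

def parse_kegg_record(text):
    """Return a dict mapping each field keyword to its list of lines."""
    fields = {}
    current = None
    for line in text.splitlines():
        if line.startswith("///"):      # record terminator
            break
        if not line.strip():            # skip blank lines
            continue
        if line[0].isspace():           # continuation of the current field
            if current is not None:
                fields[current].append(line.strip())
        else:                           # new field: keyword, then value
            parts = line.split(None, 1)
            current = parts[0]
            value = parts[1].strip() if len(parts) > 1 else ""
            fields[current] = [value]
    return fields


def sequence(fields, key):
    """Return (declared_length, sequence) for an AASEQ or NTSEQ field."""
    lines = fields[key]
    return int(lines[0]), "".join(lines[1:])


# Abbreviated sample: sequences truncated to the first 5 residues / 15 bases.
SAMPLE = """\
ENTRY       10768             CDS       H.sapiens
NAME        AHCYL1
DBLINKS     LocusLink: 10768
            SP: O43865
AASEQ       5
            MSMPD
NTSEQ       15
            atgtcgatgcctgac
///
"""

record = parse_kegg_record(SAMPLE)
aa_len, aa_seq = sequence(record, "AASEQ")
nt_len, nt_seq = sequence(record, "NTSEQ")
print(record["NAME"][0], aa_len, len(aa_seq), nt_len, len(nt_seq))
```

The number that follows AASEQ or NTSEQ is the declared sequence length, so comparing it with the length of the joined sequence lines is a cheap sanity check for a truncated or damaged record.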