
2024-11-15 16:42:52, GGRNA.v2 : RefSeq release 226 (Sep, 2024)

LOCUS       XM_021005582            3866 bp    mRNA    linear   MAM 10-MAY-2017
DEFINITION  PREDICTED: Phascolarctos cinereus extensin-2-like (LOC110221189),
            mRNA.
ACCESSION   XM_021005582
VERSION     XM_021005582.1
DBLINK      BioProject: PRJNA384067
KEYWORDS    RefSeq; corrected model; includes ab initio.
SOURCE      Phascolarctos cinereus (koala)
  ORGANISM  Phascolarctos cinereus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Metatheria; Diprotodontia; Phascolarctidae;
            Phascolarctos.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_018344061.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Version          :: Phascolarctos cinereus Annotation
                                           Release 100
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 7.4
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            
            ##RefSeq-Attributes-START##
            ab initio   :: 26% of CDS bases
            frameshifts :: corrected 1 indel
            ##RefSeq-Attributes-END##
PRIMARY     REFSEQ_SPAN         PRIMARY_IDENTIFIER PRIMARY_SPAN        COMP
            1-332               MSTS01000110.1     3129491-3129822     c
            333-1685            MSTS01000110.1     3127930-3129282     c
            1686-2533           MSTS01000110.1     3127049-3127896     c
            2534-2625           MSTS01000110.1     3126839-3126930     c
            2626-3866           MSTS01000110.1     3125596-3126836     c
FEATURES             Location/Qualifiers
     source          1..3866
                     /organism="Phascolarctos cinereus"
                     /mol_type="mRNA"
                     /isolate="Bilbo 61053"
                     /specimen_voucher="AM:M.47724"
                     /db_xref="taxon:38626"
                     /chromosome="Unknown"
                     /sex="female"
                     /tissue_type="spleen"
                     /dev_stage="adult"
                     /geo_loc_name="Australia"
                     /collection_date="01-Aug-2015"
                     /collected_by="Australia Zoo Wildlife Hospital"
     gene            1..3866
                     /gene="LOC110221189"
                     /note="The sequence of the model RefSeq transcript was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 2 bases in 1 codon;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 3 Proteins, and 78% coverage of the
                     annotated genomic feature by RNAseq alignments"
                     /db_xref="GeneID:110221189"
     CDS             1..3069
                     /gene="LOC110221189"
                     /note="The sequence of the model RefSeq protein was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 2 bases in 1 codon"
                     /codon_start=1
                     /product="LOW QUALITY PROTEIN: extensin-2-like"
                     /protein_id="XP_020861241.1"
                     /db_xref="GeneID:110221189"
                     /translation="
MADKKLPQPPPTAPPPILLIPKAIAHPPLLCLPEVPPRPTIFSNEPPSRPLIFCPQPSPHHPALTSCVPPAQPTFFLRATLPAQPRASPGAPGPQTWPSVVLSESLPQPPPARAPSSPRPRRLGGLSEPARCHHIWEPLGAPSLCTGHHPLWGPLEGPSESTGCHPLWGAPQSPLQAYAPSSLLGAPRSPVRVHGLSPPLGAPRSLLRAYWQSSPLGGPSPQSPPAITLSGGPSQSPPSLHSVISGGHREPPPCPQAISISGGRSEPPPHLQAIRISWCCPEPPPRPPAIGSLRGSSEPPSPDKTPTLKSLLKPPSQQSKAIAAMAAQPYWITILTPPCTLVPKSEAVTYPTIIKAKSLPPAPYVIKSKTQPHPPFLKADPHPPTPSTTGSELLPQPPILSNEPSISIPPIIRFETPILKPSSPPGLSIIKSKPPPPDSIITRVEPPPDFSIISSEPPHDLSIISSEPLPDSSISSEPLPDSSIISSEPPPDSSIISSEPPHDLSIISSEPPPDFSIISSEPSPDSSIISSEPLPDSSISSEPPPDSSIFSSEPPPDPSIINSEPPPDPSIISSEPTPDSSIISSEPLPDSSIFSSEPPPDPSIISSEPPPDSSIISSEPPPDPSIISSEPTPDPSIISSEPPPDSSIISSEPLPDSSISSEPPHDLSIISSEPPPDFSIISSEPSPDPSIISSEPPPDSSIISSEPPPDFSIISSEPLPDSSIISEPLLTFHHSSEPLLTPPSSALSPLLTPPSSALSLFLTPPSSLSPVLTPPSSALSLLLTPPSSALSPVLTPPSSALSLLLTSPSSALSPVLTPPSSALSLFLTPPSSLSPVLTPPSSALTLSPPDSSIISSEPLLTHPSSALSPLLTPPSSSEPPPNPSIISSEPPPDPSIINSEPPPDSSIISSEPLPDSSISSEPLPDSSIFSSEPPPDPSIINSEPPPDPSIISSEPPPDPSIISSEPPPDSSIISSEPSPNPSIINSEPPPDSSIINSEPPPDSSITSAELPLDPSTSGSTPL"
     misc_feature    <106..1701
                     /gene="LOC110221189"
                     /note="large tegument protein UL36; Provisional; Region:
                     PHA03247"
                     /db_xref="CDD:223021"
     misc_feature    <1129..>1959
                     /gene="LOC110221189"
                     /note="CCR4-NOT transcriptional regulation complex, NOT5
                     subunit [Transcription]; Region: Not5; COG5665"
                     /db_xref="CDD:444384"
     misc_feature    <1594..3054
                     /gene="LOC110221189"
                     /note="large tegument protein UL36; Provisional; Region:
                     PHA03247"
                     /db_xref="CDD:223021"
ORIGIN      
atggcggacaagaagctgccgcagcccccaccgaccgcccctcctcccatcctgctcattcccaaggccatcgcccaccccccgctcctctgcctccccgaggtcccccctcggccgaccatcttcagcaacgagcccccttcgcgcccgctcatcttctgcccccagccctcgccccaccaccccgccctcaccagctgcgtgcccccggcgcagcccaccttctttctccgggccacgctccccgcgcagccccgggccagccccggtgccccggggccccagacctggccctccgtagtcctgtccgaaagcctgccccagcccccaccagccagagccccctcctccccccgccctcgcagactcgggggcctctccgagccagcgcgctgtcatcacatctgggagcccctcggagccccttccttgtgcacaggccatcaccccctctgggggcccctcgaaggcccctcggagtccacaggctgtcaccccctctggggggcccctcaaagtcccctccaagcctacgcaccatcatcacttctgggggcccctcggagtcctgtcagagtccacgggctgtcaccccctctgggggcccctcgcagtctcctccgagcctactggcagtcatcacctctggggggcccctcccctcagagtccaccggccatcaccctctctgggggtccctcgcaatctcctccaagcctacactcagtcatctctgggggccaccgggagccccctccttgcccacaggccatcagcatttctgggggccgctcggaaccccctcctcacctacaggctatcagaatttcttggtgctgcccggaaccccctcctcgcccacctgccattggcagcttaaggggtagctccgagcccccttctcctgacaaaacaccaactttgaagtctctcctcaagccccccagccaacagtcaaaggccatagctgccatggcagctcagccctactggatcactattctgacccctccttgtacccttgtccccaagtctgaggctgttacttaccccaccatcatcaaggctaaatcccttcctcctgctccttacgtcatcaagtctaagacccaacctcacccccctttcctcaaggctgatcctcatcctcccaccccctccaccaccggttcagagctccttcctcagccccccatactcagcaatgagccttcaatttccatcccccccatcatcaggtttgagacccccatcctcaagcctagctctcctcctggcctctccatcatcaaatctaagcctcctcctcctgactccatcatcaccagggttgagcctcctcctgatttctccatcatcagctctgagccccctcatgacctgtccatcatcagctctgagcctcttcctgactcctccatcagctctgagccccttcctgactcatccatcattagctctgagccccctcccgactcctccatcatcagctctgagccccctcatgacctgtccatcattagctctgagcctcctcctgacttctccatcattagctctgagccctctcctgactcctccatcatcagctctgagcctcttcctgactcctccatcagctctgagccccctcctgactcctccatcttcagctctgagccccctcctgacccatccatcatcaactctgagccccctcctgacccctccatcatcagctctgagcccactcctgactcctccatcatcagctctgagcctcttcctgactcctccatcttcagttctgagccccctcctgacccatccatcatcagctctgagcctcctcctgactcctccatcatcagctctgagccccctcctgacccatccatcatcagctctgagcccactcctgacccatccatcatcagctctgagccccctcctgactcctccatcatcagctctgagcctcttcctgactcctccatcagctctgagccccctcatgacct
gtccatcattagctctgagcctcctcctgacttctccatcattagctctgagccctctcctgacccctccatcatcagctctgagccccctcctgactcctccatcatcagctctgagcctcctcctgacttctccatcatcagctctgagcctcttcctgactcctccatcatctctgagcccctcctgaccttccatcatagctctgagcccctcctgactcctccatcatcagctctgagccccctcctgactcctccatcatcagctctgagcctcttcctgactcctccatcatctctgagccccgtcctgactcctccatcatcagctctgagcctcctcctgactcctccatcatcagctctgagccccgtcctgactcctccatcatcagctctgagcctcctcctgacttctccatcatcagctctgagccccgtcctgactcctccatcatcagctctgagcctcttcctgactcctccatcatctctgagccctgtcctgactcctccatcatcagctctgactctgagccctcctgactcctccatcatcagctctgagcccctcctaacccatccatcatcagctctgagccccctcctgactcctccatcaagctctgagccccctcctaacccatccatcatcagctctgagccccctcctgacccatccatcatcaactctgagccccctcctgactcctccatcatcagctctgagcctcttcctgactcctccatcagctctgagcctcttcctgactcctccatcttcagttctgagccccctcctgacccatccatcatcaactctgagccccctcctgacccatccatcatcagctctgagcctcctcctgacccatccatcatcagctctgagccccctcctgactcctccatcatcagctctgagccctctcctaatccatccatcatcaactctgagccccctcctgactcctccatcatcaactctgagccccctcctgactcctccatcaccagtgctgagcttcctcttgatccctccactagcggctctacgcctctttaacaccattagttttgaactccttccttatccctctccctcccttccatcatctctgaaccccctctttacccctccatcagcagctctcaaccccctcctcactccccattgttgctctcagcccccctctcacccactaacatcagcgtctctgaacctcctccttagcccccgaccatctgtattattagctttgaactcctttcttatcctccttcctccttcctattacctctgactccctcttcctcccaccccactatcagctctgggtccctttttctccttctctccgccattagctctgagccctctattcctccccaaccatcgctgagcccccttctcactcctctatgatcagctctctgcctctttcccattccccatcatcagatctcagctctttccttgccattaactttgagccccctctttatccccttttttctcctgtgctgagctttgagcctgtggaccctagtgttctcctgttctgggctctgtcctctctctttccccctctccagctctgagtccctcccgtacctctgaccatcaatttatctgctcttcgttcccttcagctttgagccagtcctcaggacctcagacttgttttcatccttctttcaactccaagttcgggttttccagccttcttacatacaaactgcccctactctagctctgagcttcacgtccccacttgctcagattagcctttccagccccagttcccacaccctaattcttccctttcaactgtaccccattctccaagtctcttgctgcta
//
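
Note on the PRIMARY block above: each row maps a span of the transcript (REFSEQ_SPAN) to a span on scaffold MSTS01000110.1 (PRIMARY_SPAN), all on the complement strand. The following is a minimal sketch, not part of the record, that checks that the paired spans have equal lengths and reports how many scaffold bases are skipped between consecutive blocks. The coordinates are copied from the record; the function names are hypothetical and chosen only for illustration.

```python
# (REFSEQ_SPAN, PRIMARY_SPAN) pairs copied from the PRIMARY block of the record.
# Every row is on the complement strand (COMP = c).
PRIMARY_ROWS = [
    ((1, 332), (3129491, 3129822)),
    ((333, 1685), (3127930, 3129282)),
    ((1686, 2533), (3127049, 3127896)),
    ((2534, 2625), (3126839, 3126930)),
    ((2626, 3866), (3125596, 3126836)),
]


def span_length(span):
    """Length of a 1-based, inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def check_rows(rows):
    """True if every transcript span is as long as its genomic span."""
    return all(span_length(t) == span_length(g) for t, g in rows)


def genomic_gaps(rows):
    """Scaffold bases skipped between consecutive aligned blocks.

    On the complement strand the next transcript block lies at lower
    scaffold coordinates, so the gap is measured from the start of the
    previous block down to the end of the next one.
    """
    gaps = []
    for (_, prev), (_, nxt) in zip(rows, rows[1:]):
        gaps.append(prev[0] - nxt[1] - 1)
    return gaps


if __name__ == "__main__":
    print(check_rows(PRIMARY_ROWS))                          # True
    print(sum(span_length(t) for t, _ in PRIMARY_ROWS))      # 3866
    print(genomic_gaps(PRIMARY_ROWS))                        # [208, 33, 118, 2]
```

The summed span length (3866) agrees with the length given on the LOCUS line. The 2-base gap between the last two blocks is consistent with the "deleted 2 bases in 1 codon" note in the gene and CDS features. The larger gaps are presumably introns, although the record does not state this explicitly.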

by @meso_cacase at DBCLS
This page is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).

If you use GGRNA in your work, please cite:
Naito Y, Bono H. (2012)
GGRNA: an ultrafast, transcript-oriented search engine for genes and transcripts.
Nucleic Acids Res., 40, W592-W596.