The GenBank database at the NCBI (National Center for Biotechnology Information) contains millions of nucleotide and protein sequences. The BLAST family of programs at the NCBI can be used to compare unknown sequences to all the sequences in GenBank and find sequences that match. This can be helpful for determining the possible identity of an unknown sequence and for identifying related sequences from other organisms.
In this activity you will use BLAST to identify unknown sequences and look for related sequences in other organisms.
1. Visit the entry page and download the worksheet (BLASTING through the kingdom of life).
2. Open the BLAST tutorial in a separate web browser window. We recommend resizing the windows so that you can view the tutorial in one and use BLAST in the other.
3. Open the NCBI BLAST home page in a third window. This is where you'll do the BLAST search.
4. Copy your sequence from the sequence data set (after reading the instructions), identify it using BLAST, and answer the questions on the BLAST worksheet. The worksheet includes example answers.
Highlight your sequence with the mouse, then copy it in one of three ways:
- Click the right mouse button and select Copy.
- Open the Edit menu in your web browser and select Copy.
- Use the keyboard commands (Ctrl + C for Windows, Command + C for Mac).
**Be sure to copy the entire sequence, including the > symbol and the name.**
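The > symbol matters because it marks the header line of a FASTA record: the name follows it on the same line, and the sequence follows on the lines after it. The sketch below shows one way to check a copied record before pasting it. The function names `parse_fasta_record` and `non_iupac_characters` and the short test record are hypothetical and used for illustration only. They are not part of BLAST or the worksheet.

```python
# Standard IUPAC nucleotide codes, including ambiguity codes such as R, Y, and N.
IUPAC_NUCLEOTIDES = set("ACGTURYSWKMBDHVN")


def parse_fasta_record(text):
    """Split one copied FASTA record into (name, sequence).

    Hypothetical helper: the first non-empty line must start with '>',
    and all remaining lines are joined into a single sequence string.
    """
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("record must start with a '>' header line")
    name = lines[0][1:].strip()
    # Web pages often break sequences into space-separated chunks; drop the gaps.
    sequence = "".join("".join(line.split()) for line in lines[1:]).upper()
    return name, sequence


def non_iupac_characters(sequence):
    """Return the sorted characters that are not IUPAC nucleotide codes."""
    return sorted(set(sequence) - IUPAC_NUCLEOTIDES)


# A shortened, made-up record in the same layout as the data set.
copied = ">Example\nCGAGGGACCT TTACGGCGTA\nATCCTGGRAA"
name, sequence = parse_fasta_record(copied)
print(name, len(sequence), non_iupac_characters(sequence))
```

If the header line is missing, `parse_fasta_record` raises an error, which usually means the selection started one line too low.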
Paste your sequence in the BLAST text box, using one of three methods:
- Click the right mouse button and select Paste.
- Open the Edit menu in your web browser and select Paste.
- Use the keyboard commands (Ctrl + V for Windows, Command + V for Mac).
Be aware that the NCBI website changes regularly. Although the information in the tutorial is current, the NCBI web pages may look slightly different.
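This activity uses the BLAST web page, but the same search can also be submitted from a script. The sketch below only builds the submission request and does not send it. It assumes the NCBI BLAST URL API endpoint and its `CMD`, `PROGRAM`, `DATABASE`, and `QUERY` parameters as commonly documented. The choices of `blastn` and `nt`, the function name `build_blast_submission`, and the short query are illustrative assumptions, so check the current NCBI documentation before relying on them.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumed endpoint of the NCBI BLAST URL API; verify against current NCBI docs.
BLAST_URL = "https://blast.ncbi.nlm.nih.gov/Blast.cgi"


def build_blast_submission(fasta_text, program="blastn", database="nt"):
    """Build (but do not send) a POST request that submits one query.

    Hypothetical helper: QUERY carries the same text you would paste into
    the BLAST text box, including the '>' header line.
    """
    params = {
        "CMD": "Put",          # submit a new search
        "PROGRAM": program,    # nucleotide query against a nucleotide database
        "DATABASE": database,
        "QUERY": fasta_text,
    }
    data = urlencode(params).encode("ascii")
    return Request(BLAST_URL, data=data, method="POST")


# A shortened, made-up query used only to show the encoded form.
request = build_blast_submission(">Example\nCGAGGGACCTTTACGG")
print(request.get_method(), request.full_url)
print(request.data.decode("ascii"))
```

Sending the request and retrieving the results are left out on purpose. For this activity, pasting into the web form is all you need.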
5. To get your sequence, either click its number in the list below or scroll down the page. This data set contains 16 sequences and one example sequence.
List of sequences
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
>Example
CGAGGGACCTTTACGGCGTAATCCTGGAAACCATGACAAATCCAGAACCCCAAGGCTCCCCTCTT CAGCTGATGTAGAATTTTGCCTGAGTTTGACCCATGGAAGGATTTGCTAGTCCACTTACTGGGAT AGCGGATGCCTCTCAAAGAGCATGCACAATGCCTTGCACATCTATATGAATGGAACAATGTCCCA GGCAGGGATCTGCCAACGATCCTATCTTCCTTCTTCACCATGCATTTGTTGACAGTATTTTTGAG CAGTGGCTCCGAAGGCACCGTCCTCTTCAAGAAGTTTATCCAGAAGCCAATGCACCCATTGGACA TAACCGGGAATCCTACATGGTTCTTATACCACTGTACAGAAATGGTGATTTCTTTATTTCATCCA AAGATCTGGGCTATGACTATAGCTATCTACAAGATTCAGACCCAGACTCTTTTCAAGACTACATT AAGTCCTATTTGGAACAAGCGAGTCGGATCTGGTCATGGCTCCTTGGGGCGGCGATGGTAGGGGC CGTCCTCACTGCCCTGCTGGCGGGCTTGTGAGCTTGCTGTGTCGTCACAAGAGAAAGCAGCTTCC TGAAGAAAAGCAGCCACTCCTCATGGAGAAAGAGGATTACCACAGCTTGTATCAGAGCCATTTAT AAAAGGCTTAGGCAATAGAGTAGGGCCAAAAAGCCTGACCTCACTCTAACTCAAAGTAATGTCCA GGTTCCCAGAGAATATCTGCTGGTATTTTTCTGTAAAGACCATTTGCAAAATTGTAACCTAATAC AAAGTGTAGCCTTCTTCCAACTCAGGTAGAACACACCTGTCTTTGTCTTGCTGTTTTCACTCAGC CCTTTTAACATTTTCCCCTAAGCCCATATGTCTAAGGAAAGGATGCTATTTGGTAATGAGGAACT GTTATTTGTATGTGAATTAAAGTGCTCTTATTTTAAAAAATTGAAATAATTTTGATTTTTGCCTT CTGATTATTTAAAGATCTATATATGTTTTATTGGCCCCTTCTTTATTTTAATAAAACAGTGAGAA ATCT
TCGAAATAACGCGTGTTCTCAACGCGGTCGCGCAGATGCCTTTGCTCATCAGATGCGACCGCAAC CACGTCCGCCGCCTTGTTCGCCGTCCCCGTGCCTCAACCACCACCACGGTGTCGTCTTCCCCGAA CGCGTCCCGGTCAGCCAGCCTCCACGCGCCGCGCGCGCGGAGTGCCCATTCGGGCCGCAGCTGCG ACGGTGCCGCTCAGATTCTGTGTGGCAGGCGCGTGTTGGAGTCTAAA
GTTTATTAGTGATCATGGCTAAGTTTGCGTCCATCATCGCACTTCTTTTTGCTGCTCTTGTTCTTT TTGCTGCTTTCGAAGCACCAACAATGGTGGAAGCACAGAAGTTGTGCGAAAGGCCAAGTGGGACAT GGTCAGGAGTCTGTGGAAACAATAACGCATGCAAGAATCAGTGCATTAACCTTGAGAAAGCACGAC ATGGATCTTGCAACTATGTCTTCCCAGCTCACAAGTGTATCTGCTACTTTCCTTGTTAATTTATCG CAAACTCTTTGGTGAATAGTTTTTATGTAATTTACACAAAATAAGTCAGTGTCACTATCCATGAGT GATTTTAAGACATGTACCAGATATGTTATGTTGGTTCGGTTATACAAATAAAGTTTTATTCACCA
CTCGAGACTAGTTCTCTCTCTCTCTCTCTCGTGCCGCATCTCACACCTGTGGATGGACGGCAGCTG AACCGCGGGAAACTTTCGTTCTCACTCTACCTAGATGAACTTTAGTTTATATTAAACACGCGTCGA CTCCCACACAAACCGTGCTCGTTTTACATCTTTGTCTCCGCTTTTGAAAACGAGAAGTTGAATTCG CAAGACGCAACTTTCCAGCCCCTCACTGAGCGGGCAGAGTCCGTGAAGCGATGGAGCCGTCCGTCA TTCCCGGTGCTGACATACCCGACCTTTACTCCATTAACCCGTTTAATGTCACTTTTCCCGACGACG TTTTGAGTTTCGTTCCTGATGGGAGGAACTACACCGAACCTAACCCGGTAAAGAGCCGCGGAATCA TCATCGCCATTTCCATCACCGCTC
GACATTACGGCGACCCAGTCTCCCCCGGTGTTGTCAGTGGGACTGGGCCAGACCGCAACCATCACTT GTACGGCCAGTCAAAGCATCTACAGTAACCTTGCTTGGTACCAGCAGAGAGAAGGACAGAAGCCCTC TCTCCTGATCTATGCTGCGACAACGCGATACGAAGGAGTCTCCGAGCGATTCAGCGGCAGTGGATCA GGGACCAGTTTCACCCTGACAATCAGCAACGTTCAGAATGAGGATGTCGCTGACTATTACTGTCAGA TCGCATATTCGATCTACTCCGGTTCCGTTGTTTTCGGTGAAGGAACCAAGCTCAGACTGAGCCGT
GAATTCGCGGCCGCATGGGGGAGAAGCTGCCGGTTGTGTATAAACGCTTCATCTGCTCGTTCCCGGA TTGTAATGCCACGTATAACAAGAACCGGAAGCTGCAGGCCCATCTGTGCAAGCACACGGGGGAGAGA CCGTTTCCTTGCACATATGAAGGCTGTGAGAAAGGCTTTGTGACGCTGCATCACCTGAATCGTCATG TGCTCTCCCACACCGGGGAGAAACCCTGCAAATGCGAAACGGAAAATTGCAATTTGGCGTTCACCAC AGCATCCAACATGAGGTTGCACTTCAAAAGGGCTCATTCTTCTCCGGCGCAGGTCTACGTGTGTTAT TTCGCAGACTGTGGCCAGCAGTTCAGGAAACATAACCAGCTAAAAATTCACCAGTATATCCATACAA ACCAGCAACCCTTCAAAT
GCCCAGCGTCTCTCGGAGGAAGCTAATTCTCAGGTTATCGCAGAGGAATCTCTTGTAGCTCGTGCTGA GGCTACCGTTGTCCAAGCCGCCGCTCCAACCAAATCCCTTGATCTGACAACATGGAAGTATGCTGATC TCAGAGACACTATCAACACCTCAATCGATATTGCGCTCCTGTCAGCCTGCAAGGAGGAGTTCCATCGT CGTCTCAAGGTCTACCACGCCTGGAAGATGAAGAATAAGAAGGTTGCCGCCGGCGACAAGGGCGGACC AGAGAGGGCTCCACAATCCATCTTTGAAAGTGCCCAACAATACAACCAGCTGGCACCCCCTCCGAAAG CCACCAAGGCTGCCCCAGCCAATCAGAACATCCAACGCTTCTTCAGGGTGCCTTTCTCCGTGACTGGG TCCACCGCTCAGGGTCAGATGCCCGAGAGGGGTTGGTGGTACGCCCACTTTGACGGTCAGTGGATCGC CCGCCAGATGGAGGTACACCCCACCAAGGTCCCCGTTCTTCTGGTTGCAGGTAAAGATGATGAGAACA TGTGTGAGATGAGTTTGGAGGAGACTGGGTTGACACGACGTCCCAACGCCGAGATCGTCGAGCGGGAG TTTGAGGAGCCCTGGAAGCGTAGCGGCGGTCAGCAGTACCACATGGCTGCAGTACGCAACAAGCAGGC TAGACCAACGTGGGCCACGCAGAGCTTGAA
AACAATTCATTTTTCCTGCTTTCCTAGAAAATTCTATAAAAGCTTCAAAATGAATTACTTGGTGATGA TTAGTTTGGCACTTCTCTTCGTGACAGGTGTAGAGAGTGTAAAAGACGGTTATATTGTCGACGATGTA AACTGCACATACTTTTGTGGTAGAAATGCATACTGCAACGAGGAATGTACCAAGTTGAAAGGTGAGAG TGGTTATTGCCAATGGGCAAGTCCATATGGAAACGCCTGTTATTGCTATAAATTGCCCGATCATGTAC GTACTAAAGGACCAGGAAGATGCCATGGCCGATAAATTATAAGATGGAATGTATCCTAAGTATCAATG TTAAATAAATATAATCAAAAAATT
ACAGCAAGCGAACCGGAATTGCCAGCTGGGGCGCCCTCTGGTAAGGTTGGGAAGCCCTGCAAAGTAAAC TGGATGGCTTTCTTGCCGCCAAGGATCTGATGGCGCAGGGGATCAAGATCTGATCAAGAGACAGGATGA GGATCGTTTCGCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTA TTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAG GGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCG CGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGA AGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGCC
AACCAACCCAACTCGCCGCCGCATCGCCCTCGTTAGAACGATGGCCGCGTCGGCGCTGCACCAGACCAC CAGCTTCCTCGGCACCGCCCCTCGCCGGGATGAGCTCGTCCGCCGCGTCGGCGACTCCGGTGGCCGCAT CACCATGCGCCGCACCGTCAAGAGCGCGCCCCAGAGCATCTGGTATGGACCTGACCGTCCCAAGTACCT GGGCCCGTTCTCGGAGCAGACGCCATCGTACCTGACCGGAGAGTTCCCGGGAGACTACGGGTGGGACAC GGCGGGGCTATCGGCCGACCCGGAGACGTTCGCGAGGAACAGGGAGCTGGAGGTGATCCACTCGCGGTG GGCGATGCTGGGGGCGCTGGGCTGCGTCTTCCCGGAGATCCTGTCCAAGAACGGGGTGAAGTTCGGGGA GGCGGTGTGGTTCAAGGCCGGCGCGCAGATCTTCTCCGAGGGGGGGCTCGACTACCTGGGGAACCCCAA CCTGGTGCACGCGCAGAGCATCCTCGCCATCTGGGCGGTCCAGGTGGTGCTCATGGGATTCGTCGAGGG CTACCGCGTCGGCGGCGGCCCGCTCGGCGAGGGCCTCGACAAGGTGTACCCAAGCGGCGCCTTCGACCC GCTCGGCCTCGCCGACGACCCTGACACCTTCGCCGAGCTCAAGGTGAAGGAGCTCAAGAACGGCCGCCT CGCCATGTTCTCCATGTTCGGCTTCTTCGTCCAGGCCATCGTCACCGGCAAGGGCCCCATCGAGAACCT CTTCGACCACGTCGCCGACCCCGTCGCCAACAACGCCTGGGCATACGCCACCAACTTCGTCCCCGGCAA GTGAGCACAACGACACGATCGAGATGGTGCCAACCAACGTACCATGTGTACACTTGTAGTAGCCACGCA CCGACCCTGCAGTTGCAGTTGCAGCAGTGCATGTATGTATGTACCTTAATTGTGTGTGTGTGTGTGATC GATCGAGGAGATTTCTAGCTTAATTAATTTCTGTC
ACTCAACGTAAATCAGAAGATTCCAACTGCAGTGGAAGGCAGCACATGAAATAAATTACTTGTTAGAAAG AATACTGCCAACAGCATAGCAAAATGAAATTCTTCCTGCTGCTTTCCCTCATTGGATTCTGCTGGGCCCA ATATGACCCACATACTCAATATGGACGAACTGCTATTATCCACCTGTTTGAGTGGCGCTGGGTTGATATT GCTAAGGAATGTGAGAGATACTTAGCTCCTAATGGATTTGCAGGTGTGCAGGTCTCTCCACCCAATGAAA ACATCGTAGTCCACAGCCCTTCAAGACCATGGTGGGAAAGATATCAACCAATTAGCTACAAAATATGTTC CAGGTCTGGAAATGAAGATGAATTCAGGGACATGGTGAACAGGTGCAACAATGTTGGTGTCCGTATTTAT GTGGATGCTGTCATTAACCACATGTGTGGAGTGGGGGCTCAAGCTGGACAAAGCAGTACATGTGGAAGTT ATTTCAACCCAAATAACAGGGACTTTCCTGGAGTTCCCTATTCTGGTTTTGACTTTAATGATGGAAAATG TAGAACTGCAAGTGGAGGTATCGAGAACTACCAAGATGCTGCTCAGGTCAGAGATTGTCGTCTGTCTGGC CTTCTGGATCTTGCACTTGAGAAAGATTATGTTCGAACCAAGGTGGCTGACTATATGAACCATCTCATTG ACATTGGCGTAGCAGGGTTCAGACTTGATGCTTCTAAGCACATGTGGCCTGGAGACATAAAGGCAATTTT GGACAAACTGCATAATCTCAATACAAAATGGTTCTCCCAAGGAAGCAGACCTTTCATTTTCCAAGAGGTG ATTGATCTGGGTGGTGAGGCAGTGTCAAGTAATGAGTATTTTGGAAATGGCCGTGTGACAGAATTCAAAT ATGGAGCAAAATTGGGCAAAGTTATGCGCAAGTGGGATGGAGAAAAGATGTCCTACTTAAAGAACTGGGG AGAAGGTTGGGGTTTGATGCCTTCTGACAGAGCCCTTGTGTTTGTGGACAACCATGACAATCAGCGAGGA CATGGTGCTGGGGGAGCATCCATCTTGACATTCTGGGATGCTAGACTCTATAAAATGGCTGTTGGCTTTA TGTTGGCTCATCCTTATGGTTTCACACGGGTGATGTCAAGTTACTATTGGCCAAGAAATTTCCAGAATGG AAAAGATGTCAATGACTGGGTTGGACCACCAAATAACAATGGAAAAACCAAAGAAGTGAGCATTAACCCA GACAGCACTTGTGGCAATGACTGGATCTGTGAACATCGATGGCGTCAAATAAGGAACATGGTTGCCTTCA GAAATGTCGTCAATGGTCAGCCTTTTGCAAACTGGTGGGATAATGACAGCAACCAGGTAGCTTTTGGCAG AGGAAACAAAGGACTCATTGTCTTTAACAATGATGACTGGGCTTTGTCAGAAACTTTACAGACTGGTCTT CCTGCTGGCACATACTGTGATGTCATTTCTGGAGATAAAGTCGATGGCAATTGCACTGGAATAAAAGTCT ATGTTGGCAATGATGGCAAAGCTCACTTTTCTATTAGTAACTCTGCCGAAGACCCATTTATTGCAATCCA TGCAGAGTCAAAAATATAAAATTTAAAATAAACACATATTGAGAGCATCA
TGCATCACAAGGTTAATGTGAAAACACAGCGAGAAGTCCATTTCCCAATGGACCTCTTGCAAGCCTGTGGT GCATCTGCCCCTAGGCCAGTTGCCCGTGTTTCACGTGCAACCGACCTAGACCGACGCTACAGGTGCGTCCT CAGTTTACCTGAGGAGCGTGCTCGCAGTGTTGGGTGTAAATGGTCGTCGACCCGAGCGGCGTTACGACGTG GACTCGAGGAGCTTGGCTCCCGCGAGTTCCGCCGTCGTCTCCGTTTGGCGGACGATTGCTGGCGCGCGATC TGCGCGGCCGTCTGCACGGGTCGGAAGTTTCCTTCCTTCTCGGTGACAGATCGGCCGGCAAGAGCTCGCCT TGCAAAAGTCTACCGTATGGGTCGTCGACTGCTAGTAGGTGTGGTCTGCCGAGGCGAATCGGTCG
ATAACCACACGCCTTTGGCGTGATTATCAGCTTTCAAGTTTCAGTTACTAAAACTAATACTGACTATAAAA CAGAAGCAAAAAAATTTTCGATTTTTATGAAAACGGTCGCAAAGAAGTTAGCAAAAATATATAATTTCTTT TGAAATTGTTCACTTGGCCAAGCTGCAGTTTCAATATTTTAATAAAGGGGGCAGTAAAAAGTGAAAAAAAA GAAAAGTTTCTGGCTTGTTTCTTTTTTAGTTATAGTAGCTAGTGTTTTCTTTATATCTTTTGGATTTAGCA ATCATTCTAAACAAGTTGCTCAAGCGGCTAGTGATACGACATCAACTGATCACTCAAGCAATGATACAGCT GATTCTGTTAGCGACGGTGTTATTTTGCATGCATGGTGCTGGTCGTTCAACACGATTAAAAACAACTTGAA ACAGATTCATGACGCCGGCTACACAGCGGTTCAAACTTCACCTGTTAATGAAGTTAAAGTTGGAAATAGCG GGTCTAAGTCATTAAATAACTGGTATTGGCTATATCAGCCAAC
GTGCGCGTTAGACCATATAAAGAAAAACCAATACAAACTCCAGCAAAATCTGTTGATATAAGATATACTGTA CAGTTTACTCCTTTAAACCCTGATGATGATTTCAAGCCAGTTCTCAAAGATACTAAACTATTGAAAACATTA GCTATCGGCGACACCATCACATCCCAAGAATTACTAGCTCAAGCACAAAGCATTTTAATCGAAAGCCATCCA GATTATACGATTTATGAACGTGATTCCTCAATCGTCACTCATGACAATGACATTTTCCGTACGATTTTACCA ACGGATCAAGAGTTTACTTACCATGTCAAAAATCGGGAACAAGCTTATAAGGCCAATTCTAAAACAGATATT AAAGAAAAAACGAACAACACCGAC
CTAATAATCCTTGGAATACTCCTATATTTTGTATAAAGAAGAAATCAGGGAAATGGAGAATGCTAATTGATT TTAGAGAACTTAATGCAAAAACAGAAAAAGGAGCAGAAGTCCAATTAGGATTACCTCACCCATCTGGATTAC AGAAGAGAAAGAATGTAACAGTTTTAGATATAGGAGATGCTTATTTTACCATCCCTTTAGATCCTGATTATC AGCCCTATACTGCATTTACTTTACCATCTAAGAATAATCAAAGTCCAGGAAAAAGGTATATTTGGAAATCTC TTCCACAGGGGTGGGTCTTGAGTCCCTTAATATACCAGAGCACTCTAGATAATATTCTACAACCATTTAGAA
ATGTTTTCCGGTGGCGGCGGCCCGCTGTCCCCCGGAGGAAAGTCGGCGGCCAGGGCGGCGTCCGGGTTTTTT GCGCCCGCCGGCCCTCGCGGAGCCGGCCGGGGACCCCCGCCTTGCTTGAGGCAAAACTTTTACAACCCCTAC CTCGCCCCAGTCGGGACGCAACAGAAGCCGACCGGGCCAACCCAGCGCCATACGTACTATAGCGAATGCGAT GAATTTCGATTCATCGCCCCGCGGGTGCTGGACGAGGATGCCCCCCCGGAGAAGCGCGCCGGGGTGCACGAC GGTCACCTCAAGCGCGCCCCCAAGGTGTACTGCGGGGGGGACGAGCGCGACGTCCTCCGCGTCGGGTCGGGC GGCTTCTGGCCGCGGCGCTCGCGCCTGTGGGGCGGCGTGGACCACGCCCCGGCGGGGTTCAACCCCACCGTC ACCGTCTTTCACGTGTACGACATCCTGGAGAACGTGGAGCACGCGTAC
CTCGGGTGACGAGTGGCGGACGGGTGAGTAATGTCTGGGGATCTGCCCGATAGAGGGGGATAACCACTGGAA ACGGTGGCTAATACCGTATAACGTCGCRAGACCAAAGAGGGGGACCTTCGGGCCTCTCACTATCGGATGAAC CCAKATGGGATTAGCTAGTRSGCGGGGTMACGGGCCCACCTAGGCGACKATCCCTAGCTGGTCTGAGAGGAT GACCAGCCACACTGGAACTGASACACGGYCCASACTCCTACGGGRGGCAGCAGKGGGGAATATTGCACARTG GGCGCAMGCCTGATGCASCCATGCCGYGTGTATGAAGARGGCCTTCGGGTTGTAAAGTWCTTTCAGCGGGGA GGAAGGCKATGTGGTTAATAACCGCVTYGATTGACGTTACCCGCAGAAGAAGCACCGKCTAACTCCGTGCCA GCAGCCGCGGTWATACGGAGGG
Copyright © Digital World Biology. All rights reserved.