Welcome to an introduction to Bioinformatics! Here we'll do some searches to get a feel for the kinds of biological information available on the web.In today's lab, you will explore information about one type of human colon cancer - hereditary non-polyposis colon cancer (HNPCC) and the mismatch repair gene. This is one of the "spellchecker" genes for DNA replication. You will learn its relevance to yeast and bacteria, and see how tools available on the web can help keep researchers and the public informed.
To start you will search the Online Mendelian Inheritance in Man (OMIM) database. This database is a catalog of human genes and genetic disorders authored and edited by Dr. Victor A. McKusick and his colleagues at Johns Hopkins and elsewhere. You will then follow some links to explore other relevant information available to you. Finally, you will see how similar the gene responsible for HNPCC is in a variety of organisms.
MAVQPKETLQLESAAEVGFVRFFQGMPEKPTTTVR LFDRGDFYTAHGEDALLAAREVFKTQGVIKYMGPA GAKNLQSVVLSKMNFESFVKDLLLVRQYRVEVYKN RAGNKASKENDWYLAYKASPGNLSQFEDILFGNND MSASIGVVGVKMSAVDGQRQVGVGYVDSIQRKLGL CEFPDNDQFSNLEALLIQIGPKECVLPGGETAGDM GKLRQIIQRGGILITERKKADFSTKDIYQDLNRLL KGKKGEQMNSAVLPEMENQVAVSSLSAVIKFLELL SDDSNFGQFELTTFDFSQYMKLDIAAVRALNLFQG SVEDTTGSQSLAALLNKCKTPQGQRLVNQWIKQPL MDKNRIEERLNLVEAFVEDAELRQTLQEDLLRRFP DLNRLAKKFQRQAANLQDCYRLYQGINQLPNVIQA LEKHEGKHQKLLLAVFVTPLTDLRSDFSKFQEMIE TTLDMDQVENHEFLVKPSFDPNLSELREIMNDLEK KMQSTLISAARDLGLDPGKQIKLDSSAQFGYYFRV TCKEEKVLRNNKNFSTVDIQKNGVKFTNSKLTSLN EEYTKNKTEYEEAQDAIVKEIVNISSGYVEPMQTL NDVLAQLDAVVSFAHVSNGAPVPYVRPAILEKGQG RIILKASRHACVEVQDEIAFIPNDVYFEKDKQMFH IITGPNMGGKSTYIRQTGVIVLMAQIGCFVPCESA EVSIVDCILARVGAGDSQLKGVSTFMAEMLETASI LRSATKDSLIIIDELGRGTSTYDGFGLAWAISEYI ATKIGAFCMFATHFHELTALANQIPTVNNLHVTAL TTEETLTMLYQVKKGVCDQSFGIHVAELANFPKHV IECAKQKALELEEFQYIGESQGYDIMEPAAKKCYL EREQGEKIIQEFLSKVKQMPFTEMSEENITIKLKQ LKAEVIAKNNSFVNEIISRIKVTT
BLAST searching can be slow so we have saved results from previous searches of Proteins from all organisms and, another search, Only proteins in various species of insects for you to review. Scroll through the document to see the sequences that were found and also take a look at some of the alignments. See below if you would like to try BLAST yourself.
OPTIONAL: To use BLAST, you will first need to choose the database to search. Under "Basic BLAST", click on the "protein blast" link if you want to use a protein sequence to search other protein sequences. Then copy the sequence above and paste it into the big text box under "Enter Query Seqeunce". By default you'll search proteins from all organisms. Click the BLAST button to start your search. You'll quickly see a "domain analysis" of your protein, but you may have to wait a couple of minutes to see the BLAST results. Once the results come back take a look at the alignments and notice what parts of the sequence are conserved.