FIBRONECTIN TYPE-III (FnIII) DOMAINS
Vertebrate Fibronectin precursor
(Fibronectin TYPE III DOMAIN REPEATS ONLY)
HUMAN . . .609 ..SGPVEVFITETPSQPNSHPIQ W NAPQPSHIS
KY ILRWRPKNSVGRWKEATIPGHLNSYTIKGLKPGVVYEGQLISIQQY...
MOUSE . . .609 ..TGPVQVIITETPSQPNSHPIQ W NAPEPSHIT
KY ILRWRPKTSTGRWKEATIPGHLNSYTIKGLTPGVIYEGQLISIQQYGHREVTRFDFTTSASTPVT
RAT . . . .609 ..TGPVQVIITETPSQPNSHPIQ W NAPEPSHIT
KY ILRWRPKTSTGRWKEATIPGHLNSYTIKGLTPGVIYEGQLISIQQYGHQEVTRFDFTTSASTPVT
BOVINE . . 578 ..SGPVQVIITETPSQPNSHPIQ W SAPESSHIS
KY ILRWKPKNSPDRWKEATIPGHLNSYTIKGLRPGVVYEGQLISVQH...
ZFISH . . .706 ..HRPVQVIISEAGNQPNSHPIQ W NAPASAHIT
QY ILKWRPKNTHIQWMEVTIPGHVNSYTIAGLKPGVTYEGQLISILRF...
XENOPUS. . 612. . .PVQVIITESANFPTSHPIQ W NAPQASHIK
NY ILRWKPKLKAGPWKQATIPGHLNSYTISGLKPGILYEGQLISILQYCNREVTTFDFTTT
HUMAN . . .719 . SPLVATSESVTEITASSFVVS W
VSASDTVS GF RVEYELSEEGDEPQYLDLPSTATSVNIPDLLPGRKYIVNVYQISED...
MOUSE . . .718 . SPVVATSESVTEITASSFVVS W VSASDTVS
GF RVEYELSEEGDEPQYLDLPSTATSVNIPDLLPGRKYIVNVYQISEEGKQSLILSTSQTT
RAT . . . .718 . SPVVATSESVTEITASSFVVS W VSASDTVS
GF RVEYELSEEGDEPQYLDLPSTATSVNIPDLLPGRKYIVNVYQISEEGKQSLILSTSQTT
BOVINE . . 661 . SPVVATSESVTEITASSFVVS W VSASDTVS
GF RVEYELSEEGDEPQYLDLPSTATSVNIPDLLPGRKYTVNVYEISEE...
ZFISH . . .712 ..PRMVDTSESVTEITSSSFVIS W VSASDTVS
GF RVEYELSEEGGQTGQPMILDLPHSATSVNISELLPGRKYTVNVYEVT...
XENOPUS. . 719 ..PPLVSISESVTEITASSFLVS W VSASDTVS
GF RVEYELSEDGDEKRYLELPNTATSVNIPDLLPGRRYNVNVYQITEEGEKS
HUMAN . . .810 ..APDAPPDPTVDQVDDTSIVVR W
SRPQAPIT GY RIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQE...
RAT . . . .809 . APDAPPDPTVDQVDDTSIVVR W SRPQAPIT
GY RIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPVFIQQETTGVPRS
BOVINE . . 781 ....DAPPDPTVDQVDDTSIVVR W SRPRAPIT
GY RIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQE...
ZFISH . . .808 ....DAPSDHEVEDVADTSIRIS W SKPLAPIT
GY RVVYTPSEEGSSGSVELNLPEHSTSLTLGDLRPGVLYNISIFSVEEN...
XENOPUS. . 811 . .PDAPPEHNVENVDDTSIMIK W NKPQAPIT
GY RVVYSPSVEGSSTELNLPSTANSVTLTELLPGIEYNITIYAVEDSLESV
HUMAN . . .905 .DTVPSPRDLQFVEVTDVKVTIM W
TPPESAVT GY RVDVIPVNLPGEHGQRLPISRNTFAEVTGLSPGVTYYFKVFAVSHG...
MOUSE . . .905 ..NVPPPTDLQFVELTDVKVTIM W TPPDSVVS
GY RVEVLPVSLPGEHGQRLPVNRNTFAEITGLSPGVTYLFKVFAVHQGRESNPLTAQQTT
RAT . . . .904 .DDVPAPKDLQFVEVTDVKVTIM W TPPNSAVT
GY RVDVLPVNLPGEHGQRLPVNRNTFAEVTGLSPGVTYLFKVFAVHQGRESKPLTAQQTT
BOVINE . ..874 .DKVPPPRDLQFVEVTDVKITIM W TPPESPVT
GY RVDVIPVNLPGEHGQRLPVSRNTFAEVTGLSPGVTYHFKVFAVNQG...
ZFISH . . .903 .EEVPSPTDLQFYEVSDSKITIT W TSPSSEVS
GY RVSVGEVGPDGFDEEVLSVTQNAYAEITHLQPGTLYRFFIFSLKSG...
XENOPUS. . 905 .VIVPSPTDLQLVEVTDVKIIIM W TSPQSEVS
GY RVVVKPVSPAGRDVQNLPVNRNTFAEVVNLQPGRTYSFEVYAVNRGQESEPLVGEFATKLDAP
HUMAN . . .996 ..KLDAPTNLQFVNETDSTVLVR W
TPPRAQIT GY RLTVGLTRRGQPRQYNVGPSVSKYPLRNLQPASEYTVSLVAIKGNQE...
MOUSE . . .995 ..KLDAPTNLQFVNETDRTVLVT W TPPRARIA
GY RLTAGLTRGGQPKQYNVGPLASKYPLRNLQPGSEYTATLVAVKGNQQSPKATGVFTTL
RAT . . . .995 ..KLDAPTNLQFVNETDRTVLVT W TPPRARIA
GY RLTVGLTRGGQPKQYNVGPMASKYPLRNLQPGSEYTVTLMAVKGNQQSPKATGVFTTL
BOV .. . . 965 ..KLDAPTNLQFINETDTTVIVT W TPPRARIV
GY RLTVGLTRGGQPKQYNVGPAASQYPLRNLQPGSEYAVSLVAVKGNQQ...
ZFISH . . .995 ...PDPQTDIYFSNITEDSAVIV W LPPTAQIT
GY RLFLTVEGSNPNSCVFQPSGRSYHPSLNLQPDTEYSATLHAERGN.
XENOPUS. . 997 ...LDAPTDLQFTDVTESTVVII W IPPQAKIG
RY LLSVGQTRGGQPSQFPINPSVTNHKLDNLLPGTEYTVSLVALKGNQQSASASG
HUMAN . ..1085 .LQPGSSIPPYNTEVTETTIVIT W
TPAPRI GF KLGVRPSQGGEAPREVTSDSGSIVVSGLTPGVEYVYTIQVLRDGQ...
MOUSE . . 1085 ..QPLRSIPPYNTEVTETTIVIT W TPAPRI
GF KLGVRPSQGGEAPREVTSDSGSIVVSGLTPGVEYTYTIQVLRDGQERDAPIVNRVVTP
RAT . . . 1085 ..QPLRSIPPYNTEVTETTIVIT W TPAPRI
GF KLGVRPSQGGEAPREVTSDSGSIVVSGLTPGVEYTYTIQVLRDGQERDAPIVNRVVTP
BOV . . . 1054 .LQPLGSIPHYNTEVTETTIVIT W TPAPRI
GF KLGVRPSQGGEAPREVTSESGSIVVSGLTPGVEYVYTISVLRDGQE...
ZFISH . ..1083 .TPPMGNAPYFSTDVTDTSIIVS W SPLPKI
GY KLTVRPSQGGEAPRDVTSELGSVLISGLTPGVEYTYSVQPVISGQE...
XENOPUS. .1087 ...PVGSIPHYNTEVTETTIVVT W TPVPRI
GF KLDVRPSQGGEAPREVISESGSIVISGLTPGVEYTYSISVLTDGVEKDIPITKTVVTPLSPPTNL
HUMAN . . 1175 ...SPPTNLHLEANPDTGVLTVS W
ERSTTPDIT GY RITTTPTNGQQGNSLEEVVHADQSSCTFDNLSPGLEYNVSVY...
MOUSE . . 1173 ..LSPPTNLHLEANPDTGVLTVS W ERSTTPDIT
GY RITTTPTNGQQGTSLEEVVHADQSSCTFENLNPGLEYNVSVYTVKDDKESAPISDTVVP
RAT . . . 1173 ..LSPPTNLHLEANPDTGVLTVS W ERSTTPDIT
GY RITTTPTNGQQGTALEEVVHADQSSCTFENRNPGLEYNVSVYTVKDDKESAPISDTVIP
BOV . . . 1141 TPLSPPTNLHLEANPDTGVLTVS W ERSTTPDIT
GY RITTTPTNGQQGYSLEEVVHADQSSCTFENLSPGLEYNVSVY...
CHICK . . .. 1 .PLSPPTNLRLEPNPDTGILIVS W DRSTTPGIS
GY RVTTAPTNGQQGSTLEEVVGADQTSCTFENLNPGVEYNVSVYAVKDDQESIPISKTITQ
ZFISH . . 1173 ...SPPTDLNLESNPNTGKLTVQ W NDANIPDIT
GY RVTCTPTKGQQGNSLEEFVKAGQNSCTLENLSPGVEYNVSVF...
XENOPUS. .1173 .PLSPPTNLRLQPSRDSATLTVY W DRSISPGIT
GY RISTTPTPMQVGNSLEEEVGPSQTYCVFENLSPGVEYNVSVYAVKEEEESAPLS
NEWT . . . .14 .PISPPTDLHLEPSPDFATLTVS W SRSPSPGIT
GY RINTALLLGIRLHSGYTLEEEVTESQSRVCIFDNLSPGVEYNVSVVSVKDDQESEPIWKTITQEV
MOUSE . . 1265 ..EVPQLTDLSFVDITDSSIGLR W
TPLNSSTII GY RITVVAAGEGIPIFEDFVDSSVGYYTVTGLEPGIDYDISVITLINGGESAPTTLTQQT
RAT . . . 1265 ..EVPQLTDLSFVDITDSSIGLR W TPLNSSTII
GY RITVVAAGEGIPIFEDFVDSSVGYYTVTGLEPGIDYDISVITLINGGESAPTTLTQQT
CHICK . . . 94 ..EVPQLTDLSFVDITDSSIGLR W TPLNASTII
GY RITVVAAGESVPIFEDFVDSSVGYYTVTGLEPGIDYDISVITLINGGESAPTTLTQQT
ZFISH . . 1264 ..DVPKITDLSFINVTDSTIGLS W SPLNSTAVT
GY RITVLAAGDSVPIFVEFVEPTTGFYTVHGLEPGIDYDITVTTVTE...
XENOPUS. .1268 . . PQLTDIKYDDVTDTSIDLR W TPLNSSNII
GY RITVVAAGESVPIYEEFVGPTDGYYKVSGLEPGIDYEISLITLINGGESAPTTIIQHTAVPP
NEWT . . . 113 ....PSLTDLNFVDVTDTSIDLR W TPLKGPTII
GY RVTVVAAGESVPIYEDKVGPTQGYYKVSGLEPGIDYDISVITLVTDGESAPTTLTTADCC
HUMAN . . 1266 ..AVPPPTDLRFTNIGPDTMRVT W
APPPSIDLT---NFLV RY SPVKNEEDVAELSISPSDNAVVLTNLLPGTEYVVSVSSVYE...
MOUSE . . 1356 ..AVPPPTDLRFTNIGPDTMRVT W APPPSIELT---NLLV
RY SPVKNEEDVAELSISPSDNAVVLTNLLPGTEYLVSVSSVYEQHESIPLRGRQKT
RAT . . . 1356 ..AVPPPTDLRFTNIGPDTMRVT W APPPSIELT---NLLV
RY SPVKNEEDVAELSISPSDNAVVLTNLLPGTEYLVSVSSVYEQHESIPLRGRQKT
BOV . . . 1235 ..AVPPPTDLRFTNVGPDTMRVT W APPSSIELT---NLLV
RY SPVKNEEDVAELSISPSDNAVVLTNLLPGTEYLVSVSSVY...
CHICK . . .186 ..AVPPPTDLRFTNVGPDTMRVT W TAPTSIVLS---SFLV
RY SPVKKEEDVAELTISPSDNVVVLTNLLPGTEYLVRVYSVAEQHESAPLSGIQKTG
ZFISH . ..1355 ..AVPAPYGLSFGEVTADTMLVT W KAPQVPKSSDINQYII
RY HPVDEDDETTERTVEGSENFVVLRHLVPNTEYLVSVICVYEGR
XENOPUS. .1358 ....PPPTNLRFTNIGPDNIRVT W SPPTSIELS---SYLV
RY SPVKKPDDVTELSLSPSTNMVVLSNLLPFTEYLVSVHSVYESRESSS
NEWT . . . 204 ....PTATDLRFTNVGPDSMLVT W SAPPSMVLS---SFLV
RY VPSKNEEDAAELTISPSDNMVVLTNLLPGTEYIVSVFAVYEERESTPLTGVQRTG
HUMAN. . .1357 ..GLDSPTGIDFSDITANSFTVH W
IAPRATIT GY RIRHHPEHFSGRPREDR-VPHSRNSITLTNLTPGTEYVVSIVALNGR...
MOUSE . . 1547 ..GLDSPTGFDSSDITANSFTVH W VAPRAPIT
GY IIRHHAEHSVGRPRQDR-VPPSRNSITLTNLNPGTEYVVSIIAVNGR
RAT . . . 1447 ..GLDSPTGFDSSDVTANSFTVH W VAPRAPIT
GY IIRHHAEHSAGRPRQDR-VPPSRNSITLTNLNPGTEYIVTIIAVNGREESPPLIGQQST
BOV . . ..1326 ..ALDSPSGIDFSDITANSFTVH W IAPRATIT
GY RIRHHPENMGGRPREDR-VPPSRNSITLTNLNPGTEYVVSIVALNSK...
CHICK . ...277 ...LDSPTGLDFSDITANSFTVH W IAPRATIT
GY KIRHHPEHGVGRPKEDR-VPPSRNSITLTNLLPGTEYVVSIIAVNGREESVPLVGQQTT
ZFISH . ..1448 ..LFLMPCCLQFSDVGTTSFTVR W QAPRAIIS
GY RIRYQMTSG-GRAKEER-LPPSRSHFTLTGLTPETEYSISVYAVSGSR...
XENOPUS. .1449 . .LDSPTGIAFSEITPNSFTVH W IAPRGPIT
GY RIRYQLESGAGRPKEER-VPPSRNSITLTHLIPGSEYLVSIIAINGQQESLPLAGQQATVSD
NEWT . . . 294 ...LDSPTGLDFSDTTSSSFTVY W MAPRATVT
GY KIQYHPETGGAGQKEERCVPPSRNSLTLTNLTPGTEYVVSIFAVNGRQESVPLVGQQATVSDT
HUMAN . ..1447 ..VSDVPRDLEVVAATPTSLLIS W
DAPAVTVR YY RITYGETGGNSPVQEFTVPGSKSTATISGLKPGVDYTITVYAVTG...
MOUSE ....1537 ..VSDIPRDLEVIASTPTSLLIS W EPPAVSVR
YY RITYGETGGNSPVQEFTVPGSKSTATINNIKPGADYTITLYAVTGRGDSPASSKPVSINYKT
RAT . . . 1537 ..VSDVPRDLEVIASTPTSLLIS W EPPAVSVR
YY RITYGETGGNSPVQEFTVPGSKSTATINNIKPGADYTITLYAVTGRGDSPASSKPVSINYQT
BOV . . . 1416 ..VSDVPRDLEVIAATPTSLLIS W DAPAVTVR
YY RITYGETGGSSPVQEFTVPGSKSTATISGLKPGVDYTITVYAVTG...
CHICK . . .366 ..VSDVPRDLEVNPTSPTSLEIS W DAPAVTVR
YY RITYGETGGSSPVQEFTVPGTMSRATITGLKPGVDYTITVYAVTGRGDSPASSKPVTVTYKT
ZFISH . . 1538 ...SDAPTDLEVISSTPTSITVR W DAPSVTVR
YY RITHGESGGSDAPLEFMVPGSQSTATIEDLRPGTDYTITVYAVT...
XENOPUS. .1539 . .SDVPTDLEVTSSSPNTLTIS W EAPAVSVR
YY RITYSQTGGHGPEKEFTVPGTSNTATIRGLNPGVSYTITVYAVTGRGDSPA
NEWT . . . 388 ......PTNLEVTSSTPTSMSIS W DAPPVGVR
YY RITYTETGGETPVQEFTVPGDRSDAPIRGLKPGAEYIITVYAVTGRGDSPASSKPVTVTHKT
HUMAN. . .1541 ..EIDKPSQMQVTDVQDNSISVK W
LPSSSPVT GY RVTTTPKNGPGPTKTKTAGPDQTEMTIEGLQPTVEYVVSVYAQNP...
MOUSE. . .1631 . EIDKPSQMQVTDVQDNSISVR W LPSTSPVT
GY RVTTTPKNGLGPSKTKTASPDQTEMTIEGLQPTVEYVVSVYAQNRNGESQPLVQTAVT
RAT . . . 1631 ..EIDKPSQMQVTDVQDNSISVR W LPSTSPVT
GY RVTTAPKNGLGPTKSQTVSPDQTEMTIEGLQPTVEYVVSVYAQNRNGESQPLVQTAVT
BOVI . . .1510 ..EIDKPSQMQVTDVQDNSISVR W LPSSSPVT
GY RVTTAPKNGPGPSKTKTVGPDQTEMTIEGLQPTVEYVVSVYAQNQ...
CHICK . . .460 ..EIDTPSQMQVTDVQDNSISIR W LPSSSPVT
GY RVTAVPKKGHGPTKTKNVPPDQTQVTIEGLQPTVEYMVSVYAQNQNGESLPLVETAVT
XENOPUS. .1632 ..DVDQPIDMAVTDIQDHSIHVK W SPPPGPVT
GY RVTSVPKSGQGETFSQVISPDQTEVTIVGLQPAVEYVVSIYSQGENGFSEPLVETAV
NEWT . . . 478 ..VVDKPTQLQVTDVQDHSIQVR W MPSSTPVT
GY RVTSVPKSGVGPTVSHVVPPDQTEMTIEGLEPTVEYVVSVYAQGKNGETEPLVETAVT
HUMAN . ..1631 ..NIDRPKGLAFTDVDVDSIKIA W
ESPQGQVS RY RVTYSSPEDGIHELFPAPDGEEDTAELQGLRPGSEYTVSVVALH...
MOUSE . ..1721 ..NIDRPKGLAFTDVDVDSIKIA W ESPQGQVS
RY RVTYSSPEDGIRELFPAPDGEDDTAELQGLRPGSEYTVSVVALHDDMESQPLIGIQST
RAT . . . 1721 ..NIDRPKGLAFTDVDVDSIKIA W ESPQGQVS
RY RVTYSSPEDGIHELFPAPDGDEDTAELHGLRPGSEYTVSVVALHGGMESQPLIGVQST
CHICK . ...550 ..NIDRPKGLTFTEVDVDSIKIA W ESPQGQVT
RY RVTYSSPEDGIHELLPAPGGEEDTAELHGLRPGSEYTINIVAIYDDMESLPLTGTQST
ZFISH . . 1631 ..DIDSPSQMEVTDVKDNTITVR W TPAAGPIS
RY RVNATPLTGEGPVLHAETTSDHTEITFSGLMPAVEYSIKVYALGQD...
XENOPUS. .1721 .TNIDNPKGLTFTDVGVDSIRLA W EVPDGQVT
RY RVTYSSPEDGVKELFPAPEGDDDTAELHGLRPGTEYTVSIVALHDDM
ZFISH . . 1721 ..TVDKPKDLSFTDVESSSMRIS W DSPDGVVS
SY RVLYYSPEEGERELFPAPHGDDESAVLHGLRPGTEYTVKVIALHDQ...
NEWT . . . 568 ..NIDRPKGLAFTEVDVDSLKLV W ESPKGQVT
TY KVTYSNPEDGIHELVPAPNGDEDTAQLHGLRPGAEYTVSVVALHDDMESQPLIGTQVTAI
HUMAN . . 1721 ..AIPAPTDLKFTQVTPTSLSAQ W
TPPNVQLT GY RVRVTPKEKTGPMKEINLAPDSSSVVVSGLMVATKYEVSVYALKDT...
MOUSE . . 1811 . AIPAPTNLKLSQVTPTSFTAQ W IAPSVQLT
GY RVRVNPKEKTGPMKEINLSPDSSSVIVSGLMVATKYEVSVYALKDTLTSRPAQGVITTLE
RAT . . . 1811 ..AIPAPTNLKFTQVSPTTLTAQ W TAPSVKLT
GY RVRVTPKEKTGPMKEINLSPDSTSVIVSGLMVATKYEVSVYALKDTLTSRPAQGVVTTLE
BOV . . . 1600 ..TIPAPTNLKFTQVTPTSLTAQ W TAPNVQLT
GY RVRVTPKEKTGPMKEINLAPDSSSVVVSGLMVATKYEVSVYALKDT...
CHICK . . .640 ..AIPPPTNLKFTQVTPTSLTVN W NAPNVRLT
GY RVRVNPKEKTGPMKEINLSPDSTSAVVSGLMVATKYEVSVYALKDSLTSRPAQGVVTTLE
ZFISH . . 1811 ..GIPGPTRLQFSQVGPTSFTVS W SSSDASLT
GY RVAVSPKSKSGPTKEENITPDSTEFHATGLMPGTDYEVEVYGVKN...
XENOPUS. .1814 . . PAPTNLQFSQVTPSGFSLS W HAPTVHLT
GY LVRVNPKEKTGPTKEVRLSPGVAATTVTGLMVATKYEVNVYALKDSLTSQPLQGLIS
NEWT . . . 660 ....PPPTNLLFSQITPTSVTVS W RPPNVQLT
GY RVRVHPKEKAGPMKEINLSPDSTSAVVTGLMVATKYEVSVYALKDSLTSRPAQGIVTTQENVS
HUMAN . ..1813 ..NVSPPRRARVTDATETTITIS W
RTKTETIT GF QVDAVPAN-GQTPIQRTIKPDVRSYTITGLQPGTDYKIYLYTLND...
MOUSE . . 1903 ..NVSPPRRARVTDATETTITIS W RTKTETIT
GF QVDAIPAN-GQTPVQRSISPDVRSYTITGLQPGTDYKIHLYTLNDNARSSPVIIDAST
RAT . . . 1903 ..NVSPPRRARVTDATETTITIS W RTKTETIT
GF QVDAIPAN-GQTPVQRTISPDVRSYTITGLQPGTDYKIHLYTLNDNARSSPVVIDAST
BOV . . ..1691 .ENVSPPRRARVTDATETTITIS W RTKTETIT
GF QVDAIPAN-GQTPIQRTIRPDVRSYTITGLQPGTDYKIHLYTLND...
HORSE . . . 29 .....PPRRARVTDATETTITIS W RTKTETIT
GF QVDAVPAN-GQPPIQRTIKPDVRSYTITGLQPGTDYKIYLYTLNDNARSSPVIIDAST
DOG . . . . 29 . . .PPRRARVTDATETTITIS W RTKTETIT
GF QVDAIPAN-GQNPIQRTIRPDVRSYTITGLQPGTDYKIYLYTLNDNARSSPVVIDAST
CHICK . .. 732 ..NVSPPRRARVTDATETTITIT W RTKTETIT
GF QIDAIPAASGQNPIQRTISPDVRTYTITGLQPGNDYKIYLYTLNENARSSPVVIDAST
ZFISH . ..1903 ..NISPPRRVRISNVKDSSITLT W RSKTEAIS
GF LVEATPTMGGHNPIQRTIEPDSRTYTIAGLEPGTNYKINIYTLNG...
XENOPUS. .1907 . . .PPRRPRIQDVTETTVTLS W RTKTETIT
GF QIDAIPAD-GQNPIRRTVDADLRTFTITGLQPGTDYKIYLYTLNDNARSSPVTVDV
NEWT. . . .753 .....PPRRRRITDVTETTVTIT W RTKTETIT
GF HIDAIPAG-GQNPIQRTISPDLRTYVITGLQPGTDYKIHIYTLNDNARSSPVTIDATTAV
HUMAN . ..1902 ..AIDAPSNLRFLATTPNSLLVS W
QPPRARIT GY IIKYEKPGSPPREVVPRPRPGVTEATIT-GLEPGTEYTIYVIALK...
MOUSE . . 1992 ..AIDAPSNLRFLTTTPNSLLVS W QAPRARIT
GY IIKYEKPGSPPREVVPRPRPGVTEATIT-GLEPGTEYTIYVIALKNNQKSEPLIGRKKT
RAT . . . 1992 ..AIDAPSNLRFLTTTPNSLLVS W QAPRARIT
GY IIKYEKPGSPPREVVPRPRPGVTEATIT-GLEPGTEYTIYVIALKNNQKSEPLIGRKKT
BOV . . ..1741 .....APSNLRFLATTPNSLLVS W QPPRARIT
GY IIKYEKPGSPPREVVPRPRPGVTEATIT-GLEPGTEYTIQVIALK...
HORSE . . .115 ..AIDAPSNLHFLATTPNSLLIS W QPPRARIT
GY IIKYEKPGSPPREVVPRPHPGVTEATIT-GLEPGTEYTIQVIAIKNNQKSEPLIGRRKTDE
DOG . . . .115 ..AIDAPSNLRFLATTPNSLLVS W QPPRARIT
GY IIKYEKPGSPPREVVPRPRPGVTEATIT-GLEPGTEYTIQVIALKNNQKSEPLIGRKKTDEL
CHICK . . .822 ..AIDAPSNLRFLTTTTNSLLAS W QPPRAKIT
GY IIRYDKPGSPAKELLPRPRPGTTEATIT-GLEPGTEYTIYIIAVKNNQKSEPLVGRKRT
ZFISH . ..1996 ..VISPPTNLHFTSLASTSISFT W EPPRSTIT
GY YVTYEEAGGIPKELTPRPQAARTFASIS-GLIPGTEYIIKIVALN...
XENOPUS. .1994 . .VDSPSNLRFLTTTSNSLLFT W QPPRARIT
GY IIRYEKAGGLIKEHLPRLPAGTTESTLT-NLEPGTEYIIYIIAVRNNMKSEPLVGRK
NEWT. . . .841 . ..DSPSNLRFLTTTSNSLLFS W QPPRSKLT
GY IIKYEKPGGPVREVVPRPHPGATESQQSQNLEPGTEYTIYIIAVRSNYKSGPLVGKKRTD
HUMAN . . 2101 .HGPGLNPNASTGQEALSQTTIS W
APFQDTS EY IISCHPVGTDEEPLQFRVPGTSTSATLT
MOUSE . . 2192 ..VPGLNPNASTGQEALSQTTIS W TPFQESS
EY IISCQPVGTDEEPLQFQVPGTSTSATLTGLTRGVTYNIIVEALQNQRRHKVREEVVTV
RAT . . . 2191 .HVPGLNPNASTGQEALSQTTIS W TPFQESS
EY IISCQPVGTDEEPLQFQVPGTSTSATLTGLTRGVTYNIIVEALHNQRRHKVREEVVTV
BOV . . . 1980 .HVVGLNPNASTGQEALSQTTIS W TPFQESS
EY IISCHPVGIDEEPLQFRVPGTSASATLTG
HORSE . . .313 PHVLGLNPNTSTGQEALSQTTIS W TPFQESS
EY IISCHPVGIDEEPLQFRVPGTSASATLTGLTRGATYNIIVEALKDQKRHKVREEVVTVGNSVDQG
DOG . . . .313 PHVMGLNPNASTGQEALSQTTIS W TPFQESS
EY IISCHPVGIDEEPLQFRVPGTSASATLTGLTRGATYNIIVEALKDQKRHKVREEV
ZFISH . . 2192 ...GGLFNETNLPQESQTQTTIV W QPVPHTS
EY VVWCDPITEINEKSFQMRLPGTSTSATLI
XENOPUS. .2194 PHGLGPQLNDSGVQEVASHTTIS W RPELETT
EY IISCHPIDHEEAPLQFRVPGTSSSATLNGLTRGATYNIVVEAQKGTDKHKVLEK
NEWT . . .1051 PTSHDSGPQQVDRTGQEAQTTIS W RPLLETT
EY IITCHPVGNEETPQQFTVPGTSSSATLNGLTRGATYNIIVEALKGKNKHKSRELVTV
Vertebrate Contactin (fibronectin
domains)
HUMAN 1 ...603 PPGPPGGLRIEDIRATSVALT W SRGSDNHSPIS KY TIQTKTILSDD
W KDAKTDPPIIEGNMEAARAVDLIPWMEYEFRVVATNTLGRGEPSIPSNRIKTDGAA
MOUSE 1 ...605 PPGPPGGLRIEDIRATSVALT W SRGSDNHSPIS KY TIQTKTILSDD
W KDAKTDPPIIEGNMESAKAVDLIPWMEYEFRVVATNTLGTGEPSIPSNRIKTDGAA
RAT 1 .....605 PPGPPGGLRIEDIRATSVALT W SRGSDNHSPIS KY TIQTKTILSDD
W KDAKTDPPIIEGNMESAKAVDLIPWMEYEFRVVATNTLGTGEPSIPSNRIKTDGAA
BOVINE 1 ..603 PPGPPGGLRIEDIRATSVALT W SRGSDNHSPIS KY TIQTKTILSDD
W KDAKTDPPIIEGNMEAARAVDLIPWMEYEFRVVATNTLGIGEPSIPSNKIKTDGAA
ZFISH 1 ...616 PPGPPGGVRVDEVTSDSVRVL W SHGTDNLSPIS RY TVQLRESAAQQDW
RDAATSPVNVEGNAEMATVVNLLPWTEYEFRVIATNTLGTGPPSEPSPKTTTREAR
CHICK 1 ...594 PPGPPGGIRIEEIRDTAVALT W SRGTDNHSPIS KY TIQSKTFLSEE
W KDAKTEPSDIEGNMESARVIDLIPWMEYEFRIIATNTLGTGEPSMPSQRIRTEGA
HUMAN 2 ...607 PPGPPGGVVVRDIGDTTIQLS W SRGFDNHSPIA KY TLQARTPPAGK
W KQVRTNPANIEGNAETAQVLGLTPWMDYEFRVIASNILGTGEPSGPSSKIRTREAA
RAT 2 .....609 PPGPPGGVVVRDIGDTTVQLS W SRGFDNHSPIA KY TLQARTPPSGK
W KQVRTNPVNIEGNAETAQVLGLMPWMDYEFRVSASNILGTGEPSGPSSKIRTKEAV
ZFISH 2 ...608 PPGPPGGLVVTSVNDTSVELR W SRGYDNHSPIG KY VILGRSSQTHD
W KRMRTDPANIEGNAESARVIDLRPWMDYEFQVIASNILGSGEPSMPSPRAQTKEAA
CHICK 2 ...602 PPGPPGGVVVRDIGDTTVQLS W SRGFDNHSPIA RY SIEARTLLSNK
W KQMRTNPVNIEGNAETAQVVNLIPWMDYEFRVLASNILGVGEPSLPSSKIRTKEAA
MOUSE 3 ...598 .PGPPENVKVDEITDTTAQLS W TEGTDSHSPVI SY AVQARTPFSVG
W QSVRTVPEVIDGKTHTATVVELNPWVEYEFRIVASNKIGGGEPSLPSEKVRTEEAA
RAT 3 .....597 SPGPPENVKVDEITDTTAQLS W TEGTDSHSPVI SY AVQARTPFSVG
W QNVRTVPEAIDGKTRTATVVELNPWVEYEFRVVASNKIGGGEPSLPSEKVRTEEAA
HUMAN 4 ...596 PPGPPEAVTIDEITDTTAQLS W RPGPDNHSPIT MY VIQARTPFSVG
W QAVSTVPELIDGKTFTATVVGLNPWVEYEFRTVAANVIGIGEPSRPSEKRRTEEAL
HUMAN 5 ...596 PPGPPGIVIVEEITESTATLS W SPAADNHSPIS SY NLQARSPFSLG
W QTVKTVPEIITGDMESAMAVDLNPWVEYEFRVVATNPIGTGDPSTPSRMIRTNEAV
RAT 5 .....670 PPGPPGVVIVEEITESTATLS W SPATDNHSPIS SY NLQARSPFSLG
W QTVKTVPEVITGDMESAMAVDLNPWVEYEFRVVATNPIGTGDPSIPSRMIRTNEAV
HUMAN 6 ...597 PPGPPEDVQVEDISSTTSQLS W RAGPDNNSPIQ IF TIQTRTPFSVG
W QAVATVPEILNGKTYNATVVGLSPWVEYEFRVVAGNSIGIGEPSEPSELLRTKASV
MOUSE 6 ...597 PPGPPEDVKVEHISSTTSQLS W RPGPDNNSPIQ IF TIQTRTPFSVG
W QAVATVPEILNGQTYNATVVGLSPWVEYEFRVVAGNNIGIGEPSKPSELLRTKASV
RAT 6 .....597 PPGPPEDVKVEHISSTTSQLS W RPGPDNNSPIQ IF TIQTRTPFSVG
W QAVATVPEILNGQTYNATVIGLSPWVEYEFRVVAGNNIGIGEPSKPSELLRTKASI
HUMAN 1 ...706 PNVAPSDVGGGGGRNRELTIT
W APLSREYHYGNNF GY
IVAFKPFDGEE W KKVTVTNPDTGRYVHKDETMSPSTAFQVKVKAFNNKGDGPYSLVAVINSAQDA
MOUSE 1 ...708 PNVAPSDVGGGGGTNRELTIT W APLSREYHYGNNF GY
IVAFKPFDGEE W KKVTVTNPDTGRYVHKDETMTPSTAFQVKVKAFNNKGDGPYSLVAVINSAQDA
RAT 1 .....708 PNVAPSDVGGGGGTNRELTIT W APLSREYHYGNNF GY
IVAFKPFDGEE W KKVTVTNPDTGRYVHKDETMTPSTAFQVKVKAFNNKGDGPYSLIAVINSAQDA
BOVINE 1 ..706 PNVAPSDVGGGGGSNRELTIT W APLSREYHYFNNF GY
IVAFKPFDGEE W KKVTVTNPDTGRYVHKDETMRPSTAFQVKVKAFNNKGDGPYSLTAVIHSAQDA
ZFISH 1 ...720 PIVAPSDIGGGGGTSRELTIT W TPVQSQYYYGSNF GY
IIAFKPHNDPE W LRVTVTDPEAQKYVHKDPKIPPSTRFEVKMKAFNSQGEGPFSNSAFIYSAQDV
CHICK 1 ...697 PNVAPSDVGGGGGSNRELTIT W MPLSREYHYGNNF GY
IVAFKPFGEKE W RRVTVTNPEIGRYVHKDESMPPSTQYQVKVKAFNSKGDGPFSLTAVIYSAQDA
HUMAN 2 ...710 PSVAPSGLSGGGGAPGELIVN W TPMSREYQNGDGF GY
LLSFRRQGSTH W QTARVPGADAQYFVYSNESVRPYTPFEVKIRSYNRRGDGPESLTALVYSAEEE
MOUSE2 ... 49 PSVAPSGLSGGGGAPGELTIN W TPMSREYQNGDGF GY
LLSFRRQGSSS W QTARVPGADTQYFVYSNDSIHPYTPFEVKIRSYNRRGDGPESLTAIVYSAEEE
MOUSE2* ...705 PTVAPSGLGGGGGAPNELIIN W TPTLRDYQNGDGF GY
ILSFRKKGTQG W LTARVPHAESLHYVYRNESIGPYTPFEVKIKAYNRKGEGPESLTAIVYSAEEE
RAT 2 .....712 PSVAPSGLSGGGGAPGELIIN W TPVSREYQNGDGF GY
LLSFRRQGSSS W QTARVPGADAQYFVYGNDSIQPYTPFEVKIRSYNRRGDGPESLTALVYSAEEE
ZFISH 2 ...711 PTVAPSGLGGGGGNRNELIIT W TPMAREYQNGDSF GY
ILAFRKQGIPT W TVVKVPNVESSRYVYSNDSLTAYCPFEVRIKAYNKKGEGPFSQIAVVHSAEEE
CHICK 2 ...705 PTVAPSGLGGGGGAPNELIIN W TPTLRDYQNGDGF GY
ILSFRKKGTQG W LTARVPHAESLHYVYRNESIGPYTPFEVKIKAYNRKGEGPESLTAIVYSAEEE
MOUSE 3 ...700 PEIAPSEVSGGGGSRSELVIT W DPVPEELQNGGGF GY
VVAFRPLGVTT W IQTVVTSPDNPRYVFRNESIVPFSPYEVKVGVYNNKGEGPFSPVTTVFSAEEE
RAT 3 .....700 PEVAPSEVSGGGGSRSELVIT W DPVPEELQNGGGF GY
VVAFRPLGVTT W IQTVVTSPDNPRYVFRNESIVPFSPYEVKVGVYNNKGEGPFSPVTTVFSAEEE
HUMAN 4 ...699 PEVTPANVSGGGGSKSELVIT W ETVPEELQNGRGF GY
VVAFRPYGKMI W MLTVLASADASRYVFRNESVHPFSPFEVKVGVFNNKGEGPFSPTTVVYSAEEE
HUMAN 5 ...699 PKTAPTNVSGRSGRRHELVIA W EPVSEEFQNGEGF GY
IVAFRPNGTRG W KEKMVTSSEASKFIYRDESVPPLTPFEVKVGVYNNKGDGPFSQIVVICSAEGE
RAT 5 .....773 PKTAPSNVSGGSGRRHELVIA W EPVSEEFQNGEGF GY
IVAFRPNGTRG W KEKMVTSSDASKFIYRDESVPPLTPFEVKVGVYNNKGDGPFSQIVVICSAEGE
HUMAN 6 ...700 PVVAPVNIHGGGGSRSELVIT W ESIPEELQNGEGF GY
IIMFRPVGSTT W SKEKVSSVESSRFVYRNESIIPLSPFEVKVGVYNNEGEGSLSTVTIVYSGEDE
MOUSE 6 ...700 PNVAPGNINGGGGSRSELVIT W EAIPEELQNGEGF GY
IVMFRPVGTTA W MKERVALVESSKFIYRNESIMPLSPFEVKVGVYNNEGEGSLSTVTIVYSGEDE
RAT 6 .....700 PNVAPVNINGGGGSRSELVIT W EPIPEELQNGEGF GY
IIMFRPVGSTT W MKEKVALVESSKFIYRNESIMPLSPFEVKVGVYNNEGEGSLSTVSIVYSGEDE
HUMAN 1 ...808 PSEAPTEVGVKVLSSSEISVH
W EHVLE----KIVE SY
QIRY W AAHDKEEAANRVQVTSQEYSARLENLLPDTQYFIEVGACNSAGCGPPSDMIEAFTKKA
MOUSE 1 ...810 PSEAPTEVGVKVLSSSEISVH W KHVLE----KIVE SY
QIRY W AGHDKEAAAHRVQVTSQEYSARLENLLPDTQYFIEVGACNSAGCGPSSDVIETFTRKA
RAT 1 .....810 PSEAPTEVGVKVLSSSEISVH W KHVLE----KIVE SY
QIRY W AGHDKEAAAHRVQVTSQEYSARLENLLPDTQYFIEVGACNSAGCGPSSDVIETFTRKA
BOVINE 1 ..808 PSEAPTAVGVKVLSSSEISVH W EHVVE----KIVE SY
QIRY W ASHDKEAAAHRVQVASQEYSARLENLLPDTQYFVEVRACNSAGCGPPSDMTETFTKKA
ZFISH 1 ...822 PAEAPIITEARALSATEAIVI W VPVQLP----TVE RY
QVRY W RESVENEASAQRVLVSSRENHTRLDNMKPDSHYLVEVRACNGAGYGPASQRNRIYTKKSPPSR
CHICK 1 ...799 PTEVPTDVSVKVLSSSEISVS W HHVTE----KSVE GY
QIRY W AAHDKEAAAQRVQVSNQEYSTKLENLKPNTRYHIDVSAFNSAGYGPPSRTIDIITRKA
HUMAN 2 ...811 PRVAPTKVWAKGVSSSEMNVT W EPVQQ-DMNGILL GY
EIRY W KAGDKEAAADRVRTAGLDTSARVSGLHPNTKYHVTVRAYNRAGTGPASPSANATTMK
MOUSE2 ...151 PKVAPAKVWAKGSSSSEMNVS W EPVLQ-DMNGILL GY
EIRY W KAGDKERAADRVRTAGLDSSARVTGLN
MOUSE2* ...807 PKVAPFRVTAKAVLSSEMDVS W EPVEQGDMTGV-L GY
EIRY W KDGDKEEAADRVRTAGLVTSAHVTGLNPNTKYHVSVRAYNRAGAGPPSPSTNITTTKPP
RAT 2 .....814 PRVAPAKVWAKGSSSSEMNVS W EPVLQ-DMNGILL GY
EIRY W KAGDNEAAADRVRTAGLDTSARVTGLNPNTKYHVTVRAYNRAGTGPASPSADAMTVK
ZFISH 2 ...813 PTVSPRWINATALTAFEIQVS W EPIQHLNINGVLR GY
EIRY W RQHEREAAADRVRTAGLENTARVTGLRPNTLYHITVLAYNSAGTGPASPRTTVITKK
CHICK 2 ...807 PKVAPFRVTAKAVLSSEMDVS W EPVEQGDMTGVLL GY
EIRY W KDGDKEEAADRVRTAGLVTSAHVTGLNPNTKYHVSVRAYNRAGAGPPSPSTNITTTK
MOUSE 3 ...802 PTVAPSHISAHSLSSSEIEVS W NTIPWKLSNGHLL GY
EVRY W NNGGEEESSRKVKVAGNQTSAVLRGLKSNLAYYTA-VRAYNSAGAGPFSATVNATTKKT
RAT 3 .....802 PTVAPSHISAHSLSSSEIEVS W NTIPWKSSNGRLL GY
EVRY W NNGGEEESSSKVKVAGNQTSAVLRGLKSNLAYYTAVRAYNTAGAGPFSATVNATTKKT
HUMAN 4 ...801 PTKPPASIFARSLSATDIEVF W ASPLEK-NRGRIQ GY
EVKY W RHEDKEENARKIRTVGNQTSTKITNLKGSVLYHLAVKAYNSAGTGPSSATVNVTTRKPP
HUMAN 5 ...801 PSAAPTDVKATSVSVSEILVA W KHIKE--SLGRPQ GF
EVGY W KDMEQEDTAETVKTRGNESFVILTGLEGNTLYHFTVRAYNGAGYGPPSSEVSATTKKS
RAT 5 .....875 PTAAPTDVTATSVSVSEIFVV W KHVKE--SLGRPQ GF
EIGY W KDTEPEDSAETVRTRGNESFVMLTGLEGDTLYHLTVRAYNGAGYGPPSREVSATTK RH
HUMAN 6 ...802 PQLAPRGTSLQSFSASEMEVS W NAIAWNRNTGRVL GY
EVLY W TDDSKESMIGKIRVSGNVTTKNITGLKANTIYFASVRAYNTAGTGPSSPPVNVTTKKS
MOUSE 6 ...802 PQLAPRGTSVQSFSASEMEVS W NAIAWNRNTGRVL GY
EVLY W TDNSKESMIGKIRVSGNVTTKNITGLRANTIYFASVRAYNTAGTGPSSLPVNVTTKKS
RAT 6 .....802 PRLAPRGTSVQSFSASDMEVS W NAIAWNRNTGRVL GY
EVLY W TDNSKESMIGKIRVSGNVTTKNITGLRANTIYFASVRAYNTAGTGPSSPPVNVTTKKS
HUMAN 1 ...904 PPSQPPRIISSVRSGSRYIIT
W DHVVALSNESTVT GY
KVLYRPDGQHDGKLYSTHKHSIEVPIPRDGEYVVEVRAHSDGGDGVVSQVKISGAPTLSP
MOUSE 1 ...906 PPSQPPRIISSVRSGSRYIIT W DHVVALSNESTVT GY
KILYRPDGQHDGKLFSTHKHSIEVPIPRDGEYVVEVRAHSDGGDGVVSQVKISGVSTL
RAT 1 .....906 PPSQPPRIISSVRSGSRYIIT W DHVVALSNESTVT GY
KILYRPDGQHDGKLFSTHKHSIEVPIPRDGEYVVEVRAHSDGGDGVVSQVKISGVSTL
BOVINE 1 ..904 PPSQPPRIISSVRSGSRYIIT W DHVVALSNESTVT GY
KVLYRPDGQHDGKLYSTHKHSIEVPIPRDGEYVVEVRAHSDGGDGVVSQV
ZFISH 1 ...923 ..PPKIISTKMHYSGTSINIA W EKVESLNNESTVA GY
KVLYRQHGQPSGTLYTTEKQSIDLPMRRGEYLVEVRAHSEGGDGAVAQVRITGSAPAPALASA
CHICK 1 ...895 PPSQRPRIISSVRSGSRYIIT W DHVKAMSNESAVE GY
KVLYRPDGQHEGKLFSTGKHTIEVPVPSDGEYVVEVRAHSEGGDGEVAQIKISGATAGV
HUMAN 2 ...911 PPRRPPGNISWTFSSSSLSIK W DPVVPFRNESAVT GY
KMLYQNDLHLTPTLHLTGKNWIEIPVPEDIGHALVQIRTTGPGGDGIPAEVHIVRNGGTSMMV
RAT 2 .....913 PPRRPPGNISWTFSSSSLSLK W DPVVPLRNESTVT GY
KMLYQNDLHPTPTLHLTSKNWIEIPVPEDIGHALVQIRTTGPGGDGIPAEVHIVRNGGTSM
ZFISH 2 ...913 PPNRPPGNVSWKVDGSWVTVR W DHVKSMDNESAVL GY
KVLYKHEGQSALKVLEKSKTSVKLPLPKDNGYVVLEIRSWGDGGDGAAHETIVSRDSGTGM
CHICK 2 ...907 PPRRPPGNISWTLTGSTVTIK W DPVVAQADESAVT GY
KMLYRQDSHSAPTLYLASKSRIDIPVPEDFTHAFVQIRVTGPGGDGTPAEVHIVRNS
MOUSE 3 ...902 PPSQPPGNVVWNATDTKVLLN W EQVKAMENESEVT GY
KVFYRTSSQNNVHVLNTNKTSAELLLPIKEDYIIEVKATTDGGDGTSSEQIRIPRITSMDAR
RAT 3 .....902 PPSQPPGNVVWNATDTKVLLN W EQVKALENESEVT GY
KVFYRTSSQNNVQVLNTNKTSAELLLPIKEDYIIEVKATTDGGDGTSSEQIRIPRITSMDAR
HUMAN 4 ...901 PSQPPGNIIW NSSDSKIILN W DQVKALDNESEVK GY
KVLYRWNRQSSTSV IETNKTSVELSLPFDEDYIIEIKPFSDGGDGSSSEQIRIPKISNAYARGS
HUMAN 5 ...899 PPSQAPSNLRWEQQGSQVSLG W EPVIPLANESEVV GY
KVFYRQEGHSNSQVIETQKLQAVVPLPDAGVYIIEVRAYSEGGDGTASSQIRVPS
RAT 5 .....973 PPSEPPGNLRWEQQGSQVSLG W EPVRPLANESEVM GY
KVFYRQEGHSKGQVIETQKPQAVVPLPEAGVYIIEVRAYSEGGDGTASSQIRVPSYAGGKI
HUMAN 6 ...902 PPSQPPANIAWKLTNSKLCLN W EHVKTMENESEVL GY
KILYRQNRQSKTHILETNNTSAELLVPFEEDYLIEIRTVSDGGDGSSSEEIRIPKMSSLSSR
MOUSE 6 ...902 PPSQPPANIAWKLSNSKLCLN W EHVKTMENESEVL GY
KILYRQNRQSKTHILETNNTSAELLVPFEEDYLIEIRTVSDGGDGSSSEEIRIPKMSSLSST
RAT 6 .....902 PPSQPPANIAWKLSNSKLCLN W EHVKTMENESEVL GY
KILYRQNRQSKTHVLETNNTSAELLVPFEEDYLIEIRTVSDGGDGSSSEEIRIPKMSSLSSV
Neogenin
Human Neogenin precursor. Q92859 GI:12643791 / Chick
Neogenin. Q90610 GI:10720134 /
mouse Neogenin precursor. P97798 GI:10720133 / rat Neogenin precursor. P97603
GI:10720132
zebrafish neogenin [Danio rerio] a DCC related netrin receptor AAK33004.1
GI:23428357
c.elegans Neogenin (154.4 kD) UNCoordinated locomotion (unc-40/UNC-91) NP_491664.1
GI:17509399
hNEOGEN ...439..PSAPRDVVASLVSTRFIKLT
W RTPASDPHGDNL TY
SVFYTKEGIARERVENTSHPGEMQVTIQNLMPATVYIFRVMAQNKHGSGESSAPLRVETQPEVQL
mNEOGEN ...470..PSAPRDVVASLVSTRFIKLT
W RTPASDPHGDNL TY
SVFYTKEGVDRERVENTSQPGEMQVTIQNLMPATVYIFKVMAQNKHGSGESSAPLRVETQPEVQL
RATNEO ... 408..PSAPRDVVASLVSTRFIKLT
W RTPASDPHGDNL TY
SVFYTKEGVARERVENTSQPGEMQVTIQNLMPATVYIFKVMAQNKHGSGESSAPLRVETQPEVQL
CHICKNEO ..425..PTAPRDVVATLVSTRFIRLT
W RTPVSDPQGDNL TY
SIFYTKEGINRERVENTSRPGETQVMIQNLMPETVYVFRVVAQNKHGHGESSAPLKVATQPEVQL
ZFISHNEO ..420..PSAPRDVVASLVSTRFLKLT
W RLPA-EPHGDDV TY
SVYYSLEGTNRERIVNTSRPGEMQVTIQNLMPDTKYAFRVVAHNKNGPGESSVPLKVETQPEVQV
C.EL.NEO ..440..PSAPLGLRSTSSGSRFINVE
W DPPVQR-NGNIM RY
HIFYKDNLIDRERMINSSSTSATLTSLQPSTMYLIRVTAENEAGMGKFSDSLKVTTNKEQAV
hNEOGEN ...539..PGPAPNLRAYAASPTSITVT
W ETPVSGNGEIQ NY
KLYYMEKGTDKEQDVDVSSHSY---TINGLKKYTEYSFRVVAYNKHGPGVSTPDVAVRTLSDV
mNEOGEN ...570..PGPAPNIRAYATSPTSITVT
W ETPLSGNGEIQ NY
KLYYMEKGTDKEQDIDVSSHSY---TINGLKKYTEYSFRVVAYNKHGPGVSTQDVAVRTLSDV
RATNEO .. .508.
PGPAPNIRAYATSPTSITVT W ETPLSGNGEIQ NY
KLYYMEKGTDKEQDVDVSSHSY---TINGLKKYTEYSFRVVAYNKHGPGVSTQDVAVRTLSDV
CHICKNEO ..525.
PGPAPNIRAYAGSPTSVTVT W ETPLSGNGEIQ NY
KLYYMEKGQDSEQDVDVAGLSY---TITGLKKYTEYSFRVVAYNKHGPGVSTQDVVVRTLSDV
ZFISHNEO ..519.
PGPAPNLHAVVMSPSTVSLS W DVPLIGNGEIQ NY
KIYYMEKGMDSEQDLDVNTLSY---TMTGLKKFTEYSFRLVAYNKHGPGVSTEDISVRTYSDV
C.EL.NEO ..536.
PGKVASLTTTATGPETIDIR W S-PPSGGQPAL RY
KIFYSHDPLEKNEKETLITTSTTHYTLHGMDKYTGYQIRIEAEGSNGSGLSSDTVKVRTQSDE
hNEOGEN... 633 PSAAPQNLSLEVRNSKSIMIH
W QPPAPATQNGQIT GY
KIRYRKASRKSDVTETLVSGTQLSQLIEGLDRGTEYNFRVAALTINGTGPATDWLSAETFESDLDETRV
mNEOGEN... 664 PSAAPQNLSLEVRNSKSIVIH W QPPSSTTQNGQIT GY
KIRYRKASRKSDVTETLVTGTQLSQLIEGLDRGTEYNFRVAALTVNGTGPATDWLSAETFESDLDETRV
RATNEO.. . 602 PSAAPQNLSLEVRNSKSIVIH W QPPSSATQNGQIT GY
KIRYRKASRKSDVTETVVTGTQLSQLIEGLDRGTEYNFRVAALTVNGTGPATDWLSAETFESDLDESRV
CHICKNEO.. 619 PSAAPQNLTLEARNSKSIMLH W QPPPAGTHSGQIT GY
KIRYRKVSRKSDVTESV-GGTQLFQLIEGLERGTEYNFRIAAMTVNGTGPATDWVSAETFESDLDESRV
ZFISHNEO.. 613 PSSPPQNMTVEVLNSKSLMIR W QPPPADAQNGEIT GY
KIRYRKGTRKSEVAE-ITSGSQLYQLIDGLQRGTEYMLRVSAMTVNGTGPATDWTTAETFESDLDESRV
C.EL.NEO.. 632 PSAPPVNIQAEADSSTSVRVS
W DEPEEESVNGEIT GY
RLKYKTKARGAKGNTLVIDATAREYTMGNLEPNTQYLIRMAVVNHNGTGPFSDWVSIDTPGQDKEERTL
hNEOGEN ...739 ..PEVPSSLHVRPLVTSIVVS
W TPPENQNIVVR GY
AIGYGIGSPHAQTIKVDYKQRYYTIENLDPSSHYVITLKAFNNVGEGIPLYESAVTRPHTDTS--EVDL
mNEOGEN ...770..
PEVPSSLHVRPLVTSIVVS W TPPENQNIVVR GY
AIGYGIGSPHAQTIKVDYKQRYYTIENLDPSSHYVITLKAFNNVGEGIPLYESAVTRPHTDTS--EVDL
RATNEO .. .708..
PEVPSSLHVRPLVTSIVVS W TPPENQNIVVR GY
AIGYGIGSPHAQTIKVDYKQRYYTIENLDPSSHYVITLKAFNNVGEGIPLYESAVTRPHTDTS--EVDL
CHICKNEO ..724..
PEVPSSLHVRPLVTSIVVS W TPPENQNIVVR GY
AIGYGIGSPHAQTIKVDYKQRYYTIENLDPSSHYVITLKAFNNVGEGIPLYESAVTRPHSDTS--EVDL
ZFISHNEO ..718..
PDVPSSLHVRSLVNSIVVS W TPPENQDIVVR GY
SISYGIGSPHAQTIKVDYKQRYYTIENLNPNSHYVITLKAFNNVGEGIPVYESAITRPQSDPIDPDVDL
C.EL.NEO ..738 ...GAPREIRPHAGIDYILVS
W LPPADEQNLVR GY
QIGWGLSVPDTETIRVTASTTQYKIARLHSERDYVISLRAFNNLGSGFPIYETVRTLSRETPSH
hNEOGEN ...853 PMMPPVGVQASILSHDTIRIT
W ADNSLPKHQKITDSR YY
TVRWKTNIPANTKYKNANATTLSYLVTGLKPNTLYEFSV-MVTKGRRSSTWSMTAHGTTFELVPTS
mNEOGEN ...884 PMMPPVGVQASILSHDTIRIT W ADNSLPKHQKITDSR YY
TVRWKTNIPANTKYKNANATTLSYLVTGLKPNTLYEFSV-MVTKGRRSSTWSMTAHGATFELVPTS
RATNEO .. .862 PMMPPVGVQASILSHDTIRIT W ADNSLPKHQKITDSR YY
TVRWKTNIPANTKYKNANATTLSYLVTGLKPNTLYEFSV-MVTKGRRSSTWSMTAHGATFELVPTS
CHICKNEO ..838 PMMPPVGVQASILSHDTIRIT W ADNSLPKNQKITDAR YY
TVRWKTNIPANTKYKTANATTLSYLVTGLKPNTLYEFSV-MVTKGRRSSTWSMTAHGTTFELVPTS
ZFISHNEO ..835 PMLPPVGVQASVLNQDTIKVT W ADNSLPKNQKITDAR YY
TVRWKTNIPANTKFKVANTTTLFHTVTGLKPNTLYEFSV-MVTKGRRTSTWSMTAHGTTFESIPSS
C.EL.NEO ..852 ....PVGVRAEAISATSIRVM
W TESDETAFNT----- QY
TVRYSTAVDGNQHRYV-NSTETWATVEGLRPATEYEFAVRAVASNGQLSTWSMATRNRTLAAPPSSA
hNEOGEN ...957..PPKDVTVVSKEGKPKTIIVN
W QPPSEANGKIT GY
IIYYSTDVNAEIHDWVIEPVVGNRLTHQIQELTLDTPYYFKIQARNSKGMGPMSEAVQFRTP
mNEOGEN ...988..PPKDVTVVSKEGKPRTIIVN
W QPPSEANGKIT GY
IIYYSTDVNAEIHDWVIEPVVGNRLTHQIQELTLDTPYYFKIQARNSKGMGPMSEAVQFRTP
RATNEO .. .926..PPKDVTVVSKEGKPRTIIVN
W QPPSEANGKIT GY
IIYYSTDVNAEIHDWVIEPVVGNRLTHQIQELTLDTPYYFKIQARNSKGMGPMSEAVQFRT
CHICKNEO ..942..PPKDVTVVSKEGKPRTIIVN
W QPPSEANGKIT GY
IIYYSTDVNAEIHDWVIEPVVGNRLTHQIQELTLDTPYYFKIQARNSKGMGPMSEAVQFRTP
ZFISHNEO ..939..PPKDVTVVSKEGRPKTIIVN
W QPPSEANGKIT GY
IIYYSTDVKAEVHDWVIEPVVGNRLTHQIQELTLDTTYYFKIQARNSKGMGPMS
C.EL.NEO ..948..PRDLTVLPAESGDPHSSSLH
W QPPKYSNGEIE EY
LVFYTDRASLADKDWTINYVAGDKLSHQVSNLLPKANYFFKIQARNEKGHGPFS
Human & Mouse Myomesin 1 & 2
HMYOM1 ....374.
PAAPLDVKCLEANKDYIIIS W KQPAVDGGSPIL GY
FIDKCEVGTDS W SQCNDTPVKFARFPVTGLIEGRSYIFRVRAVNKMGIGFPS
MMYOM1 ....489 .PAAPLDVVSLDANKDYIIIS
W KQPAVDGGSPIL GY
FIDKCEVGTDT W SQCNDTPVKFARFPVTGLIEGRSYIFRVRAVNKTGIGLPS
HMYOM2 ....383 .PGAPMDLQCHDANRDYVIVT
W KPPNTTTESPVM GY
FVDRCEVGTNN W VQCNDAPVKICKYPVTGLFEGRSYIFRVRAVNSAGISRPS
MMYOM2 ....383 .PGAPMDLQCHDANRDYVIVT
W KPPNTTTESPVI GY
FIDKCEVGTNN W VQCNDAPVKICKYPVTGLFEGRSYVFRVRAVNNAGISRPS
HMYOM1 ....501 .PGPPTDLSVTEATRSYVVLS
W KPPGQRG-HEGI MY
FVEKCEAGTEN W QRVNTELPVKSPRFALFDLAEGKSYCFRVRCSNSAGVGEPS
MMYOM1 ....617 .PGPPTDLSVTEATRSYVVLS
W KPPGQRG-HEGI MY
FVEKCDVGAEN W QRVNTELPVKSPRFALFDLVEGKSYRFRVRCSNSAGVGEPS
HMYOM2 ....511 .PGPPTGVHASEISRNYVVLS
W EPPTPRG-KDPL MY
FIEKSVVGSGT W QRVNAQTAVRSPRYAVFDLMEGKSYVFRVLSANRHGLSEPSEIT
MMYOM2 ....511 .PGPPTNVQASEVSRNYVVLS
W DPPSPRG-KDPL MY
FIEKSAVGSGS W QRVNAQTAVRSPRYAVFDLAEGKSYVFRVLSANKHGLSDPSEIT
HMYOM1 ....602 .PKAPGKIIPSRNTDTSVVVS
W EESKDA--KELV GY
YIEANVAGSGK W EPCNNNPVKTHRFTCHGLVTGQSYIFRVRAVNAAGLSEYS
MMYOM1 ....718 .PKAPGKIIPSRNTDTSVVVS
W EESRDA--KELV GY
YIEASVVGSGK W EPCNNNPVKGSRFTCHGLTTAQSYIFRVRAVNAAGLSEYS
HMYOM2 ....612 .PSAPGRVLASRNTKTSVVVQ
W DRPKHE--EDLL GY
YVDCCVAGTNL W EPCNHKPIGYNRFVVHGLTTGEQYIFRVKAVNAVGMSENSQ
MMYOM2 ....612 .PSAPGRVLASRNTKTSVVVQ
W DRPKHE--EDLL GY
YVDCCVAGTNM W EPCNHKPIGYNRFVVHGLTTGEQYIFRVKAVNAVGTSENSQ
HMYOM1 ....701 .PSPPCDITCLESFRDSMVLG
W KQPDKTGGAEIT GY
YVNYREVIDGVPGK W REANVKAVREEAYKISNLKENMVYQFQVAAMNMAGLGAPS
MMYOM1 ....915 .PSAPYDITCLESFRDSMVLG
W KQPDTTGGAEIT GY
YVNYREVVGEVPGK W REANIKAVSDAAYKISNLKENTLYQFQVSAMNIAGLGAPS
RATSIM1.....01 .PSPPYDITCLESFRDSMVLG
W KQPDKTGGAEIT GY
YVNYREVVGGVPGK W REANIKAVSDAAYKISELKENTVYQFQVSAMNIAGLGA
HMYOM2 ....711 .PSHPYGITLLNCDGHSMTLG
W KVPKFSGGSPIL GY
YLDKREV---HHKN W HEVNSSPSKPTILTVDGLTEGSL
MMYOM2 ....711 .PSHPYGITLLNCDGHSMTLG
W KVPKFSGGSAII GY
YLDKREV---HHKN W HEVNSSPVKERILTVEGLTEGSLYEFKIAATNLAGIGQPSD
HMYOM1 ....806 .PGPPHSLKCSEVRKDSLVLQ
W KPPVHSGRTPVT GY
FVDLKEK-AKEDQ W RGLNEAAIKNVYLKVRGLKEGVSYVFRVRAINQAGVGKPSDLAGPVV
MMYOM1 ...1020 .PGPPHSVKLSEVRKNSLVLQ
W KPPVYSGRTPVT GY
FVDLKEASAKDDQ W RGLNEAAIVNKYLRVQGLKEGTCYVFRVRAVNQAGVGKPS
RATSIM ....206 .PGPPHSLKLSEVRKNSLVLQ
W KPPVYSGRTPVT GY
FVDLKEASAKDDQ W RGLNEAAIPNKYLRVQGLKEGISYVFRVRAINQAGVGKPSDLAGPV
HMYOM2 ....813 .PGPAYDLTFCEVRDTSLVML
W KAPVYSGSSPVS GY
FVDFREEDA--GE W ITVDQTTTASRYLKVSDLQQGKTYVFRVRAVNANGVGKPSD
MMYOM2 ....813 .PGPAYDLTFCEVRDTSLVIL
W KAPVYSGSSPVS GY
FVDFKEEDS--GE W KTTSEAATPNRYLKVCDLQQGKTYVFRIRAVNASGPGKPSD
Integrin beta-4
HUMAN ....1127.
LGAPQNPNAKAAGSRKIHFN W LPPS---GKPM GY
RVKYWIQGDSESEAHLLDSKVPSVELTNLYPYCDYEMKVCAYGAQGEGPYSSLVSCR
RAT ......1129.
LGAPQNPNAKAAGSRKIHFN W LPPP---GKPM GY
RVKYWVQGDSESEAHLLDSKVPSVELTNLYPYCDYEMKVCAYGAHGEGPYS
MOUSE ....1125.
LGAPQNPNAKAAGSRKIHFN W LPPP---GKPM GY
RVKYWIQGDSESEAHLLDSKVPSVELTNLYPYCDYEMKVCAYGAQGEGPYSSLVS CRTHQEV
HUMAN ....1220 .PSEPGRLAFNVVSSTVTQLS
W AEPAETNGEIT AY
EVCYGLVNDDNRPIGPMKKVLVDNPKNRMLLIENLRESQPYRYTVKARNGAGWGPEREAI
RAT ......1222 .PSEPGRLAFNVVSSTVTQLS
W AEPAETNGEIT AY
EVCYGLVNEDNRPIGPMKKVLVDNPKNRMLLIENLRESQPYRYTVKARNGAGWGP
MOUSE ....1218 .PSEPGRLAFNVVSSTVTQLS
W AEPAETNGEIT AY
EVCYGLVNEDNPRTHWTYEEGARGQPQEPDAAH
HUMAN ....1528 .PDTPTRLVFSALGPTSLRVS
W QEPRC-ERPLQ GY
SVEYQLLNGGELHRLNIPNPAQTSVVVEDLLPNHSYVFRVRAQSQEGWGREREGVI
RAT ......1512.
PDTPTRLVFSALGPTSLKVS W QEPQC-DRALL GY
SVEYQLLNGGEMHRLNIPNPGQTSVVVEDLLPNHSYVFRVRAQSQEGWGRE.
HUMAN ....1641.
PSAPGPLVFTALSPDSLQLS W ERPRRPNGDIV GY
LVTCEMAQGGGPATAFRVDGDSPESRLTVPGLSENVPYKFKVQARTTEGFGPEREGII
RAT ......1625.
PSAPGPLVFTALSPDSLQLS W ERPRRPNGDIL GY
LVTCEMAQGGGPARTFRVDGDNPESRLTVPGLSENVPYKFKVQARTTEGFGPEREGIITIES
Cytokine receptor common gamma chain precursor (Gamma-C)
humang2 ...151 LVIPWAPENLTLHKLSESQLELN W NNRFL-NHCLEHLV QY
RTDWDHSWTEQSVDYRHKFSLPSVDGQKRYTFRVRSRFNPLCGSAQHWSEWSHPIHWGSN
bovineg2 ..158 LVIPWAPENLTLRNLSEFQLELS W SNRYL-DHCLEHLV QY
RSDRDRSWTEQSVDHRHSFSLPSVDAQKLYTFRVRSRYNPLCGSAQHWSDWSYPIHWGSN
dogg2 .....151 LVIPWAPENLTLHNLSESQLELS W SNRHL-DHCLEHVV QY
RSDWDRSWTEQSVDHRNSFSLPSVDGQKFYTFRVRSRYNPLCGSAQRWSEWSHPIHWGSN
mouseg2 ...151 LVIPRAPENLTLSNLSESQLELR W KSRHIKERCLQYLV QY
RSNRDRSWTELIVNHEPRFSLPSVDELKRYTFRVRSRYNPICGSSQQWSKWSQPVHWGSH
Interleukin-6 receptor beta chains
HUMAN6Rb ..223 NPPHNLSVINSEELSSILKLT
W TNPSIKSVIILKYNI QY
RTKDASTWSQIPPEDTASTRSSFTVQDLKPFTEYVFRIRCMKEDGKGYWSDWSEEASGITYED
MOUSE6Rb ..221 TPPYNLSVTNSEELSSILKLS W VSSGLGGLLDLKSDI QY
RTKDASTWIQVPLEDTMSPRTSFTVQDLKPFTEYVFRIRSIKDSGKGYWSDWSEEASGTTYED
RAT6Rb ....222 SPPHNLSVTNSEELSSILKLA W VNSGLDSILRLKSDI QY
RTKDASTWIQVPLEDTVSPRTSFTVQDLKPFTEYVFRIRSIKENGKGYWSDWSEEASGTTYED
HUMAN6Rb ..325 RPSKAPSFWYKIDPSHTQGYRTVQLV
W KTLPPFEANGKIL DY
EVTLTRWKSHLQNYTVNATKLTVNLTNDRYLATLTVRNLVGKSDAAVLTIPAC
MOUSE6Rb ..323 RPSRPPSFWYKTNPSHGQEYRSVRLI W KALPLSEANGKIL DY
EVILTQSKSVSQTYTVTGTELTVNLTNDRYVASLAARNKVGKSAAAVLTIPSPHVT
RAT6Rb ....324 RPSKAPSFWYKVNANHPQEYRSARLI W KTLPLSEANGKIL DY
EVVLTQSKSVSQTYTVNGTELIVNLTNNRYVASLAARNVVGKSPATVLTIPGSHFKA
HUMAN6Rb ..420 DFQATHPVMDLKAFPKDNML
W VEWTTPRESVK KY
ILEWCVLSDKAPCITDWQQEDGTVHRTYLRGNLAESKCYLITVTPVYADGPGSPESIKAYLKQA
MOUSE6Rb ..421 ...AAYSVVNLKAFPKDNLL
W VEWTPPPKPVS KY
ILEWCVLSENAPCVEDWQQEDATVNRTHLRGRLLESKCYQITVTPVFATGPGGSESLKAYLKQA
RAT6Rb .. .423 .SHPVVDLKAFPKDNLLWVE
W TPPS---KPVN KY
ILEWCVLSENSPCIPDWQQEDGTVNRTHLRGSLLESKCYLITVTPVFPGGPGSPESMKAYLKQA
HUMAN6Rb ..518 PPSKGPTVRTKKVGKNEAVLE
W DQLPVDVQNGFIR NY
TIFYRTIIGNETAVNVDSSHTEYTLSSLTSDTLYMVRMAAYTDEGGKDGPEFTFTTPKF
MOUSE6Rb ..516 APARGPTVRTKKVGKNEAVLA W DQIPVDDQNGFIR NY
SISYRTSVGKEMVVHVDSSHTEYTLSSLSSDTLYMVRMAAYTDEGGKDGPEFTFTTPKF
RAT6Rb ....517 APSKGPTVRTKKVGKNEAVLE W DHLPVDVQNGFIR NY
SISYRTSVGKEMVVRVDSSHTEYTLSSLSSDTLYMVHMAAYTEEGGKDGPEFTFTTLKF
Interleukin-7 receptor alpha
HUMAN .....128 KPEAPFDLSVIYREGANDFVVTFNTSHLQKKYVKVLMHDVAYRQEKDENKWTHVNLSSTKLTLLQRKLQPAAMYEIKVRSIPDHYFKGFWSEWSPSY
MOUSE .....128 KAEAPSDLKVVYRKEANDFLVTFNAPHLKKKYLKKVKHDVAYRPARGESNWTHVSLFHTRTTIPQRKLRPKAMYEIKVRSIPHNDYFKGFWSEWSPSS
INTERLEUKIN-9 RECEPTOR PRECURSOR
HUMAN9R ...150 PSDLQSNISSGHCILT
W SISPALEPMTTLL SY
ELAFKKQEEAWEQAQHRDHIVGVTWLILEAFELDPGFIHEARLRVQMATLEDDVVEEERYTGQ
MOUSE9R ...149 PSDLQSNVSSGRCVLT W
GINLALEPLITSL SY ELAFKRQEEAWEARHKDRIVGVTWLILEAVELNPGSIYEARLRVQMTLESYEDKTEGEYYKS
Interleukin-12 RECEPTOR beta-1 chains
HUMANb1 ....43 .SASGPRDLRCYRISSDRYECS
W QYEGPTAGVSHFLRCCLSSGRCCYFAAGSATRLQFSDQAGVSVLYTVTLWVESWARNQTEKSPEVTLQLYNSV
HUMANb1 ...138 .KYEPPLGDIKVSKLAGQLRME
W ETPDNQVGAEVQFRHRTPSSPWKLGDCGPQDDDTESCLCPLEMNVAQEFQLRRRQLGSQGSSWSKWSSPVCVPPENP
HUMANb1 ...237 .PQPQVRFSVEQLGQDGRRRLTLKEQPTQLELPEGCQGLAPGTEVTYRLQLHMLSCPCKAKATRTLHLGKMPYLSGAAYNVAVISSNQFGPG
LNQTWHIPAD
HUMANb1 ...338 .THTEPVALNISVGTNGTTMY
W PARAQSMTYCIEWQPVGQDGGLATCSLTAPQDPDPAGMATYSWSRESGAMGQEKCYYITIFASAHPEKLTLWSTVLS
humanb1 ...446 .AGTPHHVSVKNHSLDSVSVD
W APSLLSTCPGVLK EY
VVRCRDEDSKQVSEHPVQPTETQVTLSGLRAGVAYTVQVRADTAWLRGVWSQPQRFSIE
mouseb1 ...467 .AGTPRHVSVRNQTGDSVSVE
W TASQLSTCPGVLT QY
VVRCEAEDGAWESEWLVPPTKTQVTLDGLRSRVMYKVQVRADTARLPGAWSHPQRFSF
ratb1 ....467 .AGTPRHVSVRNQTGDSVSVE
W TASQLSTCPGVLT QY
VVRCEAEDGAWESEWLVPPTKTQVTLDGLRSRVMYKVQVRADTARLPGAWSHPQRFSF ..
Interleukin-12 RECEPTOR beta-2 chains
humanb2 ...226 .PPWDIRIKFQKASVSRCTLY
W RDEGLVLLNRL RY
RPSNSRLWNMVNVTKAKGRHDLLDLKPFTEYEFQISSKLHL.YKGSWSDWSESLRAQTPEEE
cowb2 .....226 .PPWDIRIKFVNASVDRCTLL
W RDEGLVLLNRL RY
RPINSRSWNMVNVTNAKGRHDLLDLKPFTEYEFQISSKLHL.
humanb2 ...421 .LLAPRQVSANSEGMDNILVT
W QPPRKDPSAVQ EY
VVEWRELHPGGDTQVPLNWLRSRPYNVSALISENIKSYICYEIRVYALSGDQGGCSSILGNSKHKA
mouseb2 ...436 .LLAPHQVSAKSENMDNILVT
W QPPKKADSAVR EY
IVEWRALQPGSITKFPPHWLRIPPDNMSALISENIKPYICYEIRVHALSESQGGCS
COWb2 .....421 .LLAPQQVLAKSEGMDKLMVT
W TPPEKATAAVQ EY
VVEWRELHPGAGMQPPLGWLWSPPYRLSALISENIKPYICYEIRVHALAGDQGGCNSTRGNSQHKA
ratb2 ....613 .FLAPHQVSAKSGNMDNIMVT
W QPPGKADSVIR EY
IVEWRALQQGSVMNLTPHWLRIPPYNMSALISEKIKPYICYEIRVHALSEDQGGCSSIRGDSSHKV
humanb2 ...521 .PLSGPHINAITEEKGSILIS
W NSIPVQEQMGCLL HY
RIYWKERDSNSQPQLCEIPYRVSQNSHPINSLQPRVTYVLWMTALTAAGESSHGNEREFCLQGK
mouseb2 ...536 .PVSGPHITAITEKKERLFIS
W THIPFPEQRGCIL HY
RIYWKERDSTAQPELCEIQYRRSQNSHPISSLQPRVTYVLWMTAVTAAGESPQGNEREF
cowb2 .....521 .PLSGPHINAISEEKGSVLIS
W DEIPAREQMGCIL HY
RIYWKERDSNSQPQLCEIPYRISPNSHPIDSLQPRVTYVLWMTA
ratb2 ....713 .PLSGPHITAITEKKESLFIS
W THIPFLEQRGCIL HY
RIYWKERHSTAQPELCEIQYRHSQNSHPISSLQPKVTYVLWMTAVTAAGESP
Interleukin-12 beta chain
HUMAN .....236 .DPPKNLQLKPLKNSRQVEVS
W EYPDTWSTPHS YF
SLTFCVQVQGKSKREKKDRVFTDKTSATVICRKNASISVRAQDRYYSSSWSEWASVPCS
MONKEY ....236 .DPPKNLQLKPLKNSRQVEVS
W EYPDTWSTPHS YF
SLTFCIQVQGKSKREKKDRIFTDKTSATVICRKNASFSVQAQDRYYSSSWSEWASVPCS
MANGABEY ..236 .DPPKNLQLKPLKNSRQVEVS
W EYPDTWSTPHS YF
SLTFCIQVQGKSKREKKDRIFTDKTSATVICRKNASFSVQAQDRYYSSSWNEWTSVPCS
MOUSE .....233 PDPPKNLQMKPLKNS-QVEVS W EYPDSWSTPHS YF SLKFFVRIQRKKEKMKETEEGCNQKGAFLVEKTSTEVQCKGGNVCVQAQDRYYNSSCSK
PIG .......231 PDPPKNLQLNPLKNSRHVEIS W EYPDTWSTPHS YF SLMFGVQVQGKNKREKKDKLFTDQISAKVTCHKDANIRVQARDRYYSSSWSEWASVSCN
COW .......236 PDPPKNLQLRPLKNSRQVEVS W EYPDTWSTPHS YF SLTFCVQVQGKNKREKKLFMDQTSAKVTCHKDANVRVQARDRYYSSFWSEWASVSCS
DOG .......237 .DPPTNLQLKPLKNSRHVEVS
W EYPDTWSTPHS YF
SLTFCVQAQGKNNREKKDRLCVDKTSAKVVCHKDAKIRVQARDRYYSSSWSDWASVSCS
CAT .......236 .DPPKNLQLKPLKNSRHVEVS
W EYPDTWSTPHS YF
SLTFGVQVQGKNNREKKDRLSVDKTSAKVVCHKDAKIRVQARDRYYSSSWSNWASVSCS
WOODCHUCK .236 .DPPKNLKMKPSKTPQQVEVT
W EYPDSWSTPHS YF
SLTFSVQVQGKKKKRSNTLHVDKTSVTVTCQKGAKVSVQARDRYYNSSWSEWATMSCP
RED DEER ..237 .DPPKNLQLRPLKNSRQVEVS
W EYPDTWSTPHS YF
SLTFCVQVQGKNKREKKLFMDQTSAKVTCHKDASIRVQARDRYYNSFWSEWASVSCS
Interleukin-21 receptor
HUMAN21R ..120 ...PAPPFNVTVTFSGQYNIS
W RSDYEDPAFYMLKGKL QY
ELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELQVRAGPMPGSS
MOUSE21R ..120 ...PAPPLNVTVAFSGRYDIS
W DSAYDEPSNYVLRGKL QY
ELQYRNLRDPYAVRPVTKLISVDSRNVSLLPEEFHKDSSYQLQVRAAPQPGTS
Human sidekick homologs 1 & 2 NP_689957.2/GI:32880201
& NP_061937.2/GI:21735577
hsidekck1 .670 .SPQNLLVSPNSSHSHAVVLS
W VRPFDGNSPIL YY
IVELSENSSPWKVHLSNVGPEMTGVTVSGLTPARTYQFRVCAVNEVGRGQYS
hsidekck2 .216 .APEHPVATLSTVERRAINLT
W TKPFDGNSPLI RY
ILEMSENNAPWTVLLASVDPKATSVTVKGLVPARSYQFRLCAVNDVGKGQFS
hsidekck1 .769 .SAPPKNIVASGRTNQSIMVQ
W QPPPETEHNGVLR GY
ILRYRLAGLPGEYQQRNITSPEVNYCLVTDLIIWTQYEIQVAAYNGAGLGVFS
hsidekck2 .315 .TAPPQNVIASGRTNQSIMIQ
W QPPPESHQNGILK GY
IIRYCLAGLPVGYQFKNITDADVNNLLLEDLIIWTNYEIEVAAYNSAGLGVYS
hsidekck1 .870 .TAPPQNVQTEAVNSTTIQFL
W NPPPQQFINGINQ GY
KLLAWPADAPEAVTVVTIAPDFHGVHHGHITNLKKFTAYFTSVLCFTTPGDGPPS
hsidekck2 .416 .TVPPGNVHAEATNSTTIRFT
W NAPSPQFINGINQ GY
KLIAWEPEQEEEVTMVTARPNFQDSIHVGFVSGLKKFTEYFTSVLCFTTPGDGPRS
hsidekck1 .972 .PGAVGHLSFTEILDTSLKVS
W QEPLEKNGIIT GY
QISWEVYGRNDSRLTHTLNSTTHEYKIQGLSSLTTYTIDVAAVTAVGTGLVTSST
hsidekck2 .519 .PGPVGHLSFSEILDTSLKVS
W QEPGEKNGILT GY
RISWEEYNRTNTRVTHYLPNVTLEYRVTGLTALTTYTIEVAAMTSKGQGQVS
hsidekck1 1074 ..SNLVISNISPRSATLQFRP
GY DGKTSISR W
IVEGQVGAIGDEEEWVTLYEEENEPDAQMLEIPNLTPYTHYRFRMKQVNIVGPSPYS
hsidekck2 .620 .PTNLGISNIGPRSVTLQFRP
GY DGKTSISR W
LVEAQVGVVGEGEEWLLIHQLSNEPDARSMEVPDLNPFTCYSFRMRQVNIVGTSPPS
hsidekck1 1174 .DVAPTSVTVRTASETSLRLR
W VPLPDSQYNGNPESV GY
RIKYWRSDLQSSAVAQVVSDRLEREFTIEELEEWMEYELQMQAFNAVGAGP
hsidekck2 .721 .DMAPANVSLRTASETSLWLR
W MPLPEMEYNGNPESV GY
KIKYSRSDGHGKTLSHVVQDRVERDYTIEDLEEWTEYRVQVQAFNAIGSGPWS
hsidekck1 1277 .SAAPENVSAEAVSSTQILLT
W TSVPEQDQNGLIL GY
KILFRAKDLDPEPRSHIVRGNHTQSALLAGLRKFVLYELQVLAFTRIGNGVP
hsidekck2 .824 .SSGPTNVSALATTSSSMLVR
W SEVPEADRNGLVL GY
KVMYKEKDSDTQPRFWLVEGNSSRSAQLTGLGKYVLYEVQVLAFTRIGDGSPS
hsidekck1 1378 .PGPPVRLVFPEVRLTSVRIV
W QPPEEPNGIIL GY
QIAYRLASSSPHTFTTVEVGATVRQFTATDLAPESAYIFRLSAKTRQGWGEPL
hsidekck2 .925 .PGPPMGILFPEVRTTSVRLI
W QPPAAPNGIIL AY
QITHRLNTTTANTATVEVLAPSARQYTATGLKPESVYLFRITAQTRKGWGEA
hsidekck1 1479 .PPRELLVPQAEVTARSLRLQ
W VPGSDGASPIR YF
TMQVRELPRGEWQTYSSSISHEATACVVDRLRPFTSYKLRLKATNDIGDSDFS
hsidekck2 1026 .PPSRPMVQQEDVRARSVLLS W EPGSDGLSPVR YY TIQTRELPSGRWALHSASVSHNASSFIVDRLKPFTSYKFRVKATNDIGDSEF
hsidekck1 1579 .GEPPGSVSATPHTTSSVLIQ
W QPPRDESLNGLLQ GY
RIYYRELEYEAGSGTEAKTLKNPIALXAELTAQSSFKTVNSSSTSTMCELTHL
hsidekck2 1126 .DEAPTILSVTPHTTTSVLIR W QPPAEDKINGILL GF
RIRYRELLYEGLRGFTLRGINNPGATWAELTSMYSMRNLSRPSLTQYELDNLNKHRRYEIRM
hsidekck1 1702 .AMAPQNVQVTPLTASQLEVT
W DPPPPESQNGNIQ GY
KIYYWEADSQNETEKMKVLFLPEPVVRLKNLTSHTKYLVSISAFNAAGDGPKSDP
hsidekck2 1248 .TAAPRNVVVHGATATQLDVT W EPPPLDSQNGDIQ GY
KIYFWEAQRGNLTERVKTLFLAENSVKLKNLTGYTAYMVSVAAFNAAGDGPR
hsidekck1 1802 .PGAPSFLAFSEITSTTLNVS
W GEPAAANGILQ GY
RVVYEPLAPVQGVSKVVTVEVRGNWQRWLKVRDLTKGVTYFFRVQARTITYGPEL
hsidekck2 1348 .PSAPSSVKFSELTTTSVNVS W EAPQFPNGILE GY RLVYEPCSPVDGVSKIVTVDVKGNSPLWLKVKDLAEGVTYRFRIRAKTFTYGPE
hsidekck1 1902 .SPGSPRDVLVTKSASELTLQ
W TEGHSGDTPTT GY
VIEARPSDEGLWDMFVKDIPRSATSYTLSLDKLRQGVTYEFRVVAVNEAGYGEPSNP
hsidekck2 1448 .APGPPGVPIIVRYSSAIAIH W SSGDPGKGPIT RY VIEARPSDEGLWDILIKDIPKEVSSYTFSMDILKPGVSYDFRVIAVNDYGF
human retina specific protein PAL
[Homo sapiens]. NP_056428.1 GI:14149694
rat retina specific glycoprotein; putative type I transmembrane protein
NP_647547.1 GI:21326459
similar to retina specific protein PAL [Mus musculus]. XP_138990.1 GI:20873125
human .....428 .ARMVRSVKVVGDTYHSVSLV
W KAPQAKNTT AF
SVLYAVFGQHSMRRVIVQPGKTRVTITGLLPKTKYVACVCV
mouse .....423 .AQMVRSLKVVGDTYHSVSLV
W KAPQAGNTT AF
SVLYAVFGHRDMRRMTVEPGKTSVTIEGLAPKTKYVACVCVRGLVP
rat .......422 .TQMVRSLKVVGDTYHSVSLV
W KAPQAGNTT AF
SVLYAVFGQRDMRRMTVEAGKTSVTIEGLAPKTKYVACVCVRGLVPT
Neuroglian precursor. Drosophila
P20241/GI:14286138 & c.elegans NP_500276.2/GI:25148680
DROME .....614 VPNAPKLTGITCQADKAEIH
W EQQGDNRSPIL HY
TIQFNTSFTPASWDAAYEKVPNTDSSFVVQMSPWANYTFRVIAFNKIGASPPSAHSDSCTTQPDVP
C.ELEGANS .611 VPAHPVVETAHCSERKATVK
W VAASDHGDSIK KY
IVEMFTDFKKNEWEVINEEVNVNKETFEVDITLTPWVNYTFRVVAVNSHGRS
DROME .....714 FKNPDNVVGQGTEPNNLVIS
W TPMPEIEHNAPNF HY
YVSWKRDIPAAAWENNNIFDWRQNNIVIADQPTFVKYLIKVVAINDRGESNVAAEEVVGYSGEDRPL
C.ELEGANS .719 YTNPTGVKGEGTEPDNLVIS
W KPLDRYYWNAPNM QY
LVRYKLDEPIHGWTEFLVEDSLANFTIIRDQPTFRKYLIQ
DROME .....817 DAPTNFTMRQITSSTSGYMA
W TPVSEESVRGHFK GY
KIQTWTENEGEEGLREIHVKGDTHNALVTQFKPDSKNYARILAYNGRFNGPPSAVIDFDTPEGV
C.ELEGANS .832 EAPRDFHIDTQINFTTINFT
W NPVDANTVNGHFV GY
EIEYWKAENTIRKYSIKIPANSTYKVINSFHAVTNYSAHIRTRNKRLRSAPSD
DROME .....917 PSPVQGLDAYPLGSSAFMLH
W KKPLYPNGKLT GY
KIYYEEVKESYVGERREYDPHITDPRVTRMKMAGLKPNSKYRISITATTKMGEG
C.ELEGANS .921 PGKVHNLRVYSVGSTAILLQ W
DAPLQPNGRIR GY FISFQNEKNETEETYVIHRQKHYLHEKSEPDTGYKVSVWAETRAGEGPVT
DROME ....1025 PSFSWEQLPSDNGLAKFRIN
W LPSTE--GHPGTH FF
TMHRIKGETQWIRENEEKNSDYQEVGGLDPETAYEFRVVSVDGHFNTESATQEIDTNTVEGP
C.ELEGANS 1016 .PDAPIFRVKNISLDSFVVE
W QPNNHSVWKMPGA AF
FVNYTAESSKTWFQSEIIYLPYTEITIRNLKEDQKYFMQGIAKDGPRRSE
human tripartite motif protein 9
isoform 2; homolog of rat RING finger Spring NP_443210.1 GI:16519559
rat tripartite motif protein 9; NP_569104.1 GI:18426848
mouse tripartite motif protein 9 [Mus musculus]. XP_127056.4 GI:28521544
similar to tripartite motif protein 36; human homolog of rat zinc-binding
Rbcc728 XP_225947.1 GI:27681771
c.elegans Tripartite motif protein 9 (83.7 kD) NP_503395.1 GI:17558548
similar to tripartite motif protein 9, isoform 1; human homolog of rat RING
finger XP_226563.1 GI:27659604
HUMANSIMTRIP36 309 ...PEINEEQSKVYNN-ALID
W HHPEK--DKAD SY
VLEYRKINRDEEMISWNEIEVHGTSKVVSNLESNSPYAFRVRAYRGSICSPCS
HUMANSIMTRP9 ...93 .PVPLLQLEKCCTRNNSATLA
W RTPPFTHSPAD GY
ILELDDGDGGQFREVYVGKETLCTIDGLHFNSTYNARVK
HUMAN TRIP9 ...441 PATPILQLEECCTHNNSATLS W KQPPLSTVPAD GY ILELDDGNGGQFREVYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYS
MOUSETRIP9 ....441 PATPILQLEECCTHNNSATLS W KQPPLSTVAAD GY ILELDDGSGGQFREVYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYS
RATTRIP9 ......441 PATPILQLEECCTHNNSATLS W KQPPLSTVAAD GY ILELDDGSGGQFREVYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYS
C.ELTRIP9. ....499
PSAPIIETSECSAENNSVTVV W RPRNDG-SAVD GF
ALEIDTGRDDGNFKEVYSGPDTICTIDGLHFNTVYAARVKSYNS
Cellulomonas fimi
Exoglucanase A & B precursors (Exocellobiohydrolase
A & B)
(1,4-beta-cellobiohydrolase A) (CBP95) P50401/GI:1708083 (1,4-beta-cellobiohydrolase
B) (CBP120) P50899/GI:1708084
EXOGLUA 478 DTTAPTVPTGLTAGTTTATSVPLS W
TASTD---NVAVT GY DVYRGTTLVGTTAATSYTVTGLTPATAYSFTVRAKDAAGNVSAASAAAAATTQSGTVT
EXOGLUB 700 DTTAPTVPTGLQAGVVTSTEATIS W TASTD---DTRVT
GY DVYRGATKVGTATTTSFTDTGLTASTAYAYTVRAFDAAGNVSAPSAALTVTTKATPS
EXOGLUA 573 DTTAPSVPAGLTAGTTTTTTVPLS W
TASTDNAGGSGVA GY EVLRGTTVVGTTTATSYTVTGLTAGTTYSFSVRAKDVAGNTSAASAAVSATTQTGTVV
EXOGLUB 794 DTTAPSVPAITSS-SSTANSVTIG W SASTDNAGGSGLA
GY DVYRGATRVAQTTALTFTDTGLTASTAYEYTVRARDVAGNVSAPSTAVSVTTKSDTTP
EXOGLUA 671 DTTAPSVPTGLTAGTTTTSSVPLT W
TASTDNAGGSGVA GY EVFNGTTRVATVTSTSYTVTGLAADTAYSFTVKAKDVAGNVSAASAAVSARTQAATSGGCT
EXOGLUB 891 DTTAPSVPAGLAAMTVTETSVALT W NASTD-TGGSGLK
GY DVYRGATRVGSTTTASYTDTGLTAATAYQYTVRATDNAGNVSAASAALS
Streptomyces coelicolor putative secreted beta-mannosidase NP_733506.1
GI:32141115
Cellulomonas fimi Endoglucanase
B precursor (Endo-1,4-beta-glucanase B) (Cellulase) P26225 GI:121813
Cellulomonas fimi Endoglucanase
D precursor (Endo-1,4-beta-glucanase) (Cellulase). P50400 GI:1708080
Thermobifida fusca Endoglucanase
E-4 precursor (Endo-1,4-beta-glucanase E-4) (Cellulase E4). P26221
GI:2506384
S.COELbMANNASE 364 .PTAPGAPSVTAVTDSSVTLG
W VAATDD--TAVS GY
DVVLVGDGTSSTVASSTTTTATVTGLTAATTYTFAVHARDAAGNRSARSATVEVT
C.FIMI ENDOB ..650 PPTTPGTPVATGVTTVGASLS W AASTD-AGSGVA GY ELYRVQGTTQTLVGTTTAAAYILRDLTPGTAYSYVVKAKDVAGNVSAASAAVTFTT
C.FIMI ENDOB ..748 PPTTPGTPVASAVTSTGATLA W APSTGDP--AVS GY DVLRVQGTTTTVVAQTTVPTVTLSGLTPSTAYTYAVRAKNVAGDVSALSAPVTFTT
C.FIMI ENDOB ..847 .PTVPGTPVASNVATTGATLT
W TASTDSGGSGLA GY
EVLRVSGTTQTLVASPTTATVALAGLTPATAYSYVVRAKDGAGNVSAVSSPVTFTT
C.FIMI ENDOD ..454 .PTAPTGLRAGTPTASTVPLT
W SASTDTGGSGVA GY
EVYRGTTLVGTTTATSYTVTGLAADSAYTFSVRAKDGAGNTSAASAAVTARTAAGG
C.FIMI ENDOD ..550 .PSVPTGLTAGTPTATSVPLT
W TASTDTGGSGVT GY
EVYRGSTLVARPTGTSHTVTGLSAATAYTFTVRAVDAAGNVSAASAPVGVTTAPDPTT
T.FUSCAENDO-E4 675 PPSAPGSPAVRDVTSTSAVLT W
SASSDTGGSGVA GY DVFLRAGTGQEQKVGSTTRTSFTLTGLEPDTTYIAAVVARDNAGNVSQRSTVSFTT
Bacterial Chitinase
Streptomyces coelicolor putative secreted glycosyl
hydrolase NP_631079.1 GI:21225300
Streptomyces lividans Chitinase C precursor. P36909 GI:544014
Streptomyces plicatus Chitinase 63 precursor. P11220 GI:116350
Streptomyces olivaceoviridis EXOCHITINASE 1 PRECURSOR. Q05638 GI:544016
Bacillus circulans CHITINASE D PRECURSOR. P27050 GI:6226570
Bacillus circulans Chitinase A1 precursor. P20533 GI:116300
s.coelGlyase ...322
.PTTPANLAFTEPATGQIRLT W
NESSDDTAVT GY DVYANGDLLTSVAGDVTTYTDTRPAGTTVTYHVRAKDAAGNQSGDSNSVTRRADT
s.coelGlyase ...415 .PTAPSNLALTEPVPGQVKLT
W TASTDDRGVT GY
DVYADNKLRKSVAGNVTTYTDTQPASANVTYFVRAKDAAGNESGDSNSVFRGGT
b.circchitaseA1 465 .PSVPGNARSTGVTANSVTLA
W NASTDNVGVT GY
NVYNGANLATSVTGTTATISGLTAGTSYTFTIKAKDAAGNLSAASNAVTVSTTAQPGGDTQA
b.circchitaseA1 560 .PTAPTNLASTAQTTSSITLS
W TASTDNVGVT GY
DVYNGTALATTVTGTTATISGLAADTSYTFTVKAKDAAGNVSAASNAVSVKT
b.circ chitaseD .92 PPTVPAGLTSSLVTDTSVNLT
W NASTDNVGVT GY
EVYRNGTLVANTSTTTAVVTGLTAGTTYVFTVKAKDAAGNLSAASTSLSVTTSTG
s.olivexo ......169 PPAPPTGLRTGSVTATSVALS
W SPVT---GAT GY
AVYRDGVKVATASGTSATVTGLTPDTAYAFQVAAVNGAGESAKSATVTATT
s.liviChitase-C 142 .PSAPGTPTASNITDTSVKLS
W SAATDDKGVK NY
DVLRDGAKVATVTGTTYTDNGLTKGTAYSYSVKARDTADQTGPASGAVKVTTTGGG
s.plichitase-63 142 .PSAPGTPTASNITDTSVKLS
W SAATDDKGVK NY
DVLRDGATVATVTGTTYTDNGLTKGTDYSYSVKARDTGDQTGPASGSVKVTTTGGD
Amylopullulanase precursor (Alpha-amylase/pullulanase)
T.tfurigenes ...929 .APQAPSNVVVTSGNGKVDLS
W LQSD---GAT GY
NIYRSSVEGGLYEKIASNVTET-TFEDANVTNGLKYVYAISAIDELGNESGISNDAVAYP
T.ethano .......927 .APQPITDLKAVSGNGQVDLS
W SAVD---RAV SY
NIYRSTVKGGLYEKIASNVTQI-TYIDTDVTNGLKYVYSVTAVDSDGNESALSNEVEAYP
T.sacch ........930 .APQVPSNVVATSGNGKVDLS
W SQSD---GAT GY
NIYRSSVEGGLYEKIASNVTGT-TFEDTNVTNGLKYVYAISAVDELGNESEMSIDTVAYP
T.tfuricus .....928 .APQPITDLKAVSGNGKVDLS
W SVVD---KAV SY
NIYRSTVKGGLYEKIASNVTQI-TYTDTEVTNGLKYVYAVTAVDNDGNESALSNEVEAYP
T.tfurigenes ..1158
.KPTAPYLNQPGTESSRVSLT W
NPSTDNVGIY DY EIYRSDG--GTFNKIATVSNEVYNYIDTSVINGVTYNYKVVAVDLSFNRTESNVVTIKPD
T.ethano ......1159 .PPTALGLQQPGIESSRVTLN
W SLSTDNVAIY GY
EIYKSLSETGPFVKIATVADTVYNYVDTDVVNGKVYYYKVVAVDTSFNRTASNIVKATP
T.sacch .......1158 .KPTAPILNQPGVESSRVSLT
W SPSTDNVGIY NY
EIYRSDG--GTFNKIATVSNEVYNYVDTSVINGTTYSYKVVAADPSFNRTESNVVTIKPD
T.tfuricus ....1164 .TPTAPVLQQPGIESSRVTLN
W SPSADDVAIF GY
EIYKSSSETGPFIKIATVSDSVYNYVDTDVVNGNVYYYKVVAVDTSYNRTASNTVKATP .V
Chitin Biosynthesis protein
S.cerevisiae ...76 THKPESPVLKIVNVTQTSCVLA
W DPLKLGSAKLKSLI LY
RKGIRSMVIPNPFKVTTTKISGLSVDTPYEFQLKLITTSGTLWSEKVILRTH
S.pombe ........77 QKLPSPPVLKLKNATQTSIVLE
W DPLQLSTARLKSLC LY
RNNVRVLNISNPMTTHNAKLSGLSLDTEYDFSLVLDTTAGTFPSKHITIKTL
Candida ........74 TNLPQPPNLKIKNVTQTSCVLE
W DKLNLGTATLKNLI LF
KDGKKLGSIPQPLNNRTSKLSGLPIDKSFKVQLRLDTTAGTFLSNEIEVTTH
Roundabout 1,2 & 3 human-1:
NP_002932.1/GI:4506569 Zebrafish-2: NP_571556.1/GI:24371280
Drosophila melanogaster-3. AAG41426.1 GI:11907990
human-1 ..561 PSAPSKPEVTDVSRNTVTLS W
QPNLNSGAT-PT SY IIEAFSHASGSSWQTVAENVKTETSAIKGLKPNAIYLFLVRAANAYGISDPSQISD
Zfish-1 ..523 PSAPSKPDVTDVSRTSVSLS W
KPNLNAGAT-PT SY VIEAFSHAAGSSWQTLADHVKTESFVLKGLKPSAVYLFLVRAANAYGLSDPS
Zfish-2 ..529 PGPPSKPQVTDVTKNSVSLS W
QPGLP-GASAIS TY VIEAFSQSVSNSWQTVADHVKTTQYTIKGLRPNTIYLFMVRAVNVQGLSDPSPMSEPV
Drome-3 ..519 PSAPGQPKILNATASALTIV W
PTSDKAGASSFL GY SVEMYCTNQSRTWIPIASRLSEPIFTVESLTQGAAYMFIVRAENSLGFSPPSPISEPITAGKLVGVR
.
human-1 ..675 NAVLHLHNPTVLSSSSIEVH W TVDQQSQYIQ GY KILYRPSGANHGESDWLVFEVRTPAKNSVVIPDLRKGVNYEIKARPFFNEFQGADSEIKFAKTLEEAPSA
Zfish-1 ..637 DVVIHLHNPTILSSSSVRVQ W
TVEQQPQYIQ GY KVMYRPSPEGAPQRVDWAMFEVGAPGEHSAVVTQLKKGITYEFKVRPFFNEFQG
Zfish-2 ..643 EVIVRLHNPVVLSPTTIQVT W
TVDRQSQFIQ GY RVLYRQMSGLSSPGAWQTQDVKVPSERSMVLSALKKGIVYEIKVRPYFNEFQGMDS
Drome-3 ..643 NDVVELLEANASDSTTARLS W
DIDS-GQYIE GF YLYARELHSSEYKMVTLLNKGQGLSSCTVPGLAKASTYEFFLVPFYKSIVGKPSNSRRMRTLEDVP
human-1 ..778 PPQGVTVSKNDGNGTAILVS W QPPPEDT-QNGMVQ EY
KVWCLGNETRYHINKTVDGSTFSVVIPFLVPGIRYSVEVAASTGAGSGVKSEPQFIQLDAHGNPVSP
Zfish-1 ..741 APRGVTVTGSGDNGTAVLVA W
QPPPEEE-QNGVVQ EY KIWCLGNESRYHINRTVDGSTHSVLIPGLVAGVTYRLEVAAGTGAGPGVKS
Zfish-2 ..647 PQQVTVLTVGNQNSTSISIS W
DPPPAEH-QNGIIQ EY KIWCLGNETRFHVNKTVDAAIRSVVVGGLQAGVQYRVEVAASTSAGVGVKSEP
Drome-3 ..741 EAPPYGMEAIQFNRTSVFLK W
LPPQPNRTRNGILT SY NVVVKGLDVHNTTRIFKNMTIDAATPTLLLANLTTGVTYYIAV
AAATRVGVGP
Rat ROBO4 AAP32918.1 GI:30575795
a.k.a. Magic Roundabout 4 (gi31077126/NP_852040.1)
256 ENVTLLNPEPVKGPKPGPAV W LSWKVSGPAAPAQ SY TALFRAQRDPRDQGSPWTEVLLDGLLNAKLGGLRWGQDYEFKVRPSSGRARGPDSNVLLLRLPEQV
358 PSAPPQEVTLRPGNGSVFVS W APPPAENHNGFIR GY QVWSLGNASLPAANWTVVGEQTQLEIAARMPGSYCVQVAAVTGAGAGEPSIPVCLLLEQAMEQSARD
c.elegans sensory
AXon guidance SAX-3, roundabout homolog NP_741748.1
GI:25147920
531 PSSPTQPIIVNVTDTEVELH W NAPSTSGAGPIT GY IIQYYSPDLGQTWFNIPDYVASTEYRIK-GLKPSHSYMFVIRAENE-KGIGTPS
652 EQLIKLEEVKTINSTAVRLF W KKRKL--EELID GY YIKWRGPPRTNDNQYVNVTSPSTENYVVSNLMPFTNYEFFVIPYHSGVHSIHGAPS
western wild mouse Midline
1 protein. P82457 GI:22653813
383 NPPTIREELCTASYDTITVH W TSDDEFSVV SY ELQYTIFTGQANVVSLCNSADSWMIVPNIKQNHYTVHGLQSGTKYIFMVKAINQAGSRSSEPG
Protein-tyrosine phosphatase Delta & Sigma precursors
Human Delta-GI:1709906; rat Delta-GI:9507011;
mouse Delta-GI:20833032; human Sigma- NP_002841.2/gi:19743919; mouse Sigma
-NP_035348.1 GI:25092609; rat Sigma- S466217/GI:1085568
hDelta 320 KALPKPPGTPVVTESTATSITLT W DSGNPEPVS
YY IIQHKPKNSEELYKEIDGVATTRYSVAGLSPYSDYEFRVVAVNNIGRGPPSEPVLTQTSEQAP
mDelta .76 ...PKPPGTPVVTESTATSITLT
W DSGNPEPVS YY
IIQHKPKNSEEPYKEIDGIATTRYSVAGLSPYSDYEFRVVAVNNIGRGPASEPVLTQTSEQA
rDelta 319 ...PKAPGTPVVTENTATSITVT W
DSGNPDPVS YY VIEYKSKSQDGPYQIKEDITTTRYSIGGLSPNSEYEIWVSAVNSIGQGPPSESVVTRTGEQAP
hSigma 317 KSLPKAPGTPMVTENTATSITIT W DSGNPDPVS
YY VIEYKSKSQDGPYQIKEDITTTRYSIGGLSPNSEYEIWVSAVNSIGQGPPSESVVTRTGEQAP
mSigma 319 ...PKAPGTPVVTENTATSITVT W
DSGNPDPVS YY VIEYKSKSQDGPYQIKEDITTTRYSIGGLSPNSEYEIWVSAVNSIGQGPPSESVVTRTGEQA
rSigma 319 ...PKAPGTPVVTENTATSITVT W
DSGNPDPVS YY VIEYKSKSQDGPYQIKEDITTTRYSIGGLSPNSEYEIWVSAVNSIGQGPPSESVVTRTGEQA
hDelta 418 ...SSAPRDVQARMLSSTTILVQ W KEPEEPNGQIQ GY RVYYTMDPTQHVNNWMKHNVADSQITTIGNLVPQKTYSVKVLAFTSIGDGPLSSDIQVITQT
mDelta 170 ..PSSAPRDVQARMLSSTTILVQ W
KEPEEPNGQIQ GY RVYYTMDPTQHVNNWMKHNVADSQITTIGNLVPQKTYSVKVLAFTSIGDGPLSSDIQVITQTGV
rDelta 414 ...ASAPRNVQARMLSATTMIVQ W
EEPVEPNGLIR GY RVYYTMEPEHPVGNWQKHNVDDSLLTTVGSLLEDETYTVRVLAFTSVGDGPLSDPIQVKTQQGV
hSigma 414 ..PARPPRNVQARMLSATTMIVQ W
EEPVEPNGLIR GY RVYYTMEPEHPVGNWQKHNVDDSLLTTVGSLLEDETYTVRVLAFTSVGDGPLSDPIQVKT
mSigma 413 ..PASAPRNVQARMLSATTMIVQ W
EEPVEPNGLIR GY RVYYTMEPEHPVGNWQKHNVDDSLLTTVGSLLEDETYTVRVLAFTSVGDGPLSDPIQVKTQQGV
rSigma 413 ..PASAPRNVQARMLSATTMIVQ W
EEPVEPNGLIR GY RVYYTMEPEHPVGNWQKHNVDDSLLTTVGSLLEDETYTVRVLAFTSVGDGPLSDPIQVKTQQGV
hDelta 514 .GVPGQPLNFKAEPESETSILLS W TPPRSDTIA NY ELVYKDGEHGEEQRITIEPGTSYRLQGLKPNSLYYFRLAARSPQGLGASTAEISARTMQ
mDelta 269 ...PGQPLNFKAEPESETSILLS W
TPPRSDTIA SY ELVYRDGDQGEEQRITIEPGTSYRLQGLKPNSLYYFRLSARSPQGLGASTAEISARTMQS
rDelta 512 ...PGQPMNLRAEAKSETSIGLS W
SAPRQESVI KY ELLFREGDRGREVGRTFDPTTAFVVEDLKPNTEYAFRLAARSPQGLGAFTAVVCQRTLQ
hSigma 511 .GVPGQPMNLRAEARSETSITLS W
SPPRQESII KY ELLFREGDHGREVGRTFDPTTSYVVEDLKPNTEYAFRLAARSPQGLGAFTPVVRQRTL
mSigma 512 ...PGQPMNLRAEAKSETSIGLS W
SAPRQESVI KY ELLFREGDRGREVGRTFDPTTAFVVEDLKPNTEYAFRLAARSPQGLGAFTAVVRQRTLQAK
rSigma 512 ...PGQPMNLRAEAKSETSIGLS W
SAPRQESVI KY ELLFREGDRGREVGRTFDPTTAFVVEDLKPNTEYAFRLAARSPQGLGAFTAVVCQRTLQ
hDelta 607 SKPSAPPQDISCTSPSSTSILVS W
QPPPVEKQNGIIT EY SIKYTAVDGEDDKPHEILGIPSDTTKYLLEQLEKWTEYRITVTAHTDVGPGPESLSVLIRTNEDV
rDelta 603 AKPSAPPQDVKCTSLRSTAILI------------------------------------------------LLEALEKWTEYRVTAVAYTEVGPGPESSPVVVRTDEDV
hSigma 604 SKPSAPPQDVKCVSVRSTAILVS W RPPPPETHNGALV
GY SVRYRPLGSEDPEPKEVNGIPPTTTQILLEALEKWTQYRITTVAHTEVGPGPESSPVVVRTDEDV
mSigma 605 ..PSAPPQDVKCTSLRSTAILVS W RPPPPETHNGALV
GY SVRYRPLGSEDPDPKEVNNIPPTTTQILLEALEKWTEYRVTAVAYTEVGPGPESSPVVVRTDEDV
rSigma 603 AKPSAPPQDVKCTSLRSTAILI------------------------------------------------LLEALEKWTEYRVTAVAYTEVGPGPESSPVVVRTDEDV
hDelta 711 PSGPPRKVEVEAVNSTSVKVS W RSPVPNKQHGQIR
GY QVHYVRMENGEPKGQPMLKDVMLADAQWEFDDTTEHDMIISGLQPETSYSLTVTAYTTKGDGARSKPKLVSTTGA
rDelta 663 PSAPPRKVEAEALNATAIRVL W RSPTPGRQHGQIR
GY QVHYVRMEGTEARGPPRIKDIMLADAQEMVITNLQPETAYSITVAAYTMKGDGARSKPKVVVTKGA
hSigma 708 PSAPPRKVEAEALNATAIRVL W RSPAPGRQHGQIR
GY QVHYVRMERREARGRRSIKDVMLADAQEMVITNLQPETAYSITVAAYTMKGDGARSKPKVVV
mSigma 707 PSAPPRKVEAEALNATAIRVL W RSPTPGRQHGQIR
GY QVHYVRMEGAEARGPPRIKDIMLADAQEMVITNLQPETAYSITVAAYTMKGDGARSKPKVVVTKGA
rSigma 664 PSAPPRKVEAEALNATAIRVL W RSPTPGRQHGQIR
GY QVHYVRMEGTEARGPPRIKDIMLADAQEMVITNLQPETAYSITVAAYTMKGDGARSKPKVVVTKGA
hDelta 823 .VPGKPRLVINHTQMNTALIQ W HPPVD-TFGPLQ GY RLKFGRKDMEPLTTLEFSEKEDHFTATDIHKGASYVFRLSARNKVGFGEEMVKEISIPE
rDelta 766 .VLGRPTLSVQQTPEGSLLAR W
EPPADAAEDPVL GY RLQFGREDAAPATLELAAWERRFAAPAHKGATYVFRLAARGRAGLGEEASAALSIPEDAPR
hSigma 811 .VLGRPTLSVQQTPEGSLLAR W
EPPAGTAEDQVL GY RLQFGREDSTPLATLEFPPSEDRYTASGVHKGATYVFRLAARSPGGLGEEAAEVLSIPE
mSigma 810 .VLGRPTLSVQQTPEGSLLAR W
EPPADAAEDPVL GY RLQFGREDAAPATLELAAWERRFAAPAHKGATYVFRLAARGRAGLGEEAAAALSIPEDAPR
rSigma 766 .VLGRPTLSVQQTPEGSLLAR W
EPPADAAEDPVL GY RLQFGREDAAPATLELAAWERRFAAPAHKGATYVFRLAARGRAGLGEEASAALSIPEDAPR
hDelta 916 EVPTGFPQNLHSEGTTSTSVQLS W
QPPVLAERNGIIT KY TLLYRDINIPLLPMEQLIVPADTTMTLTGLKPDTTYDVKVRAHTSKGPGPYSPSVQFRTLPVD
rDelta 872 ...GFPQILGPAGNVSAGSVILR W
LPPVPAEGNGAII KY TVSVREAGTPGPATETELAAAAQPGAETALTLQGLRPETAYELRVRAHTRRGPGPFSPPLRYRLARD
hSigma 906 TPRGHPQILEAAGNASAGTVLLR W LPPVPAERNGAIV
KY TVAVREAGALGPARETELPAGRLSRARRTLTLQGLKPDTAYDLQVRAHTRRGPGPFSPPVRYRT
mSigma 906 ...GFPQILGAAGNVSAGSVLLR W
LPPVPAERNGAII KY TVSVREAGAPGPATETELAAAAQPGAETALTLRGLRPETAYELRVRAHTRRGPGPFSPP
rSigma 862... GFPQILGPAGNVSAGSVILR W
LPPVPAEGNGAII KY TVSVREAGTPGPATETELAAAAQPGAETALTLQGLRPETAYELRVRAHTRRGPGPFSPPLRYRLAR
hDelta 1018 ..QVFAKNFHVKAVMKTSVLLS W EIPENYNSAM PF KILYDDGKMVEEVDGRATQKLIVNLKPEKSYSFVLTNRGNSAGGLQHRVTAKTAPDVLRTKPAFIGKTNL
rDelta .965 ..PVSPKNFKVKMIMKTSVLLS
W EFPDNYNSPT PY
KIQYNGLTLDVDGRTTKKLITHLKPHTFYNFVLTNRGSSLGGLQQTVTARTAFNMLSGKPSVAPKPDNDGSIV
mSigma 1009 ..PVSPKNFKVKMIMKTSVLLS W
EFPDNYNSPT PY KIQYNGLTLDVDGRTTKKLITHLKPHTFYNFVLTNRGSSLGGLQQTVTARTAFNMLSGK
rSigma .965..
PVSPKNFKVKMIMKTSVLLS W EFPDNYNSPT PY
KIQYNGLTLDVDGRTTKKLITHLKPHTFYNFVLTNRGSSLGGLQQTVTARTAFNMLSGKPSVAPKPDND
PTPases Mu, PCP-2 & Kappa
hMU ....482 .PGAVPTESIQGSTFEEKIFLQ
W REPTQTYGVIT LY
EITYKAVSSFDPEIDLSNQSGRVSKLGNETHFLFFGLYPGTTYSFTIRASTAKGFGPPATNQFTTKISAPSM
mMu ....482 .PGAVPTESIQGSAFEEKIFLQ
W REPTQTYGVIT LY
EITYKAVSSFDPEIDLSNQSGRVSKLGNETHFLFFGLYPGTTYSFTIRASTAKGFGPPATNQFT
hPCP2 ..481 .PSGIAAESLTFTPLEDMIFLK
W EEPQEPNGLIT QY
EISYQSIESSDPAVNVPGPRRTISKLRNETYHVFSNLHPGTTYLFSVRARTGKGFGQAALTEITTNISAPS
hKAPPA .490 .PGPVPVKSLQGTSFENKIFLN
W KEPLDPNGIIT QY
EISYSSIRSFDPAVPVAGPPQTVSNLWNSTHHVFMHLHPGTTYQFFIRASTVKGFGPATAINVTTNISAPT
mKappa .489 .PGPVPVKSLQGTSFENKIFLN
W KEPLEPNGIIT QY
EVSYSSIRSFDPAVPVAGPPQTVSNLWNSTHHVFMHLHPGTTYQFFIRASTVKGFGPATAINVTTNI
LAR-like PTPase c.elegans-gi:29427539/Q9BMN8;
Drosophila-TDFFLK/GI:538578; Anopheles mosquito frag.-S53089/GI:1079024;
rat.-S46216/GI:1071964; human-P10586/GI:125978
human 309 .PKPPIDLVVTETTATSVTLT W
DSGNSEP-VT YY GIQYRAAGTEGPFQEVDGVATTRYSIGGLSPFSEYAFRVLAVNSIGRGPPSEAVRARTGEQA
rat ..319 .PKPPIDLVVTETTATSVTLT
W DSGNTEP-VS FY
GIQYRAAGTDGPFQEVDGVASTRYSIGGLSPFSEYAFRVLAVNSIGRGPPSEAVRARTGEQA
D.M. .322 .PTAPTDVQISEVTATSVRLE
W SYKGPED-LQ YY
VIQYKPKNANQAFSEISGIITMYYVVRALSPYTEYEFYVIAVNNIGRGPPSAPATCTTGETKM
c.el .339 .PPPPVNIVVSSVTSESVVIT
W KPPKYNEAIN KY
VVNYRLKYSEGRSSRGKTMETLENSLVIDGLVAFQTYEFTVRSAGPVGVGLESLPVEAQTKPSK
human 403 PSSPPRRVQARMLSASTMLVQ W EPPEEPNGLVR
GY RVYYTPDSRRPPNAWHKHNTDAGLLTTVGSLLPGITYSLRVLAFTAVGDGPPSPTIQVKTQQGV
rat ..413 PSSPPRRVQARMLSASTMLVQ W
EPPEEPNGLVR GY RVYYTPDSRRPLSAWHKHNTDAGLLTTVGSLLPGITYSLRVLAFTAVGDGPPSPTIQVKTQQG
D.M. .417 .ESAPRNVQVRTLSSSTMVIT
W EPPETPNGQVT GY
KVYYTTNSNQPEASWNSQMVDNSELTTVSDVTPHAIYTVRVQAYTSMGAGPMSTPVQVKAQQGV
c.el .436 PATAPVSPQARSLNRDSILVK W
GPCEQPNGLIT GY KVYYTNDLVTTPIREWKQHDAKSDEFMTTINGLEPDSRYFVRVIAQNSEGDSPLSTLVTVATRQGI
human 802 .PAQPADFQAEVESDTRIQLS W LLPPQ--ERII MY ELVYWAAEDEDQQHKVTFDPTSSYTLEDLKPDTLYRFQLAARSDMGVGVFTPTIEARTAQST
rat ..511 VPAQPADFQAKAESDTRIQLS W
LLPPQ--ERII KY ELVYWAAEDEGQQHKVTFDPTSSYTLEDLKPDTLYHFQLAARSDLGVGVFTPTVEACTAQST
D.M. .515 .PSQPSNFRATDIGETAVTLQ
W TKPTHSSENIV HY
ELYWNDTYANQAHHKRISNSEAYTLDGLYPDTLYYIWLAARSQRGEGATTPPIPVRTKQYV
c.el .537 IPGQPPMLTVKALDSRRMQLT W
DKPLYSSP-VV GY TVRYNTSDGEKELTLTSPHEKHVVTGLHPDKYYYFRVAAYSDRGQGEFTEPMISKTIASIP
human 596 PSAPPQKVMCVSMGSTTVRVS W VPPPADSRNGVIT
QY SVAHEAVDGEDRGRHVVDGISREHSSWDLVGLEKWTEYRVWVRAHTDVGPGPESSPVLVRTDEDV
rat ..606 PSAPPQKVTCVSTGSTTVRVS W
VPPPADSRNGIIT QY SVAYEAVDGEDRKRHVVDGISREHSSWDLLGLEKWTEYRVWVRAHTDVGPGPESSPVLVRTDEDV
D.M. .610 PGAPPRNITAIATSSTTISLS W
LPPPVERSNGRII YY KVFFVEVGREDDEATTMTLNMTSIVLDELKRWTEYKIWVLAGTSVGDGPRSHPIILRTQEDV
c.el .630 PLSSPTIVSAAATSSKSVEIR W
KGPEQKKLNGVLT AY RINYFRLEDSKTANLESVEYDEDMDDSSSFLDRMSVVVPSDATSYVLSDLLPYSSYEITVAASTMDGYGPESSI.
human 698 PSGPPRKVEVEPLNSTAVHVY W KLPVPSKQHGQIR
GY QVTYVRLENGEPRGLPIIQDVMLAEAQWRPEESEDYETTISGLTPETTYSVTVAAYTTKGDGARSK
rat ..708 PSGPPRKVEVEPLNSTAVHVS W
KLPVPNKQHGQIR GY QVTYVRLENGEPRGQPIIQDVMLAEAQETTISGLTPETTYSITVAAYTTKGDGARSKPKVVTTTGA
D.M. .709 .PGDPQDVKATPLNSTSIHVS
W KPPLEKDRNGIIR GY
HIHAQELRDEGKGFLNEPFKFDVVDTLEFNVTGLQPDTKYSIQVAALTRKGDGDRSAAIVVKTPGGVP
c.el .750 .PSAPRNFNAELTSATSVKLT
W DAPAA--ANGALL GY
YVYLDRMVNGEPVVEKGSKKRIVMIRDSSKRYFELDSLDPNTEYSFRLNAFNRNGDGEFSERKSIITQGIP
human 810 .VPGRPTMMISTTAMNTALLQ W HPPKELP--GELL GY
RLQYCRADEARPNTIDFGKDDQHFTVTGLHKGTTYIFRLAAKNRAGLGEEFEKEIRTPEDL
rat ..811 .VPGRPTMMVSTTAMHTALLQ
W HPPKELP--GELL GY
RLQYRRADEARPNTIDFGKDDQHFTVTGLHKGATYIFRLAAKNRAGPGEEFEKEITTPEDAP
D.M. .814 .RPTVSLKIMEREPIVSIELE
W ERPAQ--TYGELR GY
RLRWGVKDQALKEEMLSGPQMTKKRFDNLERGVEYEFRVAGSNHIGIGQETVKIFQTPEGT
An ....16 .RLAVTLKILELDPTVSIELE
W ERPRQ--AYGELR GY
RVRWGVREQALNEEILQGTQLAVKRINNLERGVEYEFRVAGMNHIGIGQEAVKHLQTPEGS
c.el .858 .PEIVSVSLDRDEPPVVARIE
W KMPKMKPNETPIE KY
NLWLRAQGYPDSYVKAKTVDGTDLSTTISGLWMGVVYDVLLAAENREGRSQNATETIATPVGS
human 905 PSGFPQNLHVTGLTTSTTELA W DPPVLAERNGRII
SY TVVFRDINSQQELQNITTDTRFTLTGLKPDTTYDIKVRAWTSKGSGPLSPSIQSRTMPVE
rat ..907 .SGFPQNLRVTGLTTSTTELA
W DPPVLAERNGRIT NY
TVVYRDINSQHELQNVTGDVHLTLLGLKPDTTYDIKVRAHTSKGAGPLSPSIQSRTMPME
D.M. .909 PGGPPSNITIRFQTPDVLCVT W
DPPTREHRNGIIT RY DVQFHKKIDHGLGSERNMTLRKAVFTNLEENTEYIFRVRAYTKQGAGPFSDKLIVETERDM
An ...111 PTGPPTGIAVRFQTPDVVCIT W
EPPTREHRNGQIT RY DVQFHKKIDHGLGTERNTTVRKAVFTNLDESTEYIVRVRAYTKQGAGPFSEKVVIATERDM
c.el .957 .PDGEPIDVQYEVMKGKIVVS
W RPPSEEKRNGNIT SY
KAILSAMDATADRYEQPV PAPSTSSTFEVNVRRAYLFKVAAATMKGIGPYSPVLTINPDPAAL
human 1002 QVFAKNFRVAAAMKTSVLLS W EVPDSYKSAV
PF KILYNGQSVEVDGHSMRKLIADLQPNTEYSFVLMNRGSSAGGLQHLVSIRTAPDLLPHKPLPASAY
rat ..1003 QVFAKNFRVAAAMKTSVLLS W
EVPDSYKSAV PF KILYNGQSVEVDGHSMRKLIADLQPNTEYSFVLMNRGTSAGGLQHLVSIRTAPDLLPQKPLPASA
D.M. .1007 GRAPMSLQAEATSEQTAEIW W
EPVTSRGKLL GY KIFYTMTAVEDLDDWQTKTVGLTESADLVNLEKFAQYAVAIAARFKNGLGRLSEKVTVRIK
An ....209 GRAPFSVQAVATSEQTVEVW W
EPVPSRGKLV GY KIFYTMTAVEDLDEWQTKVVGVTESADLINLEKFAQYAVAIAAMYKTGLGKLSEKATVKVK
c.el .1056 VGPPTNVRVEATSNSTAVVQ W
DFE--SQKAD SF VVKYMHEPGNRMDTEKWKQLPVVSIDKENPKRFAVVSDLNAHKPYAFCVLAVKNNLTLNEQFNKVRVTNYMTNF
D.M. .1101 PEDVPLNLRAHDVSTHSMTLS W SPPIRLTPV NY KISFDAMKVFVDSQGFSQTQIVPKREIILKHYVKTHTINELSPFTTYNVNVSAIPSD
YSYRPPTKIT
An ....303 PEDVPLNLRAHDVSTHSMTLS W
APPIRLNPI NY KISFDAVKEFVDSQGISQKQILPRKEIILKSHVKSHTISELSPFTTYFVNVSAVPTDYSYKPPAK
c.el .1178 PTYMVQNLRVLWKTSNSVQLT W
EY-NGPRNV GF YVNHTGRKDYVNHELQEKTMSTPGFGQDVDEKHREYLWTNLRPHMMYTIHVGVRTLPPGAR
PTPase Beta Human- P23467/GI:126469
Mouse- NP_084204.1 /GI:23618914
human .23 EPERCNFTLAESKASSHSVSIQ W
RILGSPC NF SLIYSSDTLGAALCPTFRIDNTTYGCNLQDLQAGTIYNFKIISLD-EERTVVLQTDP
mouse .25 ..VKCNFTLLESRVSSLSASIQ
W RTFASPC NF SLIYSSDTSGPMWCHPIRIDNFTYGCNPKDLQAGTVYNFRIVSLDGEESTLVLQTDPL
human 111 .LPPARFGVSKEKTTSTGLHVW W TPSSGKVT SY EVQLFDENNQKIQGVQIQESTSWNEYTFFNLTAGSKYNIAITAVSGGKRSFSVYTNG
mouse 113 ..PPARFEVNREKTASTTLQVR W
TPSSGKVS WY EVQLFDHNNQKIQEVQVQESTTWSQYTFLNLTEGNSYKVAITAVSGEKRSFPVYINGST
human 200 STVPSPVKDIGISTKANSLLIS W SHGSGNVE
RY RLMLMDKGILVHGGVVDKHATSYAFHGLSPGYLYNLTVMTEAAGLQNYRWKLVRTA
mouse 203 ..VPSPVKDLGISPNPNSLLIS W
SRGSGNVE QY RLVLMDKGAIVQDTNVDRRDTSYAFHELTPGHLYNLTIVTMASGLQNSRWKLVRTA
human 290 .PMEVSNLKVTNDGSLTSLKVK W QRPPGNVD SY NITLSHKGTIKESRVLAPWITETHFKELVPGRLYQVTVSCVSGELSAQKMAVGRT
mouse 290 .PMEVSNLKVTNDGRLTSLNVK W
QKPPGDVD SY SITLSHQGTIKESKTLAPPVTETQFKDLVPGRLYQVTISCISGELSAEKSAAGRTV
human 376 FPDKVANLEANNNGRMRSLVVS W SPPAGDWE
QY RILLFNDSVVLLNITVGKEETQYVMDDTGLVPGRQYEVEVIVESGNLKNSERCQGRTV
mouse 378 .PEKVRNLVSYNEIWMKSFTVN W
TPPAGDWE HY RIVLFNESLVLLNTTVGKEETHYALDGLELIPGRQYEIEVIVESGNLRNSERCQGRTV
human 468 ..PLAVLQLRVKHANETSLSIM W QTPVAEWE KY IISLADRDLLLIHKSLSKDAKEFTFTDLVPGRKYMATVTSISGDLKNSSSVKGRT
mouse 468 ..PLAVLQLRVKHANETSLGIT W
RAPLGEWE KY IISLMDRELLVIHKSLSKDAKEFTFTDLMPGRNYKATVTSMSGDLKQSSSIKGRTV
human 553 VPAQVTDLHVANQGMTSSLFTN W TQAQGDVE
FY QVLLIHENVVIKNESISSETSRYSFHSLKSGSLYSVVVTTVSGGISSRQVVVEGRTV
mouse 555 .PAQVTDLHVNNQGMTSSLFTN W
TKALGDVE FY QVLLIHENVVVKNESVSSDTSRYSFRALKPGSLYSVVVTTVSGGISSRQVVAEGRTV
human 643 .PSSVSGVTVNNSGRNDYLSVS W LVAPGDVD NY EVTLSHDGKVVQSLVIAKSVRECSFSSLTPGRLYTVTITTRSGKYENHSFSQERTV
mouse 644 .PSSVSGVTVNNSGRNDYLSVS W
LPAPGEVD HY VVSLSHEGKVDQFLIIAKSVSECSFSSLTPGRLYNVTVTTKSGNYASHSFTEERTV
human 731 .PDKVQGVSVSNSARSDYLRVS W VHATGDFD HY EVTIKNKNNFIQTKSIPKSENECVFVQLVPGRLYSVTVTTKSGQYEANEQGNGRTI
mouse 732 .PDKVQGISVSNSARSDYLKVS W
VHATGDFD HY EVTIKNRESFIQTKTIPKSENECEFIELVPGRLYSVTVSTKSGQYEASEQGTGRTI
human 819 ..PEPVKDLTLRNRSTEDLHVT W SGANGDVD QY EIQLLFNDMKVFPPFHLVNTATEYRFTSLTPGRQYKILVLTISGDVQQSAFIEGFTV
mouse 820 ..PEPVKDLTLLNRSTEDLHVT W
SRANGDVD QY EVQLLFNDMKVFPHIHLVNTATEYKFTALTPGRHYKILVLTISGDVQQSAFIEGLPV
human 907 .PSAVKNIHISPNGATDSLTVN W TPGGGDVD SY TVSAFRHSQKVDSQTIPKHVFEHTFHRLEAGEQYQIMIASVSGSLKNQINVVGRTV
mouse 908 .PSTVKNIHISANGATDRLMVT W
SPGGGDVD SY VVSAFRQDEKVDSQTIPKHASEHTFHRLEAGAKYRIAIVSVSGSLRNQIDALGQTV
human 995 .PASVQGVIADNAYSSYSLIVS W QKAAGVAE RY DILLLTENGILLRNTSEPATTKQHKFEDLTPGKKYKIQILTVSGGLFSKEAQTEGRTV
mouse 996 .PASVQGVVAANAYSSNSLTVS W
QKALGVAE RY DILLLNENGLLLSNVSEPATARQHKFEDLTPGKKYKMQILTVSGGLFSKESQAEGRTV
human 1085 .PAAVTDLRITENSTRHLSFR W TASEGELS WY NIFLYNPDGNLQERAQVDPLVQSFSFQNLLQGRMYKMVIVTHSGELSNESFIFG
mouse 1086 .PAAVTNLRITENSSRYLSFG W
TASEGELS WY NIFLYNPDRTLQERAQVDPLVQSFSFQNLLQGRMYKMVIVTHSGELSNESFIFGRTV
human 1170 RTVPASVSHLRGSNRNTTDSL W FNWSPASG
DF DFYELILYNPNGTKKENWKDKDLTEWRFQGLVPGRKYVLWVVTHSGDLSNKVTAESRTA
mouse 1174 PAAVNHLKGSHRNTTDSLWFS W SPASGDFD
FY ELILYNPNGTKKENWKEKDVTEWRFQGLVPGRKYTLYVVTHSGDLSNKVTGEGRTA
human 1261 .PSPPSLMSFADIANTSLAIT W KGPPDWTDYN DF ELQWLPRDALTVFNPYNNRKSEGRIVYGLRPGRSYQFNVKTVSGDSWKTYSKPIFGSVRTK
mouse 1262 .PSPPSLLSFADVANTSLAIT W
KGPPDWTDYN DF ELQWFPGDALTIFNPYSSRKSEGRIVYGLHPGRSYQFSVKTVSGDSWKTYSKPISGSVRTK
human 1355 .PDKIQNLHCRPQNSTAIACS W
IPPDSDFD GY SIECRKMDTQEVEFSRKLEKEKSLLNIMMLVPHKRYLVSIKVQSAGMTSEVVEDSTITMIDRPPPPPP
mouse 1356 .PDKIQNLHCRPQNSTAIACS W
IPPDSDFD GY SIECRKMDTQEIEFSRKLEKEKSLLNIMMLVPHKRYLVSIKVQSAGMTSEVVED
Tumor suppressor protein DCC precursors.
Mouse-P70211/GI:2497302; human-P43146/GI:1169233
human .429 .PSAPRDVVPVLVSSRFVRLS
W RPPAEAKGNIQ TF
TVFFSREGDNRERALNTTQPGSLQLTVGNLKPEAMYTFRVVAYNEWGPGESSQPIKVATQ
mouse .429 .PSAPRDVLPVLVSSRFVRLS
W RPPAEAKGNIQ TF
TVFFSREGDNRERALNTTQPGSLQLTVGNLKPEAMYTFRVVAYNEWGPGESSQPIKVATQ
human .528 .PGPVENLQAVSTSPTSILIT
W EPPAYANGPVQ GY
RLFCTEVSTGKEQNIEVDGLSYKLEGLKKFTEYSLRFLAYNRYGPGVSTDDITVVTLSDVP
mouse .528 .PGPVENLHAVSTSPTSILIT
W EPPAYANGPVQ GY
RLFCTEVSTGKEQNIEVDGLSYKLEGLKKFTEYTLRFLAYNRYGPGVSTDDITVVTLSDVP
human .623 .SAPPQNVSLEVVNSRSIKVS
W LPPPSGTQN GF
ITGYKIRHRKTTRRGEMETLEPNNLWYLFTGLEKGSQYSFQVSAMTVNGTGPPSNWYTAETPENDLDESQ
mouse .623 .SAPPQNISLEVVNSRSIKVS
W LPPPSGTQN GF
ITGYKIRHRKTTRRGEMETLEPNNLWYLFTGLEKGSQYSFQVSAMTVNGTGPPSNWYTAETPENDLDESQ
human .725 .VPDQPSSLHVRPQTNCIIMS
W TPPLNPNIVVR GY
IIGYGVGSPYAETVRVDSKQRYYSIERLESSSHYVISLKAFNNAGEGVPLYESATTRSITDP
mouse .725 .VPDQPSSLHVRPQTNCIIMS
W TPPLNPNIVVR GY
IIGYGVGSPYAETVRVDSKQRYYSIERLESSSHYVISLKAFNNAGEGVPLYESATTRS
human .843 PMLPPVGVQAVALTHDAVRVS W ADNSVPKNQKTSEVR LY
TVRWRTSFSASAKYKSEDTTSLSYTATGLKPNTMYEFSVMVTKNRRSSTWSMTAHATTY
mouse .843 PMLPPVGVQAVALTHEAVRVS W
ADNSVPKNQKTSDVR LY TVRWRTSFSASAKYKSEDTTSLSYTATGLKPNTMYEFSVMVTKNRRSSTWSMTAHATTY
human .946 SAPKDFTVITREGKPRAVIVS W QPPLEANGKIT AY ILFYTLDKNIPIDDWIMETISGDRLTHQIMDLNLDTMYYFRIQARNSKGVGPLSDPILFR
mouse .946 SAPKDLTVITREGKPRAVIVS W
QPPLEANGKIT AY ILFYTLDKNIPIDDWIMETISGDRLTHQIMDLSLDTMYYFRIQARNVKGVGPLSDPILFR
Down syndrome cell adhesion molecule (human &
rat)
hDSCam 885 PPDPPEIEIKDVKARTITLR W TMGFDGNSPIT
GY DIECKNKSDSWDSAQRTKDVSPQLNSATIIDIHPSSTYSIRMYAKNRIGKSEPSNELTITADEAAP
rDSCam 885 PPDPPEIEIKDVKARTITLR W TMGFDGNSPIT
GY DIECKNKSDSWDSAQRTKDVSPQLNSATIIDIHPSSTYSIRMYAKNRIGKSEPSNEITITADEAAP
hDSCam 985 DGPPQEVHLEPISSQSIRVT W KAPKKHLQNGIIR
GY QIGYREYSTGGNFQFNIISVDTSGDSEVYTLDNLNKFTQYGLVVQACNRAGTGPSSQEIITTTLEDVP
rDSCam 985 DGPPQEVHLEPTSSQSIRVT W KAPKKHLQNGIIR
GY QIGYREYSTGGNFQFNIISIDTTGDSEVYTLDNLNKFTQYGLVVQACNRAGTGPSSQEIITTTLEDVP
hDSCam 1089 SYPPENVQAIATSPESISIS W STLSKEALNGILQ
GF RVIYWANLMDGELGEIKNITTTQPSLELDGLEKYTNYSIQVLAFTRAGDGVRSEQIFTRTKEDV
rDSCam 1089 SYPPENVQAIATSPESISIS W STLSKEALNGILQ
GF RVIYWANLIDGELGEIKNVTTTQPSLELDGLEKYTNYSIQVLAFTRAGDGVRSEQIFTRTKEDV
hDSCam 1189 PGPPAGVKAAAASASMVFVS W LPPLKLNGIIR
KY TVFCSHPYPTVISEFEASPDSFSYRIPNLSRNRQYSVWVVAVTSAGRGNSSEIITVEPLAKAPARILT
rDSCam 1189 PGPPAGVKAAAASASMVFVS W LPPLKLNGIIR
KY TVFCSHPYPTVISEFEASPDSFSYRIPNLSRNRQYSVWVVAVTSAGRGNSSEIITVEPLAKAPARILT
hDSCam 1379 PPDQPRLTVSKTTSSSITLS W LPGDNGGSSIR
GY ILQYSEDNSEQWGSFPISPSERSYRLENLKCGTWYKFTLTAQNGVGPGRISEIIEAKTLGKEPQFSKE
rDSCam 1379 PPDQPRLTVSKTTSSSITLS W LPGDNGGSSIR
GY ILQYSEDNSEQWGSFPISPSERSYRLENLKCGTWYKFTLTAQNGVGPGRISEIIEAKTLGKEPQFSKE
Human putative neuronal cell adhesion molecule
NP_004875.1 GI:30911097
427 PRNVRAVSVSSTEVRVS W SEPLANTKEII GY
VLHIRKAADPPELEYQEAVSKSTFQHLVSDLEPSTAYSFYIKAYTPRGASSASVPTLASTLGEAPA
524 PPPLSVRVLGSSSLQLL W EPWPRLAQHEG GF
KLFYRPASKTSFTGPILLPGTVSSYNLSQLDPTAVYEVKLLAYNQHGDGNAT.
Neural cell adhesion molecule L1 precursor
(N-CAM L1) (CD171antigen). P32004 GI:1705571
L1 CAM ADhesion molecule homolog (lad-1) [Caenorhabditis elegans]. NP_501349.1
GI:17538700
rat neural cell adhesion molecule L1 [Rattus norvegicus]. NP_059041.1 GI:8393820
hNCAML1 ..615 VPRLVLSDLHLLTQSQVRVS W
SPAEDHNAPIE KY DIEFEDKEMAPEKWYSLGKVPG NQTSTTLKLSPYVHYTFRVTAINKYGPGEPSPVSETVVTPEAAP
mNCAML1 ..614 VPHLELSDRHLLKQSQVHLS W
SPAEDHNSPIE KY DIEFEDKEMAPEKWFSLGKVPGNQTSTTLKLSPYVHYTFRVTAINKYGPGEPSPVSESVVTPEAAP
rNCAML1 ..614 VPHLELSDRHLLKQSQVHLS W
SPAEDHNSPIE KY DIEFEDKEMAPEKWFSLGKVPGNQTSTTLKLSPYVHYTFRVTAINKYGPGEPSPVSETVVTPEAA
hNCAML1 ..715 EKNPVDVKGEGNETTNMVIT W KPLRWMDWNAPQV QY
RVQWRPQGTRGPWQEQIVSDPFLVVSNTSTFVPYEIKVQAVNSQGKGPEPQVTIGYSGEDYP
mNCAML1 ..714 EKNPVDVRGEGNETNNMVIT W
KPLRWMDWNAPQI QY RVQWRPQGKQETWRKQTVSDPFLVVSNTSTFVPYEIKVQAVNNQGKGPEPQVTIGYSGEDYP
rNCAML1 ..714 EKNPVDVRGEGNETNNMVIT W
KPLRWMDWNAPQI QY RVQWRPLGKQETWKEQTVSDPFLVVSNTSTFVPYEIKVQAVNNQGKGPEPQVTIGYSGEDY
c.elegans 712 DKNPDEVAAKGTSPENIIVQ W KPMSREEWNGADF
HY VVKYRPKDEDQRVGDWKEVAVEDPFADRVTVNLDDEKDVKPFQPYEVQVQAVNSEGRT
hNCAML1 ..813 QAIPELEGIEILNSSAVLVK W RPVDLAQVKGHLR GY
NVTYWREGSQRKHSKRHIHKDHVVVPANTTSVILSGLRPYSSYHLEVQAFNGRGSGPASEFTFST
mNCAML1 ..812 QVSPELEDITIFNSSTVLVR W
RPVDLAQVKGHLK GY NVTYWWKGSQRKHSKRHIHKSHIVVPANTTSAILSGLRPYSSYHVEVQAFNGRGLGPASEWTFSTP
rNCAML1 ..812 QVSPELEDITIFNSSTVLVR W
RPVDLAQVKGHLR GY NVTYWWKGSQRKHSKRHVHKSHMVVPANTTSAILSGLRPYSSYHVEVQAFNGRGLGPASEWTF
c.elegans 822 SSIPSGLRVLEKSGTTVTLA W NGVDPQTANGNFT
GY KITYWVDEADQDSDDSSEDDEDDEKRKFRWKRSIRVKRQSGIRKTVVFGPSATQGTLTDLKPA
hNCAML1 ..918 PGHPEALHLECQSNTSLLLR W QPPLSHNGVLT GY VLSYHPLDEGGKGQLSFNLRDPELRTHNLTDLSPHLRYRFQLQATTKEGPGEAIVREGGTMALS
mNCAML1 ..917 PGHPEALHLECQSDTSLLLH W
QPPLSHNGVLT GY LLSYHPVEGESKEQLFFNLSDPELRTHNLTNLNPDLQYRFQLQATTQQGGPGEAIVREGGTMALF
rNCAML1 ..917 PGHPEALHLECQSDTSLLLH W
QPPLSHNGVLT GY LLSYHPLDGESKEQLFFNLSDPELRTHNLTNLNPDLQYRFQLQATTHQGPGEAIVREGGTMALFGK
c.elegans 957 LRAYPMNSKVGGEKGVVVLV W KKPRQTNGKLA
RY EVEYCKTQNGKLVEKSCPRKQIDADSKEIRITGLENETPYRFILRAHTSAGEGDPN
hNCAML1 .1016 GISDFGNISATAGENYSVVS W VPKEGQCNF RF HILFKALGEEKGGASLSPQYVSYNQSSYTQWDLQPDTDYEIHLFKERMFRHQMAVKTNGTGRV
mNCAML1 .1016 GKPDFGNISATAGENYSVVS W
VPRKGQCNF RF HILFKALPEGKVSPDHQPQPQYVSYNQSSYTQWNLQPDTKYEIHLIKEKVLLHHLDVKTNGTG
c.elegans Juxtamembrane
domain-Associated Catenin JAC-1 NP_502910.1 GI:17541130
C.EL JAC1 240.PLAPGRPTVIAVDGQGVLLE W
TPPVADVHSSPPQ GY QVEYRVYGSRD- W IVANEQLVQENVFTVESLRPNGVYEFRVRGKNQDGLGHPS
C.EL JAC1 455 PNILEAPEFLEVDGDKITIC W LPAQ---SQLPVM
GY DVEFRDLQQDDR W YKVNDQPVFACKMTVGDLIMDHDYQFRVLAHNASGCSQPS
Nephrin precursor (Renal glomerulus-specific
cell adhesion receptor).
mouse Q9QZS7/GI:20178015; Human: O60500/GI:20177993; Rat: Q9R044/GI:20177992
human ....941 PDPPSGLKVVSLTPHSVGLE W
KPGFDGGLPQ RF CIRYEALGTPGFHYVDVVPPQATTFTLTGLQPSTRYRVWLLASNALGDSGLA
mouse ....941 PDPPLGLKVVSVSPHSVGLE W
KPGFDGGLPQ RF QIRYEALETPGFLYMDVLPAQATTFTLTGLKPSTRYRIWLLASNALGDSGLT
rat ......937 PDPPLGLKVVSISPHSVGLE W
KPGFDGGLPQ RF QIRYEALETPGFLHVDVLPTQATTFTLTGLKPSTRYRIWLLASNALGDSGLT..D
Myosin-binding Protein C (cardiac
and skeletal muscle)
HUMAN MYBCC .772 PDAPAAPKISNVGEDSCTVQ W EPPAYDGGQPIL GY ILERKKKKSYR
W MQLNFDLIQELSHEARRMIEGVVYEMRVYAVNAIGMSRPSPASQPFMPIGP
MOUSE MYBCC .768 PDAPAAPKISNVGEDSCTVQ W EPPAYDGGQPVL GY ILERKKKKSYR
W MRLNFDLLRELSHEARRMIEGVAYEMRVYAVNAVGMSRPSPASQPFMPIGP
CHICK MYBCC .770 PDPPEAPKISNIGEDYCTVQ W QPPTYDGGQPVL GY ILERKKKKSYR
W MRLNFDLLKELTYEAKRMIEGVVYEMRIYAVNSIGMSRPSPASQPFMPIAP
HUMAN MYBCS .620 PDPPVAPTVTEVGDDWCIMN W EPPAYDGGSPIL GY FIERKKKQSSR
W MRLNFDLCKETTFEPKKMIEGVAYEVRIFAVNAIGISKPSMPSRPFVPLAV
MOUSE MYBC .307 PDAPAAPKISNVGEDSCTVQ
W EPPAYDGGQPVL GY
ILERKKKKSYR W MRLNFDLLRELSHEARRMIEGVAYEMRVYAVNAVGMSRPSPASQPFMPIGP
MOUSE MYBCF .634 PDPPEAVRVTSVGENWAILV W EPPKYDGGQPVT GY LMERKKKGSQR
W MKINFEVFTDTTYESTKMIEGVLYEMRVFAVNAIGVSQPSMNTKPFMPIAP
CHICK MYBC ..631 PDPPQSVRVTSVGEDWAVLS W EAPPFDGGMPIT GY LMERKKKGSMR
W MKLNFEVFPDTTYESTKMIEGVFYEMRVFAVNAIGVSQPSLNTQPFMPIAP
HUMAN MYBCC .870 PSEPTHLAVEDVSDTTVSLK
W RPPERVGAGGLD GY
SVEYCPEGCSE W VAALQGLTEHTSILVKDLPTGARLLFRVRAHNMAGPGAPVTTTE
MOUSE MYBCC .866 PGEPTHLAVEDVSDTTVSLK W RPPERVGAGGLD GY SVEYCQEGCSE
W TPALQGLTERRSMLVKDLPTGARLLFRVRAHNVAGPGGPIVTKEPVTV
CHICK MYBCC .868 PSEPTHFTVEDVSDTTVALK W RPPERIGAGGLD GY IVEYCKDGSAE
W TPALPGLTERTSALIKDLVTGDKLYFRVKAINLAGESGAAIIKEPV
HUMAN MYBCS .718 TSPPTLLTVDSVTDTTVTMR W RPPDHIGAAGLD GY VLEYCFEGTED
W IVANKDLIDKTKFTITGLPTDAKIFVRVKAVNAAGASEPKYYSQPI
MOUSE MYBC .405 PGEPTHLAVEDVSDTTVSLK
W RPPERVGAGGLD GY
SVEYCQEGCSE W TPALQGLTERTSMLVKDLPTGARLLFRVR
MOUSE MYBCF .732 TSAPQHLTVEDVTDTTTTLK W RPPDRIGAGGID GY LVEYCLEGSEE
W VPANKEPVERCGFTVKDLPTGARILFRVVGVNIAGRSEPATLLQPVTIREI
CHICK MYBC ..729 TSEPTHVVLEDVTDTTATIK W RPPERIGAGGVD GY LVEWCREGSNE
W VAANTELVERCGLTA RGLPTGERLLFRVISVNMAGKSPPATMAQP
HUMAN MYBCC 1066 PSPPQDLRVTDAWGLNVALE W
KPPQDVGNTELW GY TVQKADKKTME W FTVLEHYRRTHCVVPELIIGNGYYFRVFSQNMVGFSDRAATTKEPVFI
MOUSE MYBCC 1062 PSPPQDIRIVETWGFNVALE W KPPQDDGNTEIW
GY TVQKADKKTME W FTVLEHYRRTHCVVSELIIGNGYYFRVFSHNMVGSSDKAAATKEPVFIPRP
CHICK MYBCC 1064 PGPPQNIKLADVWGFNVALE W TPPQDDGNAQIL
GY TVQKADKKTME W YTVYDHYRRTNCVVSDLIMGNEYFFRVFSE
HUMAN MYBCS .914 PGPPQIVKIEDVWGENVALT W TPPKDDGNAAIT GY TIQKADKKSME
W FTVIEHYHRTSATITELVIGNEYYFRVFSENMCGLSEDATMTKESAVIAR
MOUSE MYBC .601 PSPPQDIRIVETWGFNVALE
W KPPQDDGNTEIW GY
TVQKADKKTME W FTVLEHYRRTHCVVSELIIGNGYYFRVFSHNMVGSSDKAAATKEPVFIPRP
MOUSE MYBCF .928 AGPAENVMVKEVWGTNALVE W QPPKDDGNSEIT GY FVQKADKKTME
W FNVYEHNRHTSCTVSDLIVGNEYYFRIFSENICGLSDSPGVSKNTA
CHICK MYBC ..924 PGPPQAVRVMEVWGSNALLQ W EPPKDDGNAEIS GY TVQKADTRTME
W FTVLEHSRPTRCTVSELVMGNEYRFRVYSENVCGTSQEPATSHNTARIAK
Myosin Binding Protein H
HUMAN MYBH . .71 PSAPLLLTLDDVSSSSVTVS W EPPERLGRLGLQ GY VLELCREGASE
W VPVSARPMMVTQQTVRNLALGDKFLLRVSAVSSAGAGPPAMLDQPIHIRENIE
MOUSE MYBH . .77 PSAPLRLTLEDVSHSSLTVS W EPPEKLGKLGLQ GY VLEFCREGASE
W VPVNPRPVMVTQQTVRNLALGDKFFLRVTAVSSAGAGPPAVLDQPVH
RAT ..MYBH . .78
PSAPLQLTLEDVSHSSLTVS W EPPEDLGSWGSR AM CWSSVREGASE
W VPVNPRPVMVTQQTVRNLALGDKFFLRVTAVNSAGAGPPAVLDQPV
CHICK MYBH ..135 PSVPLSLAVEEVTENSVTLT W KAPEHTGKSSLD GY VVEICKDGSTD
W TAVNKEPFLSTRYKIHDLASGEKVHVRVKAISASGTSDPATLEQPVLIR
HUMAN MYBH ..267 PGPPSSIRLLDVWGCNAALQ
W TPPQDTGNTELL GY
MVQKADKKTGQ W FTVLERYHPTTCTISDLIIGNSYSFRVFSENLCGLSTSATVTKELAH
MOUSE MYBH ..273 PGPPSSIKLLDVWGCNAALE W TPPQDTGNTELL GY TVQKADKRTGQ
W FTVLERYHPTTCTISDLIIGNSYSFRVFSENLCGLSDLATTTKELAHIHKA
RAT ..MYBH ..274
PGPPSSIKLLDVWGCNAALE W MPPQDTGNTELL GY
TVQKADKKTGQ W FTVLERYHPTTCTVSDLIIGNSYSFRVFSENLCGLSDLAT
CHICK MYBH ..331 PGPPQNLKLVDVWGFNVALE W SPPADNGNSEIK GY TVQKSDKKSGK
W FTVLERCTRTSCTISDLIIGNTYSFRVFSENACGMSETAAVAA
Collagen Isoforms
RAT Collagen alpha-1 (III)
477 PMTVPRKHWWTDAGAEKKHV W FGESMNG GF
QFSYGNPDLPEDVLDVQLAFLRLLSSRASQNITYHCKNSIAYMDQANGNVKKSL
Collagen alpha-1 (VII)
HUMAN a1(VII) .232 TSAPRDLVLSEPSSQSLRVQ W TAASGPVT GY KVQYTPLTGLGQPLPSERQEVNVPAGETSVRLRGLRPLTEYQVTVIALYANSIGEAVSGTARTT
MOUSE a1(VII) .233 PSGPRDLVLSEPSSQSLRVQ W TAASGPVT GY KVQYTPLTGLGQPLPSERQEVNIPAGETSTRLQGLRPLTDYQVTVVALYANSIGEAVSGTARTT
HUMAN a1(VII) .327 ALEGPELTIQNTTAHSLLVA
W RSVPGAT GY RVTWRVLSGGPTQQQELGPGQGSVLLRDLEPGTDYEVTVSTLFGRSVGPATSLMARTD
MOUSE a1(VII) .328 AKEGLELSLQNITSHSLLVA W RRVPGAN GY RVTWRDLSGGPTQQQDLSPGQGSVFLDHLEPGTDYEVTVSALFGHSVGPAASLTARTA
HUMAN a1(VII) .415 ASVEQTLRPVILGPTSILLS
W NLVPEAR GY RLEWRRETGLEPPQKVVLPSDVTRYQLDGLQPGTEYRLTLYTLLEGHEVATPATVVPTGPELP
MOUSE a1(VII) .416 SSVEQTLHPIILSPTSILLS W NLVPEAR GY RLEWRRESGLETPQKVELPPDVTRHQLDGLQPGTEYRLTLYTLLEGREVATPATVVPTGLEQL
HUMAN a1(VII) .508 VSPVTDLQATELPGQRVRVS
W SPVPGAT QY RIIVRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSARVGPREGSASVLTVRREPET
MOUSE a1(VII) .509 VSPVMNLQAIELPGQRVRVS W NPVPGAT EY RFTVRTTQGVERTLLLPGSQTTFDLDDVRAGLSYTVRVSARVGAQEGDASILTIHRDPEA
HUMAN a1(VII) .598 PLAVPGLRVVVSDATRVRVA
W GPVPGAS GF RISWSTGSGPESSQTLPPDSTATDITGLQPGTTYQVAVSVLRGREEGPAAVIVARTDP
MOUSE a1(VII) .599 PLVVPGLRVVASDATRIRVA W GLVPGAS GF RISWRTGSGPESSRTLTPDSTVTDILGLQPGTSYQVAVSALRGREEGPPVVIVARTDP
HUMAN a1(VII) .685 LGPVRTVHVTQASSSSVTIT W TRVPGAT GY RVSWHSAHGPEKSQLVSGEATVAELDGLEPDTEYTVHVRAHVAGVDGPPASVVVRTAPEP
MOUSE a1(VII) .687 LGPVRRVHLTQAGSSSVSIT W TGVPGAT GY RVSWHSGHGPEKSLLVSGDATVAEIDGLEPDTEYIVRVRTHVAGVDGAPASVVVRTAPEP
HUMAN a1(VII) .776 VGRVSRLQILNASSDVLRIT
W VGVTGAT AY RLAWGRSEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDREGTPVSIVVTTPPEA
MOUSE a1(VII) .777 VGSVSKLQILNASSDVLRVT W VGVPGAT SY KLAW GRSEGGPMKHRILPGNKESAEIRDLEGGVSYSVRVTALVGDREGAPVSIVITTPPAT.
HUMAN a1(VII) .867 PPALGTLHVVQRGEHSLRLR
W EPVPRAQ GF LLHWQPEGGQEQSRVLGPELSSYHLDGLEPATQYRVRLSVLGPAGEGPSAEVTARTESP
MOUSE a1(VII) .868 PALLETLQVVQSGEHSLRLR W EPVPGAP GF RLHWQPEGGQEQSLTLGPESNSYNLVGLEPATKYQVWLTVLGQTGEGPPRKVTAYTEPS
HUMAN a1(VII) .956 RVPSIELRVVDTSIDSVTLA
W TPVSRAS SY ILSWRPLRGPGQEVPGSPQTLPGISSSQRVTGLEPGVSYIFSLTPVLDGVRGPEA
MOUSE a1(VII) .957 HIPSTELRVVDTSIDSVTLT W TPVSGAS SY ILSWRPLRGTGQEVPRAPQTLPGTSSSHRVTGLEPGISYVFSLTPIQSGVRGSEISVTQTPACS
Collagen a1(XII)
HUMAN a1(XII) ..25 VDPPSDLNFKIIDENTVHMS W AEPVDPIV GY RITVDPTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQT
MOUSE a1(XII) ..25 VDPPSDLNFKIIDENTVHMS W ERPVDPIV GY RITVDPTTDGPTKEFTLAASTTETLLSDLIPETQYVVTITSYNEVEESVPVIGQLTIQT
CHICK a1(XII) ..25 VNPPSDLNFTIIDEHNVQMS W KRPPDAIV GY RITVVPTNDGPTKEFTLSPSTTQTVLSDLIPEIEYVVSIASYDEVEESLPVFGQLTIQTGGPGIP
HUMAN a1(XII) .334 VEPPSNLIAMEVSSKYVKLN
W NPSPSPVT GY
KVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIME
MOUSE a1(XII) .334 IEPPSNLVVTELSSKYIRLS W DPSPSAVT GY KILLTPMAAGSRHHALSVGPQTTTLNVRDLTADTEYQISVFAMKGLTSSEPTSVMEKTQ
CHICK a1(XII) .333 VEPASNLVATQISSKSVRIT W DPSTSQIT GY RVQFIPMIAGGKQHVLSVGPQTTALNVKDLSPDTEYQINVYAMKGLTPSEPITIMEKTQQVKVQVEC
HUMAN a1(XII) .632 YVPPKDLSFSEVTSYGFKTN
W SPAGENVF SY
HITYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDGFSIPLAGEETTEEV
MOUSE a1(XII) .636 YVPPKDLRFTQVTANSFKAE W SPPGDNVF SY HVTYKDANGDDEVTVVEPASSTSVVLNSLRPETLYLVNVTAEYEDGFSVPITGEETTAEV
NEWT a1(XII) ...65 LTPPRDLSFAEVTSSSFRVS W SPAAEDAI AY LVNYTVALGGEEFVVSVPAPTTSTVLTNLFPKTTYEVRVVAEYPEGESPPLKGEETTLEV
CHICK a1(XII). 631 YVPAKNMVFSDVTSDSFKVS W SAAGSEEK SY LIKYKVAIGGDEFIVSVPASSTSSVLTNLLPETTYAVSVIAEYEDGDGPPLDGEETTLEV
HUMAN a1(XII) .723 KGAPRNLKVTDETTDSFKIT
W TQAPGRVL RC RIIYRPVAGGESREVTTPPNQRRRTLENLIPDTKYEVSVIPEYFSGPGTPLTGNAATEEV
MOUSE a1(XII) .727 KGVPRNLKVTDETTDSFKLT W SQAPGRVL RY RIRYRPVSGGESKEVSTPANQRRKTLENLTPDTKYEISVIAEYPSGPGSPLTGNAATEEV
NEWT a1(XII) ..156 RGAPRNLRVTDETTDSFKVG W TPAPGNVL RY RIAYRPVAGGERKEVTVQGNERATTLYNLFPDTKYHVSGVPEYQSGPGTALNGNGATEEVV
CHICK a1(XII) .722 KGAPRNLRITDETTDSFIVG W TPAPGNVL RY RLVYRPLTGGERRQVTVSANERSTTLRNLIPDTRYEVSVIAEYQSGPGNALNGYAKTDEVR
HUMAN a1(XII) .814 RGNPRDLRVSDPTTSTMKLS
W SGAPGKVK QY
LVTYTP-VAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAGDALFGEGTTLEE
MOUSE a1(XII) .818 RGNPRDLRVSDATTSTLKLS W SRAPGKVK QY LVTYTP-AAGGETQEVTVRGDTTTTMLRKLKEGTQYDLSVTALYASGAGEALSGKGSTLEE
NEWT a1(XII) ..248 GEPKNLRVSEPTTSTAMRLT W DKAPGKVQ RY LRNLHSRSAGGDIKEVTVKGDTSTTVLKELDPGTAYTLSVNPLYASGAGTAVTGEGATLQE
CHICK a1(XII) .814 GNPRNLRVSDATTSTTMKLS W SAAPGKVQ HV LYNLHTRYAGVETKELTVKGDTTSKELKGLDEATRYALTVSALYASGAGEALSGEGETLEE
HUMAN a1(XII) .905 RGSPQDLVTKDITDTSIGAY
W TSAPGMVR GY
RVSWKSLYDDVDTGEKNLPEDAIHTMIENLQPETKYRISVFATYSSGEGEPLTGDATTEL
MOUSE a1(XII) .909 RGSPQNLVTKDITDTSIGAY W TSAPGMVR GY RVSWKSLYDDIEAGETTLNGDAIHTMIENLQPETKYKISVFATYSSGEGEPVTGDATTEL
NEWT a1(XII) ..340 RGSPRDLIIKDITDTTIGTS W TAAPGMVR GY RIAWQSLFDDKTGENHVPGDTTNTVLRNLDPETKYRLSVYANYASGEGDPLSGEATTEA
CHICK a1(XII) .906 RGSPRNLITTDITDTTVGLS W TPAPGTVN NY RIVWKSLYDDTMGEKRVPGNTVDAVLDGLEPETKYRISIYAAYSSGEGDPVEGEAFTDVSQSAR
HUMAN a1(XII) .996 SQDSKTLKVDEETENTMRVT
W KPAP-GKVV NY
RVVYRPHGRGKQMVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGKLRQGSGTTASR
MOUSE a1(XII) 1000 SQDSKILRVDEETEHTMRVT W KAAP-GKVV
NY RVVYRPQGGGRQMVAKVPPTVTSTVLKRLQPQTTYDITVLPMYKTGEGKLRQRSGTTASR
NEWT a1(XII) ..430 SPDGKIVKISEETETTMKAT W QPAP-GNVL NY RVVYRPRAGGRQIVAKVPPAVTSTVLRRLTPLTTYDISVIPVYKEGDGKTRQGSGTTLSP
CHICK a1(XII) 1001 TVTVDNETENTMRVSVAALT W EGLVLARVL
PN RSGGRQMFGKVNASATSIVLKRLKPRTTYDLSVVPIYDFGQGKSRKAEGTTASP
HUMAN a1(XII) 1087 FKSPRNLKTSDPTMSSFRVT W
EPAPGEVK GY KVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSGM
MOUSE a1(XII) 1091 FKSPRNLKTSDPTMSSFRVT W EPAPGEVK
GY KVTFHPTGDDRRLGELVLGPYDNTVVLEELRAGTTYRVNVFGMFDGGESLPLVGQEMTTLS
NEWT a1(XII)...521 FNAPRSIKTSEPTRSTFRVT W EPAPGEVK GY KITFHPEGDDGYLGEMMVGPYDSTVVLEELRARTSYKVNVFGVFDDGQS
CHICK a1(XII) 1087 FKPPRNLRTSDSTMSSFRVT W EPAPGRVK
GY KVTFHPTEDDRNLGELVVGPYDSTVVLEELRAGTTYKVNVFGMFDGGESNPLVGQEMTTLSDT
HUMAN a1(XII) 1385 LEAPSNLVISERTHRSFRVS W
TPPSDSVD RY KVEYYPVSGGKRQEFYVSRMETSTVLKDLKPETEYVVNVYSVVEDEYSEPLKGTEKTL
MOUSE a1(XII) 1389 LEAPTNLVISERTHRSFRVS W TPPSDSVD
RY KVEYYPVSGGKRQEFYVSRLDTSTVLKDLKPETDYVVNVYSVVEDEYSEPLKGTEKAL
RABBIT a1(XII).128 LEAPSNLVISERTHRSFRVS W TPPSDSVD RY KVEYYPVSGGKRQEFYVSRLETSTVLKDLKPETEYVVNVYSVVEDEYSEPLKGTEKTL
NEWT a1(XII) ..819 LNPPSNLVTSEPTPRSFRVT W VPPSQSVE RF KVEYYPVAGGRPQEVYVRGTQTTTVLVGLKPETEYYVNVYSVEGNEISEPLAGTETTLPIP
CHICK a1(XII) 1385 LPPPSNLVISEVTPHSFRLR W SPPPESVD
RY RVEYYPTTGGPPKQFYVSRMETTTVLKDLTPETEYIVNVFSVVEDESSEPLIGREITYP
HUMAN a1(XII) 1474 PVPVVSLNIYDVGPTTMHVQ W
QPV-GGAT GY ILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLHDLTSEPVTVREVTLP
MOUSE a1(XII) 1478 PVPVVSLNIYDVGPTTMHVQ W QPV-GGAT
GY TVSYQPTRSPEGTKPKEMRVEPTVNDVQLTGLLPNTEYEVTVQAVLYDLTSEPAKAREVTLP
RABBIT a1(XII) 217 PVPIVSLNIYDIGPTTMRVQ W QPV-GGAT
GY TVSYEPVKTTESTKPKEMRVGPTVNDVQLTDLLPSTEYEVTVQAVLHDLTSEPATAREMTLP
CHICK a1(XII) 1475 LSSVRNLNVYDIGSTSMRVR W EPV-NGAT
GY LLTYEPVNATVPTTEKEMRVGPSVNEVQLVDLIPNTEYTLTAYVLYGDITSDPLTSQEVTLP
HUMAN a1(XII) 1566 LPRPQDLKLRDVTHSTMNVF W
EPVPGKVR KY IVRYKTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPPVTAQETTRP
MOUSE a1(XII) 1570 LPRPQDVKLRDVTHSTMNVV W EPVLGKVR
KY IVRYKTPDEEFKEVEVDRSRASTILKDLSSQTQYTVSVSAVYDEGTSPPATAYDTTRR
RABBIT a1(XII) 309 LPRPQDVKLRDVTHSTMSVF W EPVLGKVR
KY VVRYQTPEEDVKEVEVDRSRTSTSLKDLLSQTLYTVSVSAVYDEGESPPVTAQETTRP
CHICK a1(XII) 1567 LPGPRGVTIRDVTHSTMNVL W DPAPGKVR
KY IIRYKIADEADVKEVEIDRLKTSTTLTDLSSQRLYNVKVVAVYDEGESLPVVASCYSA
HUMAN a1(XII) 1655 VPAPTNLKITEVTSEGFRGT W
DHGASDVS LY RITWGPFGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIYADESESDDLIGSERTLPILTTQAP
MOUSE a1(XII) 1669 VPAPTNLQFTEVTPESFRGT W DHGASDVS
LY RITWAPVGNPDKMETILNGDENTLVFENLNPNTPYEVSITAIYPDESESEDLSGTE
RABBIT a1(XII) 398 VPAPTNLRITEVTPESFRGT W DHGASDVS
LY RITWAPFGSSDKMETILNGDENTLVFENLNPNTLYEVSVTAIYPDESESDDLTGSERTSP
CHICK a1(XII) 1656 VPSPVNLRITEITKNSFRGT W DHGAPDVS
LY RITWGPYGRSEKAESIVNGDVNSLLFENLNPDTLYEVSVTAIYPDESETVDDLIGSERTL
HUMAN a1(XII) 1753 KSGPRNLQVYNATSNSLTVK W
DPASGRVQ KY RITYQPSTGEGNEQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKP
MOUSE a1(XII) 1759 KSGPRNLQVYNATSNSLTIK W DPASGRVQ
KY RITYQPSTGEGNEQTITVGGRQNSVLLQKLKPDTPYTITVYSQYPGGEGGRMTGRGKTKP
RABBIT a1(XII) 489 KSGPRNLQVYNATSNSLTVK W DPASGRVQ
KY RITYQPSRGEGNEQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKP
CHICK a1(XII) 1757 KSGPRNLQVYNATSHSLTVK W DPASGRVQ
RY KIIYQPINGDGPEQSTMVGGRQNSVVIQKLQPDTPYAITVSSMYADGEGGRMTGRGRTKP
HUMAN a1(XII) 1844 LNTVRNLRVYDPSTSTLNVR W
DHAEGNPR QY KLFYAPAAGGPEELVPIPGNTNYAILRNLQPDTSYTVTVVPVYTEGDGGRTSDTGRTLM
MOUSE a1(XII) 1850 LNTVRNLRVYDPSTSSLSVR W DHAEGNPR
QY KLFYAPTSGGPEELVPIPGNTNYAILRNLQPDTPYTITVVPVYTEGDGGRTSDTGRTLV
RABBIT a1(XII) 580 LNTVRNLRVYDPSTSTLNVR W DHAEGNPR
QY KLFYAPTAGGSEELVPIPGNTNYAILRNL
CHICK a1(XII) 1848 LTTVKNMLVYDPTTSTLNVR W DHAEGNPR
QY KVFYRPTAGGAEEMTTVPGNTNYVILRSLEPNTPYTVTVVPVFPEGDGGRTTDTGRTLE
HUMAN a1(XII) 1934 RGLARNVQVYNPTPNRLGVR W
DPAPGPVL QY RVVYSPVDGTRPSESIVVPGNTRMVHLERLIPDTLYSVNLVALYSDGEGNPSPAQGRTLP
MOUSE a1(XII) 1940 RGLSRNIQVYNPTPNSLDVR W DPAPGPVQ
QY RIVYSPVAGTRPSESIVVPGNTRTVHLERLIPDTPYSVYIVALYSDGEGNPSPSQGRTLP
CHICK a1(XII) 1938 RGTPRNIQVYNPTPNSMNVR W EPAPGPVQ
QY RVNYSPLSGPRPSESIVVPANTRDVMLERLTPDTAYSINVIALYADGEGNPSQAQGRTLP
HUMAN a1(XII) 2025 RSGPRNLRVFGETTNSLSVA W
DHADGPVQ QY RIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYEDGDGGHLTGNGRTVG
MOUSE a1(XII) 2031 RSGPRNIRVFGETTNSLSVA W DHADGPVQ
QY RIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAIYEDGDGGHLTGNGRTVG
CHICK a1(XII) 2029 RSGPRNLRVFDETTNSLSVQ W DHADGPVQ
QY RIIYSPTVGDPIDEYTTVPGIRNNVILQPLQSDTPYKITVVAVYEDGDGGQLTGNGRTVG
HUMAN a1(XII) 2116 LLPPQNIHISDEWYTRFRVS W
DPSPSPVL GY KIVYKPVGSNEPMEAFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGLSVPLTDQGTTL
MOUSE a1(XII) 2122 LLPPQNIHIFDEWYTRFRVS W DPSPSPVL
GY KIVYKPVGSNEPMEAFVGEVTSYTLHNLNPSTTYDVSVYAQYDSGLSVPLTDQGTTL
CHICK a1(XII) 2120 LLPPQNIYITDEWYTRFRVS W DPSPSPVL
GY KIVYKPVGSNEPMEVFVGEVTSYTLHNLSPSTTYDVNVYAQYDSGMSIPLTDQGTTL
HUMAN a1(XII) 2204 YLNVTDLKTYQIGWDTFCVK W
SPHRAAT SY RLKLSPADGTRGQEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLEGPGVSVKEHTTVKPTEAPTE
MOUSE a1(XII) 2210 YLNVTDLKTYQVGWDTFCVK W SPHRAAT
SY RLKLSPADGTRGQEITVRGSETSHCFTGLSPEAEYGVTVFVQTPNLEGPGVPIKEQTTVKP
CHICK a1(XII) 2208 YLNVTDLTTYKIGWDTFCIR W SPHRSAT
SY RLKLNPADGSRGQEITVRGSETSHCFTGLSPDTEYNATVFVQTPNLEGPPVSVREHTVLKPTE
Collagen alpha-1 (XIV) a.k.a. Undulin
RAT a1(XIV) ...23 VAPPTRLRYNVISHDSIQIS W KAPRGKFG GY KLLVAPASGGKTNQMNLQNGATKAIIQGLLPEQNYTVQLIAYYKDKESKPAQGQFRI
RAT a1(XIV) ..351 IGPPTELITSEVTARSFMVN W THSPGKVE KY RVVYYPTRGGKPEEVVADGRVSSIVLKNLMSSTEYQIAVFAVSAHTASEGLRGTETTL
RAT a1(XIV) ..453 LPMASDLELYDVTENSMRVK
W DAVPGAT GY LILYAPLTEGLAGDEKEMKIGETHTDIELSGLFPNTEYTVTVYAMFGEEASDPATGQETTLP
CHICK a1(XIV) 443 LPMASDLKLYDVSHSSMRAK W NGVAGAT
GY MILYAPLTEGLAADEKEIKIGEASTELELDGLLPNTEYTVTVYAMFGEEASDPLTGQETTLP
RAT a1(XIV) ..545 LTPPRNLRISNVGSNSARLT
W DPTSGKIT GY
RIVYTSADGTEINEVEVDPITTFPLKGLTPLTDYSIAIFSIYEEGQSLPLVGEFTTEE
CHICK a1(XIV) 535 LSPPSNLKFSDVGHNSAKLT W DPASKNVK
GY RIMYVKTDGTETNEVEVGPVSTHTLKSLTALTEYTVAIFSLYDEGQSEPLTGSFTTRK
RAT a1(XIV) ..634 VPAQQYLEIDEVKTDSFRVT
W HPLSAEEG QH KLMWIPVYGGKTQEVALKEEQDSYVVEGLDPGTEYEVSLLAVLDDGSESEVVTAVGTTLL
CHICK a1(XIV) 624 VPPPQHLEVDEASTDSFRVS W KPTSSDIA
FY RLAWIPLDGGESEEVVLSGDADSYVIEGLLPNTEYEVSLLAVFDDETESEVVAVLG
RAT a1(XIV) ..725 QTGIRNLVVDDEAATSLRVT W DISDSNVE HF RVTYLTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEGVSVSAPGKTLP
CHICK a1(XIV) 742 RTGVRNLVIDDETTSSLRVV W DISDHNAQ
QF RVTYLTAKGDRAEEAIMVPGRQNTLLLQPLLPDTEYKVTITPIYADGEGVSVSAPGKTLP
RAT a1(XIV) ..816 TSGPQNLRVSEEWYNRLRIT
W DPPSAPVK GY
RIVYKPVSVPGQTLETFVGADVNTIVMTNLLSGMDYNVKIFASQAAGYSDALTG
CHICK a1(XIV) 833 LSAPRNLRVSDEWYNRLRIS W DAPPSPTM
GY RIVYKSINVPGPALETFVGDDINTILILNLFSGTEYSVKVFASYSTGFSDALTGVAKTL
CHICK a1(XIV) 923 YLGVTNLDTYQVRMTSLCAQ W QLHRHAT AY RVVIESLVDGKKQEVNLGGGVPRHCFFELMPGTEYKISVHAQLQEIEGPAVSIMETTLP
Collagen alpha 1 type XVIII
c.el a1 (XVIII) 32 .VPNAPQNVRIKTQSTSATLW W DAPPD-------PTVLIR GY
TVEYGEGSISQRILIEGPDSTSFTVTRLSPNTNYVFAVSAYNEAEG
c.el a1 (XVIII)136 PPTSVRARIDEKSAAGSAFVS W
DDPNPESSSENSIDSTQK QY VINYGIYESDTQQKVRSNAKAVRLTGLIPGKEYEVAVKVVAGDGRESPWS
Immunoglobulin superfamily member 9; NCAM-like
protein NRT1. human-NP_065840.1/GI:21357327; mouse-NP_291086.1/GI:18426807
human ..491 .SPHVVTNVSVVALPKGANVS
W EPGFDGGYLQ RF
SVWYTPLAKRPDRMHHDWVSLAVPVGAAHLLVPGLQPHTQYQFSVLAQNKLGSGPFSEIVLSAPEGL
mouse ..507 .SPHVVTNVSVVPLPKGANVS
W EPGFDGGYLQ RF
SVWYTPLAKRPDRAHHDWVSLAVPIGATHLLVPGLQAHAQYQFSVLAQNKLGSGPFSEIVLSIPEGLPT
human ..607 .PLSPPRGLVAVRTPRGVLLH
W DPPELVPKRLD GY
VLEGRQGSQGWEVLDPAVAGTETELLVPGLIKDVLYEFRLVAFAGSFVSDPSNTANVSTS
mouse ..622 PPLSPPRGLVAVRTPRGVLLH W
DPPELIPGRLD GY ILEGRQGSQGWEILDQGVAGTEIQLLVPGLIKDVLYEFRLVAFADSYVSDPSNVANISTSGLEV
Surface glycoprotein CDO, Ig
superfamily member human-NP_058648.1/GI:8393084; rat-NP_059054.1/GI:8393087
human 553 PDAPIILSPPQTHTPDTYNLV W RAGKDGGLPIN
AY FVKYRKLDDGVGMLGSWHTVRVPGSENELHLAELEPSSLYEVLMVARSAAGEGQPAMITFRTS
rat ..574 PDAPNILSPPQTHMPDTYTLV W
RTGRDGGMPIN AY FVKYRKLDDGSGAVGSWHTVRVPGSESELHLTELEPSSLYEVLMVARSAVGEGQPAMLTFRTSKEKMASSKN
human 697 .PEAPDRPTISTASETSVYVT W IPRANGGSPIT AF KVEYKRMRTSNWLVAAEDIPPSKLSVEVRSLEPGSTYKFRVIAINHYGESFRSSASRPYQ
rat ..718 .PEAPDRPTISMASETSVYVT
W IPRANGGSPIT AF
KVEYKRMKSSDWLVAAEDIPPSKLSVEVRSLEPGSIYKFRVIVINHYGESFRSSASRPYQVAGFPNRFSNR
human 802 PITGPHIAYTEAVSDTQIMLK W TYIPSSNNNTPIQ
GF YIYYRPTDSDNDSDYKRDVVEGSKQWHMIGHLQPETSYDIKMQCFNEGGESEFSNVMICETK
rat ..823 PITGPHIAYTEAVSDTQIMLK W
TYIPSSNNNTPIQ GF YIYYRPTDSDNDSDYKRDVVEGSKQWHTIGHLQPETSYDIKMQCFNEGGESEFSNVMICET
brother of CDO human-NP_150279.1/GI:15147240;
mouse-NP_766094.1 GI:27777681
human 473 .EAPIILSSPRTSKTDSYELV W
RPRHEGSGRAPIL YY VVKHRKVTNSSDDWTISGIPANQHRLTLTRLDPGSLYEVEMAAYNCAGEGQTAMVTFRTGRR
mouse 468 .EAPIILSSPRTSKTDSYELV W
RPRHEGSSRTPIL YY VVKHRKVTNSSDDWTISGIPANQHRLTLTRLDPGSLYEVEMAAYNCAGEGQTAMVTF
human 606 .PEAPDRPTISTASETSVYVT W IPRGNGGFPIQ SF RVEYKKLKKVGDWILATSAIPPSRLSVEITGLEKGTSYKFRVRALNMLGESEPSAPSRPYV
mouse 601 .PEAPDRPTISTASETSVYVT W
IPRGNGGFPIQ SF RVEYKKLKKVGDWILATSAIPPSRLSVEITGLEKGISYKFRVRALNMLGESEPSAPSRPYVVSGY
human 713 .VAGPYITFTDAVNETTIMLK W MYIPASNNNTPIH GF
YIYYRPTDSDNDSDYKKDMVEGDKYWHSISHLQPETSYDIKMQCFNEGGESEFSNVMICETK
mouse 708 .VAGPYITFTDAVNETTIMLK W
MYIPASNNNTPIH GF YIYYRPTDSDNDSDYKKDMVEGDRYWHSISHLQPETSYDIKMQCFNEGGESEFSNVM
c.elegans UNCoordinated
locomotion UNC-22 protein kinase family member (unc-22)
NP_502274.1 GI:17542662
1596 PKPPKGPLETKNVTAEGLDLV W GTPDPDEGAPVK AY IIEMQEGRSGN W AKVGETKGTDFKVKDLKEHGEYKFRVKALNECGLSDPLTGESVLAKNPYGV
1696 .PGKPKNMDAIDVDKDHCTLA W
EPPEEDGGAPIT GY IIERREKSEKD W HQVGQTKPDCCELTDKKVVEDKEYLYRVKAVNKAGPGDPCDHGKPIKMKAKKASPEFT
1831 PTPPKGPLDIADVCADGATLS W NPPDDDGGDPLT GY IVEAQDMDNKGKYIEVGKVDPNTTTLKVNGLRNKGNYKFRVKAVNNEGESEPLSADQYTQIKDPWDE
1994 .PGKPGRPEITDFDADRIDIA W
EPPHKDGGAPIE EY IVEVRDPDTKE W KEVKRVPDTNASISGLKEGKEYQFRVRAVNKAGPGQPSEPSEKQLAKPKFI
2188 PSKPNGPLEVSDVFEDNLNLS W KPPDDDGGEPIE YY EVEKLDTATGR W VPCAKVKDTKAHIDGLKKGQTYQFRVKAVNKEGASDALSTDKDTKAKNPYDE
2288 .PGKTGTPDVVDWDADRVSLE W
EPPKSDGGAPIT QY VIEKKGKHGRD W QECGKVSGDQTNAEILGLKEGEEYQFRVKAVNKAG
2482 PSSPLGPLEVSNVYEDRADLE W KVPEDDGGAPID HY EIEKMDLATGR W VPCGRSETTKTTVPNLQPGHEYKFRVRAVNKEGESDPLTTNTAILAKNPYEV
2582 .PGKVDKPELVDWDKDHVDLA W
NAPDD-GGAPIE AF VIEKKDKNGR W EEALVVPGDQKTATVPNLKEGEEYQFRISARNKAGTGDPSDPSDRVVAKP
2773 PTKPKGPIEVTDVFEDRATLD W KPPEDDGGEPIE FY EIEKMNTKDGI W VPCGRSGDTHFTVDSLNKGDHYKFRVKAVNSEGPSDPLETETDILAKNPFDR
2874 .PDRPGRPEPTDWDSDHVDLK W
DPPLSDGGAPIE EY QIEKRTKYGR W EPAITVPGGQTTATVPDLTPNEEYEFRVVAVNKGGPSDPSDASKAVIAKPRNLKPHID
3069 PTSPNGPLDVSDVHGDHVTLN W RAPDDDGGIPIE NY VIEKYDTASGR W VPAAKVAGDKTTAVVDGLIPGHEYKFRVAAVNAEGESDPLETFGTTLAKDPFDK
3171 .PGKTNAPEITDWDKDHVDLE W
KPPANDGGAPIE EY VVEMKDEFSPF W NDVAHVPAGQTNATVGNLKEGSKYEFRIRAKNKAGLGDPSDSASAVAKARNVPPVID
3365 PSSPRGPLDVTNIVKDGCDLA W KEPEDDGGAEIS HY VIEKQDAATGR W TACGESKDTNFHVDDLTQGHEYKFRVKAVNRHGDSDPLEAREAIIAKDPFDR
3465 .ADKPGTPEIVDWDKDHADLK W
TPPADDGGAPIE GY LVEMRTPSGD W VPAVTVGAGELTATVDGLKPGQTYQFRVKALNKAGESTPSDPSRTMVAKP
3660 PGAPEGPLRHKDITKESVVLK W DEPLDDGGSPIT NY VVEKQEDGGR W VPCGETSDTSLKVNKLSEGHEYKFRVKAVNRQGTSAPLTSDHAIVAKNPFDE
3759 .PDAPTDVTPVDWDKDHVDLE W
KPPANDGGAPID AY IVEKKDKFGD W VECARVDGKTTKATADNLTPGETYQFRVKAVNKAGPGKPSDPTGNVVAKP
3953 PGTPEGPLKIDEIHKEGCTLN W KPPTDNGGTDVL HY IVEKMDTSRGT W QEVGTFPDCTAKVNKLVPGKEYAFRVKAVNLQGESKPLEAEEPIIAKNQFDV
4053 .PDPVDKPEVTDWDKDRIDIK W
NPTANNGGAPVT GY IVEKKEKGSAI W TEAGKTPGTTFSADNLKPGVEYEFRVIAVNAAGPSDPSDPT
4245 PSAPEGPLEVSDVTKDSCVLN W KPPKDDGGAEIS NY VVEKRDTKTNT W VPVSAFVTGTSITVPKLTEGHEYEFRVMAENTFGRSDSLNTDEPVLAKDPFGT
4346 .PGKPGRPEIVDTDNDHIDIK W
DPPRDNGGSPVD HY DIERKDAKTGR W IKVNTSPVQGTAFSDTRVQKGHTYEYRVVAVNKAGPGQPSDSSAAATA
4538 .PGAPENITYPAVSRHTCTLN W
DAPKDDGGAEIA GY KIEYQEVGSQI W DKVPGLISGTAYTVRGLEHGQQYRFRIRAENAVGLSDYCQGVPVVIKDPFDP
4637 .PGAPSTPEITGYDTNQVSLA W
NPPRDDGGSPIL GY VVERFEKRGGGD W APVKMPMVKGTECIVPGLHENETYQFRVRAVNAAGHGEPSNGSEPVTCRPYVEK
4739 .PGAPDAPRVGKITKNSAELT W
NRPLRDGGAPID GY IVEKKKLGDND W TRCNDKPVRDTAFEVKNLGEKEEYEFRVIAVNSAGEGEPSKPSDL
4935 PGKPTGPIRATDIQADAMTLS W RPPKDNGGDAIT NY VVEKRTPG-GD W VTVGHPVGTTLRVRNLDANTPYEFRVRAENQYGVGEPLETDDAIVAKNPFDT
5034 .PGAPGQPEAVETSEEAITLQ W
TRPTSDGGAPIQ GY VIEKREVGSTE W TKAAFGNILDTKHRVTGLTPKKTYEFRVAAYNAAGQGEYSVNSVPITADN
5231 .PASPQHIRVEDIAPDCCTLY W
MPPSSDGGSPIT NY IVEKLDLRHSDGK W EKVSSFVRNLNYTVGGLIKDNRYRFRVRAETQYGVSEPCELADVVVAKYQFEV
5333 .PNQPEAPTVRDKDSTWAELE W
DPPR-DGGSKII GY QVQYRDTSSGR W INAKMDLSEQCHARVTGLRQNGEFEFRIIAKNAAGFSKPSPPSERCQLKSRFGP
5433 .PGPPIHVGAKSIGRNHCTIT W
MAPLEDGGSKIT GY NVEIREYGSTL W TVASDYNVREPEFTVDKLREFNDYEFRVVAINAAGKGIPSLPSGPIKIQESGGS
5722 PEAPQGPLHISNIGPSTATLS W RPPVTDGGSKIT SY VVEKRDLSKDE W VTVTSNVKDMNYIVTGLFENHEYEFRVSAQNENGIGAPLVSEHPIIARLPFDP
5823 .PTSPLNLEIVQVGGDYVTLS W
QRPLSDGGGRLR GY IVEKQEEEHDE W FRCNQNPSPPNNYNVPNLIDGRKYRYRVFAVNDAGLSDLAELDQTLFQASG
6114 .PEPPRFPIIENILDEAVILS W
KPPALDGGSLVT NY TIEKREAMGGS W SPCAKSRYTYTTIEGLRAGK
METALLOPROTEINASES
Matrix metalloproteinases (MMP-16) & (MMP-17)
HUMAN MMP16 346 NFNTLAILRREMFVFKDQWF W RVRNNRVMD
GY PMQITYFWRGLPPSI
MOUSE MMP16 346 NFNTLAILRREMFVFKDQWF W RVRNNRVMD
GY PMQITYFWRGLPPSI
RAT ..MMP16 346 NFNTLAILRREMFVFKDQWF W RVRNNRVMD GY PMQITYFWRGLPPSI
HUMAN MMP16 393 DAVYEN-SDGNFVFFKGNKY W
VFKDTTLQP GY PHDLITLGSGIPPHGI
MOUSE MMP16 393 DAVYEN-SDGNFVFFKGNKY W VFKDTTLQP
GY PHDLITLGNGIPPHGI
RAT ..MMP16 393 DAVYEN-SDGNFVFFKGNKY W VFKDTTLQP GY PHDLITLGNGIPPHGI
HUMAN MMP17 389 DAVYERTSDHKIVFFKGDRY W VFKDNNVEE
GY PRPVSDFSLPPGGI
MOUSE MMP17 390 DAVYERTSDHKIVFFKGDRY W VFKDNNVEE
GY PRPVSDFSLPPGGI
HUMAN MMP16 440 DSAIWWEDVGKTYFFKGDRY W
RYSEEMKTMDP GY PKPITVWKGIPESP
MOUSE MMP16 440 DSAIWWEDVGKTYFFKGDRY W RYSEEMKTMDP
GY PKPITIWKGIPESP
RAT ..MMP16 440 DSAIWWEDVGKTYFFKGDRY W RYSEEMKTMDP GY PKPITIWKGIPESP
HUMAN MMP17 435 DAAFSWAHNDRTYFFKDQLY W RYDDHTRHMDP
GY PAQSPLWRGVPST
MOUSE MMP17 436 DAVFSWAHNDRTYFFKDQLY W RYDDHTRRMDP
GY PAQGPLWRGVPSM
HUMAN MMP16 488 QGAFVHKENGFTYFYKGKEY W
KFNNQILKVEP GY PRSILKDFMGCDGPTDRVK
MOUSE MMP16 488 QGAFVHKENGFTYFYKGKEY W KFNNQILKVEP
GY PRSILKDFMGCDGPT
RAT ..MMP16 488 QGAFVHKENGFTYFYKGKEY W KFNNQILKVEP GY PRSILKDFMGCDGPT
HUMAN MMP17 482 LDDAMRWSDGASYFFRGQEY W KVLDGELEVAP
GY PQSTARDWLVCGDSQ ADGSVAAGVD
MOUSE MMP17 483 LDDAMRWSDGASYFFRGQEY W KVLDGELEAAP
GY PQSTARDWLVCGEP
Matrix metalloproteinases (MMP-3) & (MMP-10)
HUMAN MMP-3 295 SFDAVSTLRGEILIFKDRHF W RKSLRKLEP
EL HLISSFWPSLPSGV
MOUSE MMP-3 295 FFDAVSTLRGEVLFFKDRHF W RKSLRTPEP
EF YLISSFWPSLPSNM
RAT ..MMP-3 293 SFDAVSTLRGEVLFFKDRHF W RKSLRTPEP GF YLISSFWPSLPSNM
HORSE MMP-3 295 SFDAISTLRGEILFFKDRYF W RKTFRTLVP
EF HPISSFWPSLPSGI
HUMAN MMP10 294 SFDAISTLRGEYLFFKDRYF W RRSHWNPEP
EF HLISAFWPSLPSYL
MOUSE MMP10 294 SFDSVSTLRGEVLFFKDRYF W RRSHWNPEP
EF HLISAFWPTLPSDL
RAT ..MMP10 294 SFDAVTMLRGEFLFFKDRHF W RRTQWNPEP EF HLISAFWPSLPSGL
HUMAN MMP-3 341 DAAYEVTSKDLVFIFKGNQF W
AIRGNEVRA GY PRGIHTLGFPPTVRKI
MOUSE MMP-3 341 DAAYEVTNRDTVFIFKGNQF W AIRGHEELA
GY PKSIHTLGLPATVKKI
RAT ..MMP-3 339 DAAYEVTNRDTVFILKGNQI W AIRGHEELA GY PKSIHTLGLPETVQKI
HORSE MMP-3 341 DAAYEVTSRDSVFIFKGNKF W AIRGNEEQA
GY PRGIHTLGFPPTVRKI
HUMAN MMP10 340 DAAYEVNSRDTVFIFKGNEF W AIRGNEVQA
GY PRGIHTLGFPPTIRKI
MOUSE MMP10 340 DAAYEAHNTDSVLIFKGSQF W AVRGNEVQA
GY PKGIHTLGFPPTVKKI
RAT ..MMP10 340 DAAYEANNKDRVLIFKGSQF W AVRGNEVQA GY PKRIHTLGFPPTVKKI
HUMAN MMP-3 389 DAAISDKEKNKTYFFVEDKY W
RFDEKRNSMEP GF PKQIAEDFPGIDSKIDAVFEEFGFFYFFTGSSQLEFDPNAKKVTHTLKSNSWLNC
MOUSE MMP-3 389 DAAISNKEKRKTYFFVEDKY W RFDEKKQSMEP
GF PRKIAEDFPGVDSRVDAVFEAFGFLYFFSGSSQLEFDPNAKKVTHILKSNSWFNC
RAT ..MMP-3 387 DAAISLKDQKKTYFFVEDKF W RFDEKKQSMDP EF PRKIAENFPGIGTKVDAVFEAFGFLYFFSGSSQLEFDPNAGKVTHILKSNSWFNC
HORSE MMP-3 389 DAAIFDKEKQKTYFFVEDKY W RFDEKRQSMEP
GY PKQIAEDFPGIDSKLDAAFESFGFFYFFSGSSQFEFDPNAKKVTHVLKSNSWFNC
HUMAN MMP10 388 DAAVSDKEKKKTYFFAADKY W RFDENSQSMEQ
GF PRLIADDFPGVEPKVDAVLQAFGFFYFFSGSSQFEFDPNARMVTHILKSNSWLHC
MOUSE MMP10 388 DAAVFEKEKKKTYFFVGDKY W RFDETRHVMDK
GF PRQITDDFPGIEPQVDAVLHEFGFFYFFRGSSQFEFDPNARTVTHILKSNSWLLC
RAT ..MMP10 388 DAAVFEKEKKKTYFFVGDKY W RFDETRQLMDK GF PRLITDDFPGIEPQVDAVLHAFGFFYFFCGSSQFEFDPNARTVTHTLKSNSWLLC
DROSOPHILA MELANOGASTER Sevenless protein.
P13368 GI:14424434
.313 QPQLERAPRADGQSTPLTIR W
AMHFPEHYLASRPFNIQYQFVDHHGEELDLEQEDQDASGETGSSAWFNLADYDCDEYYVCEILEALIPYTQYRFRFELPFGENRDEVLY
.438 ISAPVIEHLMGLDDSHLAVH W
HPGRFTNGPIEGYRLRLSSSEGNATSEQLVPAGRGSYIFSQLQAGTNYTLALSMINKQGEGPVAKGFVQT
.824 AGGKPHSLKALLGAQAAKIS W
KEPERNPYQSADAARSWSYELEVLDVASQSAFSIRNIRGPIFGLQRLQPDNLYQLRVRAINVDGEPGEWTEPLAART
1300 VFVERLATALQEANVSAVLR W DAPEQGQEAPMQALEYHISCWVGSELHEELRLNQSALEARVEHLQPDQTYHFQVEARVAATGAAAGAASHAL
1680 LLLPSSGGSLLKATDCEEQR C LLNLPMITASEDCPLPIPGVRYQLNLTLARGPGSEEHDHGVEPLGQWLLGAGESLNLTDLLPFTRYRVSGILSSFYQKKLALPTLVLAPL
1799 PSPPRNFSVRVLSPRELEVS W LPPEQLRSESVYYTLHWQQELDGENVQDRREWEAHERRLETAGTHRLTGIKPGSGYSLWVQAHATPTKSNSSERLHVRS
1899 FAELPELQLLELGPYSLSLT W AGTPDPLGSLQLECRSSAEQLRRNVAGNHTKMVVEPLQPRTRYQCRLLLGYAATPGAPLYHGTAEVYETLGDAPSQPGKPQ
Rat Osteotesticular phosphatase;
PTPase, receptor type, V NP_149090.1 GI:14861868
mouse ES cell phosphatase-P70289 GI:3183134
RAT ..36 ..GPLLSVNVSSHGKSTSLFLS
W VAAELGGF DY
ALSLRSVNSSGSPEGQQLQAHTNESGFEFHGLVPGSRYQLKLTVLRPCWQNVTITLTARTA
MOUSE 36 ..GPPLSVSVTSRGRPTSLFLS W
VAAEPGGF DY ALCLRAMNLSGFPEGQQLQAHTNESSFEFHGLVPGSRYQLELTVLRPCWQNVTITLTARTA
RAT ..128 PTVVRGLQLHSAGSPARLEAS W SDAPGDQD SY QLLLYHLESQTLACNVSVSPDTLSYSFGDLLPGTQYVLEVITWAGSLHAKTSILQWTE
MOUSE 128 PTVVRGLQLHSTGSPASLEAS W SDASGDQD
SY QLLLYHPESHTLACNVSVSPDTLSYNFGDLLPGSQYVLEVITWAGSLHAKTSILQWTE
RAT ..218 .PVPPDHLALRALGTSSLQAF
W NSSEGAT SF HLMLTDLLGGTNTTAVIRQGVSTHTFLHLSPGTPHELKICASAGPHQIWGPSATEW
MOUSE 218 .PVPPDHLRVRALGTSSLQAF W
NSSEGAT WF HLILTDLLEGTNLTKVVRQGISTHTFLRLSPGTPYQLKICAAAGPHQIWGPNATEW
RAT ..304 TYPSYPSDLVLTPLRNELWAS W KAGLGARD GY VLKLSGPMESTSTLGPEECNAVFPGPLPPGHYTLQLKVLAGPYDAWVE
MOUSE 301 TYPSYPSDLVLTPLWNELWAS W KAGQGARD
GY VLKLSGPVENTTTLGPEECNAVFPGPLPPGHYTLGLRVLAGPYDAWVE
RAT ..384 .GSTWLAESAALPREVPGARL
W LDGLEASKQPGRRALLYSDDAPGSLGNISVPSGATHVIFCGLVPGAHYRVDIASSTGDISQSISGYTS
MOUSE 384 .GSIWLAESAARPMEVPGARL W
LEGLEATKQPGRRALLYSVDAPGLLGNISVSSGATHVTFCGLVPGAHYRVDIASSMGDITQSLTGYTS
RAT ..473 PLPPQSLEVISRSSPSDLTIA W GPAPGQLE GY KVTWHQDGSQRSPGDLVDLGPDTLSLTLKSLVPGSCYTVSAWAWAG
NLDSDSQKIH SCTR
MOUSE 473 PLPPQSLEIISRNSPSDLTIG W APAPGQME
GY KVTWHQDGSQRSPGDLVDLGPDISSLTLKSLVPGSCYTVSAWAWSGNLSSDSQKIH
SCTR
RAT ..565 PAPPTNLSLGFAHQPAALKAS W YHPPGGRD AF HLRLYRLRPLTLES
EKVLPREAQN FSWAQLTAGC EFQVQLSTLW GSERSSSANA TGWT
MOUSE 565 PAPPTNLSLGFAHQPATLRAS W CHPPGGRD
AF QLRLYRLRPLTLESEKILSQEAQNFSWAQLPAGYEFQVQLSTLWGSEESGSANTTGWT
RAT ..655 PPSAPTLVNVTSDAPTQLQVS W AHVPGGRS RY QVTLYQESTRTATSIMGPKEDGTSFLGLTPGTKYKVEVISWAGPLYTAAANVSAWTY
MOUSE 655 PPSAPTLVNVTSEAPTQLHVS W VHAAGDRS
SY QVTLYQESTRTATSIVGPKADSTSFWGLTPGTKYKVEAISWAGPLYTAAANVSAWTY
RAT ..744 PLIPNELLVSMQAGSAVVNLA W PSGPLGQGACHAQLSDAGHLSWEQPLKLGQELFMLRDLTPGHTISMSVRCRAGPLQASTHLVVLS
MOUSE 744 PLTPNELLASMQAGSAVVNLA W PSGPLGRGTCHAQLSDAGHLSWEQPLSLGQDLLMLRNLIPGHTVSLSVKCRAGPLQASTHPLVLS
RAT ..831 VEPGPVEDVLCHPEATYLALN W TMPAGDVDVCLVVVERLVPGGGTHFVFQVNTSGDALLLPNLMPTTSYRLSLTVLGRNSRWSRAVSLVCSTSAEAWHPP
MOUSE 831 VEPGPVEDVFCQPEATYLSLN W TMPTGDVAVCLVEVEQLVPGGSAHFVFQVNTSEDALLLPNLTPTTSYRLSLTVLGGNRQWSRAVTLVCTTSAEVWHPP
Granulocyte colony stimulating factor receptor precursor
human -Q99062/ GI:729564; mouse -P40223/ GI:729565
human 123 ..PAIPHNLSCLMNLTTSSLICQ W
EPGPETHLPT SF TLKSFKSRGNCQTQGDSILDCVPKDGQSHCCIPRKHLLLYQNMGIWVQAENALGTSMSPQLCLDPMDVVK
mouse 124 ..PASPSNLSCLMHLTTNSLVCQ W
EPGPETHLPT SF ILKSFRSRADCQYQGDTIPDCVAKKRQNNCSIPRKNLLLYQYMAIWVQAENMLGSSESPKLCLDPMDVVK
human 230 PMLRTMDPSPEAAPPQAGCLQLC W
EPWQPGLHINQKCELRHKPQRGEASWALVGPLPLEALQYELCGLLPATAYTLQIRCIRWPLPGHWSDWSPSLELRTTERA
mouse 231 PMLQALDIGPDVVSHQPGCLWLS W KPWKPSEYMEQECELRYQPQLKGANWTLVFHLPSSKDQFELCGLHQAPVYTLQMRCIRSSLPGFWSPWSPGLQLRPTMKA
human 334 PTVRLDTWWRQRQLDPRTV--QLF W
KPVPLEEDSGRIQ GY VVSWRPSGQAGAILPLCNTTELSCTFHLPSEAQEVALVAYNSAGTSRPTPVVFSESRG
mouse 335 PTIRLDTWCQKKQLDPGTVSVQLF W KPTPLQEDSGQIQ
GY LLSWNSPDHQGQDIHLCNTTQLSCIFLLPSEAQNVTLVAYNKAGTSSPTTVVFLENEG
human 430 PALTRLHAMARDPHSLWVG W EPPNPWPQ
GY VIEWGLGPPSASNSNKTWRMEQNGRATGFLLKENIRPFQLYEIIVTPLYQDTMGPSQHVYAYSQEMA
mouse 433 PAVTGLHAMAQDLNTIWVD W EAPSLLPQ GY LIEWEMSSPSYNNSYKSWMIEPNGNITGILLKDNINPFQLYRITVAPLYPGIVGPPVNVYTFAGERAP
human 527 PSHAPELHLKHIGKTWAQLE W VPEPPELGKSPLT
HY TIFWTNAQNQSFSAILNASSRGFVLHGLEPASLYHIHLMAASQAGATNSTVLTLMTLTPEGSELHIIL
mouse 531 P-HAPALHLKHVGTTWAQLE W VPEAPRLGMIPLT
HY TIFWADAGDHSFSVTLNISLHDFVLKHLEPASLYHVYLMATSRAGSTNSTGLTLRTLDPSDLNIF
C.elegans Hypothetical protein
C34F6.10 CAB03937.1 GI:3874793
752 .PSAPEDVAVSQVQIDRCTVS W
TAAEGNGSPII GY VVRMMLNGELFSEHFVTESENETNYRHVLKYLDPNTEYLINIAAKNSVGFSEKIE
C. elegans Hypothetical protein
C27B7.7 CAA90982.1 GI:3874523
648 PDSPPDNLKVLINEANQVIVY W NTPNSTTEVT GY LIYYTRDLSLSNDDYKNWQFVEMNNNSTRYKFDLSVGLKPKTFYRVRISGKNSHADGPASEVVEFETAY
Rat neurofascin NP_446361.1
GI:19924211 (1 of 4)
931 ....PRRFRVRQPNLETINLE W
DHPEHPNGILI GY TLRYVPFNGTKLGKQMVENFSPNQTKFSVQRADPVSRYRFSLSARTQVGSGEAATEESP
rat Leukocyte common antigen variant 4 precursor
(L-CA) (CD45) (T200). P04157 GI:116008
343 .PEMLPHVQCKNSTNSTTLVS W
AEPASKHH GY ILCYKKTPSEKCENLANDVNSFEVKNLRPYTEYTVSLFAYVIGRVQRNGPAKDCNFRTK
Tyrosine-protein kinase receptor UFO (AXL oncogene)
Human-P30530/GI:267193 mouse- Q00993/gi:267194
human 426 PLGPPENISATRNGSQAFVH W QEPRAPLQGTLL
GY RLAYQGQDTPEVLMDIGLRQEVTLELQGDGSVSNLTVCVAAYTAAGDGPWSLPVPLEAWR
mouse 327 PLGPPENVSAMRNGSQVLVR W QEPRVPLQGTLL
GY RLAYRGQDTPEVLMDIGLTREVTLELRGDRPVANLTVSVTAYTSAGDGPWSLPVPL
Bifidobacterium protein
.....1906 NSGQPLDVTDLAVAADGNYA
W SVKVTGGP GY
SGLDNTADGGSVAGTRPSDSSEGTKSLKSGTDVTTLAKNPWILGFGNTPISSTTIHKQ