EGF-LIKE MOTIF SEQUENCES
GROWTH FACTORS
human......EGF.1
.NSDSE C PLSHDGY C LHDGV
C MYIEAL----DKYA C N C VV GY
IGER C QYRDLKWWELR
mouse......EGF.1
.NSYPG C PSSYDGY C LNGGV
C MHIESL----DSYT C N C VI GY
SGDR C QTRDLRWWELR
rat........EGF.1
.NSNTG C PPSYDGY C LNGGV
C MYVESV----DRYV C N C VI GY
IGER C QHRDLR
guinea pig.EGF.1
.QDAPG C PPSHDGY C LHGGV
C MHIESL----NTYA C N C VI GY
VGER C EHQDLDLWE
horse......EGF.1
.NSYQE C SQSYDGY C LHGGK
C VYLVQV----DTHA C N C VV GY
VGER C QHQDLR
pig........EGF.1
.NSYSE C PPSHDGY C LHGGV
C MYIEAV----DSYA C N C VF GY
VGER C QHRDLKWWELR
human...TGFa.1
.VVSFHND C PDSHTQF C FHG-T
C RFL--VQE--DKPA C V C HS GY
VGAR C EHADLLA
rat.....TGFa.1
.VVSFHNK C PDSHTQY C FHG-T
C RFL--VQE--EKPA C V C HS GY
VGVR C EHADLLA
macaque.TGFa.7 .VVSHFND
C PDSHTQF C FHG-T C RFL-AV-E--DRPA C V C HS GY VGAR C EHADLLA
sheep...TGFa.1
.VVSHFND C PDSHTQF C FHG-T
C RFL--LQE--EKPA C V C HS GY
VGAR C EHADLLA
pig.....TGFa.1
.VVSHFND C PDSHSQF C FHG-T
C RFL--VQE--DKPA C V C HS GY
VGAR C EHADLLA
rabbit..TGFa.1
.VVSHFNQ C PDSHTQF C FHG-T
C RFL--VQE--DKPA C V C HS GY
VGAR C EHADLLA
human...AR.41
....KKKNP C NAEFQNF C IHG-E
C KYIEHL----EAVT C K C QQ EY
FGER C GEKSMKTHSMID...
sheep...AR.34
....KRKNL C DTEFQNF C IHG-K
C TFLEQL----ETVS C Q C YP EY
FGER C GEK...
human HBEGF.30 ...KKRDP C LRKYKDF C IHG-E
C KYVKEL----RAPS C I C HP GY
HGER C HGLSLPVENRLY...
mouse HBEGF101 ...KKRDP C LRKYKDY C IHG-E C RYLQEF----RTPS C K C LP GY HGHR C HGLTLPV
monkeyHBEGF104. ...KRDP C LRKYKDF C IHG-E C KYVKEL----RAPS C I C HP GY HGER C HGLSLP...
pig ..HBEGF.1 KGLGKKRDP C LRKYKDF C IHG-E C KYVKEL----RAPS
C I C HP GY HGER
C HGLSLPVKNRL...
rat ..HBEGF103 ...KKRDP
C LKKYKDY C IHG-E C RYLKEL----RIPS C H C LP GY HGQR C HGLTLPVENP...
chick HBEGF107....KKRDP
C LRKYKDF C IHG-E C KYIREL----GAPS C I C QP GY HGER C HGLLLPVEHP...
human NTAK 340....GHARK
C NETAKSY C VNGGV C YYIEGI----NQLS C K C PN GF FGQR C LEKLPLRLY...
mouse NTAK 356 ...GHARK C NETAKSY C
VNGGV C YYIEGI----NQLS C K C
PN GF FGQR C LEKLPLRLY...
human .BTC 62
..RKGHFSR C PKQYKHY C IKG-R C RFV--VAE--QTPS
C V C DE GY IGAR
C ERVDLFYLRGDR...
mouse .BTC 62...VKTHFSR C PKQYKHY C IHG-R C RFV--VDE--QTPS
C I C EK GY FGAR
C ERVDLFYLQQDRG...
rat SDGF .116...RKKKKNP C AAKFQNF C IHG-E C RYIENL----EVVT
C H C HQ DY FGER
C GEKTMKTQKK
VacciniaVGF38
..DIPAIRL C GPEGDGY C LHG-D C IHARDI----DGMY
C R C SH GY TGIR
C QHVVLVDYQ
Myxoma .MGF30
..IIKRIKL C NDDYKNY C LNNGT C FTV-ALNNVSLNPF
C A C HI NY VGSR
C QFINLITIK
FibromaSFGF30
..IVLHVKV C NHDYENY C LNNGT C FTI-ALDNVSITPF
C V C RI NY EGSR
C QFINLVTY
rat .NDF ..177
...SHLIK C AEKEKTF C VNGGE
C FTVKDLSNPS-RYL C K C QP GF
TGAR C TENVPMKVQTQE
humanHRGa .177 ...SHLVK
C AEKEKTF C VNGGE C FMVKDLSNPS-RYL C K C QP GF TGAR C TENVPMKVQNQE
humanHRGb1 177 ...SHLVK C AEKEKTF C
VNGGE C FMVKDLSNPS-RYL C K C
PN EF TGDR C QNYVMASFYKHL
humanHRGb2 177 ...SHLVK C AEKEKTF C
VNGGE C FMVKDLSNPS-RYL C K C
PN EF TGDR C QNYVMASFYKAE
humanHRGb3 177 ...SHLVK C AEKEKTF C
VNGGE C FMVKDLSNPS-RYL C K C
PN EF TGDR C QNYVMASFYSTST
pro-ARIA ..136
...SHLTK C DIKQKAF C VNGGE
C YMVKDLPNPP-RYL C R C PN EF
TGDR C QNYVMASFYKHLGI
...GGFI..
177 ....SHLVK C AEKEKTF C VNGGE
C FMVKDLSNPS-RYL C K C PN EF
TGDR C QNYVMASFYSTSTP
human.Cripto74
.........SKELNRT C C LNGGT
C ML---------ESF C A C PP SF
YGRN C EHD...
mouse.Cripto55
.....GIQNSKSLNKT C C LNGGT
C IL---------GSF C A C PP SF
YGRN C EHD...
Return-to-Table-of-Contents
Human.EGFPPr1..314
......EQKL C KLRKGN
C SSTVCGQDLQSHLCM C AE GY
ALSRDRKY C ED
Mouse.EGFPPr1..327
...............RGRP C RFGLCERDPKSHSSA C
AE GY TLSRDRKY C E
Rat...EGFPPr1..312
...QASDSER C KQRRGQ C LYSLSERDPNSDSSA C AE GY TLSRDRKY C
E
Human EGFPPr2..357.....VNE
C AFWNHG C TLG C KNTPGSYY C T C PV GF
VLLPDGKR C HQ
Mouse.EGFPPr2..362....DVNE C ATQNHG C TLG C ENTPGSYH C T C
PT GF VLLPDGKQ C HELVS
Rat...EGFPPr2..357
...DVNE C ALQNHG C TLG C ENIPGSYY C T C PT GF VLLPDGKR C
HE
Human EGFPPr3..398 ....lvs
c prnvse c shd c vltsegpl c f c pe gs vlerdgkt c s
Mouse EGFPPr3..404 ....lvs
c pgnvsk c shg c vltsdgpr c I c pa gs vlgrdgkt c t
Rat ..EGFPPr3..399
....lva c pgnrse c shd c iltsdgpl c i c pa
gs vlgkdgkt c t
Human EGFPPr4..438.....G
C SSPDNGG C SQL
C VPLSPVSWE C D C FP GY
DLQLDEKS C AASG...
Mouse EGFPPr4..443 ...TG
C SSPDNGG C SQI
C LPLRPGSWE C D C FP GY
DLQSDRKS C AASGPQPL...
Rat ..EGFPPr4..439.....G C SFSDNGG C
SQI C LPLSLASWE C D C
FP GY DLQLDRKS C AAS...
Human EGFPPr5..741 ...GADP C LYQNGG C EHI C KKRLGTAW
C S C RE GF MKASDGKT
C LALDG
Mouse EGFPPr5..747 ...GADP C LYRNGG
C EHI C QESLGTAR C L C
RE GF VKAWDGKM C LPQDYPIL
Rat ..EGFPPr5..743
...GADP C LHRNGG C EHI
C QESLGTAQ C L C RE GF
VKAPDGKM C LTRKD
Human EGFPPr6..831....DQDD
C APVG C SMYAR C ISEGEDAT C Q C LK GF
AGDGKL C SD
Mouse EGFPPr6..838....YEDD
C GPGG C GSHAR C VSDGETAE C Q C
LK GF ARDGNL C S
Rat ..EGFPPr6..835....YEDD C GPGG C GSHAH
C ISEGEAAV C Q C LK GF
AGDGNL C SD
Human EGFPPr7..871..IDE
C EMGVPV C PPASSK C INTEGGYV C R C SE GY QGDGIH C LD
Mouse EGFPPr7..877 DIDE C VLARSD C PSTSSR C
INTEGGYV C R C SE GY
EGDGIS C FD
Rat ..EGFPPr7..875..IDE C ELGSSD C PPTSSR C INTEGGYV C Q C
SE GY EGDGIY C LD
Human EGFPPr8..913...ide
c qlgvhs c genas c tnteggyt c m c agrlsepgli c pdstppphl...
Mouse EGFPPr8..920...ide
c qrgahn c aenaa c tnteggyn c t c agrpsspgrs c pds...
Rat ..EGFPPr8..917...vde c qqgshg c senat c tnteggyn c t c agcpsapglp
c pdstspsl...
Return-to-Table-of-Contents
Blood Coagulation Factor
VII
rabbit VII ..85 .........DGDQ
C ASNP C QNGGS C EDQIQSYI C F C
LA DF EGRN C EKNK
bovine VII ..46 ........VDGDQ
C ESNP C LNGGM C KDDINSYE C W C
QA GF EGTN C E
mouse..VII ..85
........SDGDQ C ASNP C QNGGT
C QDHLKSYV C F C LL DF
EGRN C EKSK
bovine VIIa..45 ........NDGDQ
C ASSP C QNGGS C EDQLRSYI C F C
PD GF EGRN C ETDK
human..VIIa 105 ........SDGDQ
C ASSP C QNGGS C KDQLQSYI C F C
LP AF EGRN C ETHK
human..VIIb..83
........SDGDQ C ASSP C QNGGS
C KDQLQSYI C F C LP AF
EGRN C E
rabbit VII..125 .......NDQLI
C MYENGG C EQY
C SDHVGSQRS C R C HE GY
TLLPNGVS C TPTV...
bovine VII...84.........LDAT
C SIKNGR C KQF
C KRDTDNKVV C S C TD GY
RLAEDQKS C EPAV...
mouse..VII..127
.......NEQLI C ANENGD
C DQY C RDHVGTKRT C S C
HE DY TLQPDEVS C KPKVEY
bovine VIIa..86 .......QSQLI
C ANDNGG C EQY
C GADPGAGRF C W C HE GY
ALQADGVS C APTVEY
human..VIIa 146 .......DDQLI
C VNENGG C EQY
C SDHTGTKRS C R C HE GY
SLLADGVS C TPTVEY...
human..VIIb 124 .......DDQLI
C VNENGG C EQY
C SDHTGTKRS C R C HE GY
SLLADGVS C TPTVEY
Return-to-Table-of-Contents
Milk-fat Globule Protein
(Factor VIII)
bovine VIII 17 ....FAASGDF C DSSL
C LHGGT C LLNEDRTPPFY C L C
PE GF TGLL C NETE
mouse..VIII 31 ....FAASGDF
C DSSL C LNGGT C LTGQD--NDIY C L C
PE GF TGLV C NETE
rat....VIII 25 ........GDF
C DSSL C LNGGT C LMGQD--NDIY C L C
PE GF TGLV C NETE
pig....VIII 01 ......FSGDF
C DSSL C LHGGT C LLDQDPQKPFH C L C
PE GF TGLI C NETE
human..VIII..23..ALDI C SKNP C HNGGL
C EEISQEVRGDVFPSYT C T C LK GY
AGNH C ETK...
bovine VIII..63...HGP
C FPNP C HNDAE C QVTDDSHRGDVFIQYI C K C PL GY VGIH C ETT...
mouse..VIII..65...RGP C SPNP C YNDAK
C LVTLDTQRGDIFTEYI C Q C PV GY
SGIH C ETE...
rat....VIII..65...KGP C SPNP C FHDAK
C LVTEDTQRGDIFTEYI C Q C PV GY
SGIH C ELG...
pig....VIII..45...KGP S SPNP C HNDAE
C EVTLDTERGDIFTEYI C K C PH GY
TGIH C EII...
Return-to-Table-of-Contents
Blood Coagulation Factor
IX
human. IX 93 ............DGDQ
C ESNP C LNGGS C KDDINSYE C W C
PF GF EGKN C E
dog... IX 86 ............DGDQ C ESNP C LNDGV C KDDINSYE C W C
RA GF EGKN C E
mouse. IX 81 ............DGDQ C ESNP C LNGGI C KDDISSYE C W C
QV GF EGRN C E
bovine IX 46 ...........VDGDQ C ESNP C LNGGM C KDDINSYE C W C
QA GF EGTN C E
human. IX 130 ..........LDVT C NIKNGR C EQF C KNSADNKVV
C S C TE GY RLAENQKS
C EPAVPF...
dog... IX 123 ..........LDVT C NIKNGR
C KQF C KLGPDNKVV C S C
TT GY QLAEDQRS C EPAVPFP...
mouse. IX 118 ..........LDAT C NIKNGR
C KQF C KNSPDNKVI C S C
TE GY QLAEDQKS C EPTVPFP...
bovine IX .84 ..........LDAT
C SIKNGR C KQF
C KRDTDNKVV C S C TD GY
RLAEDQKS C EPAVPF
Return-to-Table-of-Contents
Blood Coagulation Factor X
human...... X.
86........DGDQ C ETSP C QNQGK
C KDGLGEYT C T C LE GF
EGKN C EL
bovine .....X.
86........DGDQ C EGHP C LNQGH
C KDGIGDYT C T C AE GF
EGKN C EFS
rabbit .....X.
85 ......VDGDQ C ESNP C QNQGT
C KDGLGMYT C S C VE GY
EGQD C EP
chicken ....X.
86........DGDQ C SSNP C HYGGQ C KDGLGSYT C
S C LD GY QGKN
C EFV
Tropidechis X. 46 .......DGDQ
C SSNP C HYRGT C KDGIGSYT C T C LP NY
EGKN C EKV
human...... X.124....FTRKL C SLDNGD
C DQF C HEEQN---SVV C S C
AR GY TLADNGKA C IPTGPY...
bovine......X 125.....TREI
C SLDNGG C DQF
C REERS---EVR C S C AH GY
VLGDDSKS C VSTERFP...
rabbit.. ...X
124 ...VTRKL C SLDNGG
C DQF C KEEEN---SVL C S C AS GY
TLGDNGKS C IST
chicken ....X 125.....IPKY
C KINNGD C EQF
C SIKKSVQKDVV C S C TS GY
ELAEDGKQ C VSKVKYP...
Tropidechis X. 85 ....LYQS
C RVDNGN C WHF C KRVQS---ETQ C S C
AE SY RLGVDGHS C VAEGDFS...
Return-to-Table-of-Contents
Blood Coagulation Factor
XII
human..... XII .94......vkdh c skhsp c qkggt c vnmpsgph c l c pq hl
tgnh c qk...
guinea pig XII .93......vkdh
c skhnp c qrggi c vntlssph c l c pd hl tgkh c qr...
bovine.... XII .81...PKKVKDH
C SKHNP C QKGGT C VNMPDGPR C I C AD HF TGKH
C...
human..... XII 132 .....kek
c fepqllrffhkneiwyrteqaavar c q c kg pd ah c qrlas
guinea pig XII 132 ......ek c fepqlhrffheneiwfrtgpagvak
c h c kg pd ah c kqmhsqe c qtn
bovine.... XII 121 ....qkek
c fepqffrffheneiwhrlepagvvk c q c kg pn aq c kp
human..... XII 173 .....LASQA
C RTNP C LHGGR C LEVEGHRL C H C
PV GY TGPF C DVDTK...
guinea pig XII 174 .....MHSQE C QTNP C LNGGR C LEVEGHHL C D C
PM GY TGPF C DL...
bovine.... XII 163 .....LASQV
C RTNP C LNGDS C LQAEGHRL C R C
AP SF AGRL C DVDLK...
Return-to-Table-of-Contents
Urokinase-Type Plasminogen
Activator
Human. uPA 27 ...........VPSN
C D C LNGGT C VSNKYFSNIHW C N C
PK KF GGQH C EIDKSKT...
Baboon uPA 26 ...........VPSD C G C LNGGT C MSNKYFSSIHW C N C
PK KF GGQH C EIDKSKT...
Bovine uPA 29 ...........GESN C G C LNGGK C VTYKYFSNIQR C S C
PK KF QGEH C EIDTSK...
Mouse. uPA 28 ...........DESN C G C QNGGV C VSYKYFSRIRR C S C
PR KF QGEH C EIDASKT...
Rat... uPA 27 ...........DESN C G C QNGGV C VSYKYFSSIRR C S C
PK KF KGEH C EIDTSKT...
Pig... uPA 29 ...........GASN C G C LNGGK C VSYKYFSNIQR C S C
PK KF QGEH C EIDTSQT...
Chick. uPA 36 ...........QHRE C Q C LNGGT C ITYRFFSQIKR C L C
PE GY GGLH C EIDTNSI...
Tissue Plasminogen Activator
Human. tPA 82 .........PVKS
C SEPR C FNGGT C QQALYFSDFV C Q C
PE GF AGKC C EIDTRAT...
Rat... tPA 79 .........PVRS C SEPR C FNGGT C QQALYFSDFV C Q C
PD GF VGKR C DIDTRAT...
Mouse. tPA 79 .........PVRS C SEPR C FNGGT C QQALYFSDFV C Q C
PD GF VGKR C DIDTRAT...
Bovine tPA 83 .........PVRS C SEPW C FNGGT C RQALYSSDFV C Q C
PE GF MGKL C EIDATAT...
Limulus (leech)
Clot Factor C
Lim /.CF-C 101/....
...KYGTW C SGE C Q C KNGGI C DQRTGA C T C RD RY EGAH C EILK...
Vampire Bat Salivary Plasminogen Activator
Alpha-1 ...83 .........PVNS
C SEPR C FNGGT C WQAVYFSDFV C Q C
PA GY TGKR C EVD...
Alpha-2 ...83 .........PVKS C SELR C FNGGT C WQAASFSDFV C Q C
PK GY TGKQ C EVD...
Beta ......37 .........AYGG C SELR C FNGGT C WQAASFSDFV C Q C
PK GY TGKQ C EVD...
Return-to-Table-of-Contents
Mammalian Protein C
Human..PrC...
93 .....LVLPLEHP C ASLC C GHGT
C IDGIGSFS C D C RS GW
EGRF C QREV
Mouse..PrC...
96..........LDHQ C DSPC C GHGT
C IDGIGSFS C S C DK GW
EGKF C QQEL
Rat... PrC...
92 .....STPPLDHQ C DSPC C GHGT
C IDGLGGFS C S C DK GW
EGRF C QQEM
Rabbit PrC....90.........PSEHP
C SSQC C GHGT C ADSIGGFS C Q C
HG GW EGSF C QYEV
Bovine PrC...104..........SGSP
C DLPC C GRGK C IDGLGGFR C D C
AE GW EGRF C LHEV
Human..PrC...136.........SFLN C SLDNGG
C THY C LEEVGWRR C S C
AP GY KLGDDLLQ C HPAVK...
Mouse..PrC...135.........RFQD C RVNNGG
C LHY C LEESNGRR C A C
AP GY ELADDHMR C KSTVNF...
Rat... PrC...135.........GFQD C RVKNGG
C YHY C LEETRGRR C R C
AP GY ELADDHMH C RPTVNF...
Rabbit PrC...130.........RFSN
C SVDNGG C AHY
C LEEEAGRS C S C AP GY
ELADDHLQ C EPA
Bovine PrC...133.........RFSN
C SAENGG C AHY
C MEEEGRRH C S C AP GY
RLEDDHQL C VSKVTFP...
Return-to-Table-of-Contents
Plasma Protein S
human/. PrS 117 //.....IPDQ
C SPLP C NEDGYMS C KDGKASFT C T C
KP GW QGEK C EF
bovine/.PrS 115 //...NAISDQ
C NPLP C NEDGFMT C KDGQATFT C I C
KS GW QGEK C ES
mouse/. PrS 116//.....AISDQ
C DPIP C NEDGYLA C QDGQAAFT C F C
KP GW QGDR C QY
rat//.. PrS 115//....NAIPDQ
C DPMP C NEDGYLS C KDGQGAFT C I C
KP GW QGDK C QF
rabbit/.PrS .88 //.....IPDQ
C NPLP C SEEGYLN C KDGQATFT C I C
KP GW QGEK C EI
macaque PrS .90 //.....IPDQ C SPLP C NEDGYMS C KDGKASFT C T C
KP GW QGER C EF
human/. PrS 157//.....DINE
C KDPSNINGG C SQI
C DNTPGSYH C S C KN GF
VMLSNKKD C K
bovine/.PrS 157//.....DINE
C KDPVNINGG C SQI
C ENTPGSYH C S C KN GF
VMLSNKKD C K
mouse/. PrS 157//.....DVNE
C KDPSNVNGG C SQI
C DNTPGSYH C S C KR GF
AMLPNKKD C K
rat//.. PrS 157//.....DINE
C KDPSNINGG C SQT
C DNTPGSYH C S C KI GF
AMLTNKKD C K
rabbit/.PrS 128//.....DINE
C KDPTNINGG C SQI
C DNTAGSYH C S C KS GF
VMLANEKD C K
macaque PrS 130//.....DINE C KDPSNINGG
C SQI C DNTPGSYH C S C
KS GF VMLSNKKD C K
human/. PrS 201//.......DVDE
C SLKPSI C GTAV C KNIPGDFE C E C PE GY
RYNLKSKS C ED
bovine/.PrS 201//......
DVDE C VLKPSI C GTAV C KNIPGDFE C E C AE GY KYNPVSKS C DD
mouse/. PrS 201//.......DLDE
C ALKPSV C GTAV C KNIPGDFE C E C PD GY
RYDPSSKS C KD
rat//.. PrS 201//......
DVDE C SLKPSV C GTAV C KNIPGDFE C E C PN GY RYDPSSKS C KD
rabbit/.PrS 172//.......DMDE
C SVKPSV C GTAV C KNTPGDFE C E C SE GY
RYNPTAKS C E...
macaque PrS 174//...... DVDE C S-KPNM C GTAV
C KNIPGDFE C E C PE GY
RYNLKSKS C ED...
human/. PrS.244/..........ide c senm c aql
c vnypggyt c y c dgkk gf klaqdqks c evvs...
bovine/.PrS 244/..........vde
c aenl c aql c vnypggys c y c dgkk gf klaqdqks
c eavp...
rat//.. PrS 244/..........vde
c sent c aql c vnypggys c y c dgkk gf klaqdqrs
c egip...
mouse/. PrS 244/..........vde
c senm c aql c vnfpggys c y c dgkk gf klaqdqks
c eg...
rabbit/.PrS 214/..........ide
c senm c aql c vnypggys c y c dgkk gf klaqdkks
c ea...
macaque PrS 217/..........vde c senm c aql c vnypggyt c y c dgkk gf klaqdqks c eavsv...
Return-to-Table-of-Contents
Mammalian Protein Z
human/.PrZ...
86 .......KGGSP C ISQP C LHNGS
C QDSIWGYT C T C SP GY
EGSN C EL
bovine PrZ... 47 ........GGSP
C ASQP C LNNGS C QDSIRGYA C T C
AP GY EGPN C AF
human/.PrZ...125
.......AKNE C HPERTDG C QHF
C LPGQESYT C S C AQ GY
RLGEDHKQ C VPHDQ...
bovine PrZ... 85 .......AESE
C HPLRLDG C QHF C YPGPESYT C S C AR GH KLGQDRRS
C LPHDR
Return-to-Table-of-Contents
Human Complement
component 1(human and Rodent Mesocrecetus)
human C1r 142 ...DLDE C ASRSKSGEEDPQPQ C QHL
C HNYVGGYF C S C RP GY
ELQEDRHS C QAE
human C1s 131 ...DINE C TDFV-----D-VP- C SHF
C NNFIGGYF C S C PP EY
FLHDDMKN C GVN
MESO /C1s.137
...DVNE C TDFT-----D-VP- C SHF C NNFIGGYF C
S C PP EY FLHDDMRN
C GVN...
Complememt-Activating RARF Serum Protease
Human RARF 136 /...MAVDVDE C KEREDEELS
C DHY C HNYIGGYY C S C
RF GY ILHTDNRT C RVE...
Mouse RARF 144 /......DVDE C KEREDEELS C DHY C HNYIGGYY C S C
RF GY ILHTDNRT C RVE...
Return-to-Table-of-Contents
Zonadhesin Proteins
HUMAN..ZAN6..2619.//..........SP C LQNP C QNDGQ C REQGATFT C E C
EV GY GGGL C MEPR
RABBIT.ZAN...2187.//..........SP C LRNP C QNDGR C REQGTSFT C E C
EP GY GGHL C TEPR
PIG....ZAN...2368.//..........SP C LQNP C QNDGR C REQGTHFT C E C
EL GY GGDL C TEPR
MOUSE..ZAN...5261.//..........SP C LQNP C HNDGR C EEQGATFI C H C
DF GY GGEF C TEPQ
C.EL...ZAN....142...........NEIK C KDNS C GKNAD C YVANHQLN C I C
KP GY TARRNGRD C DMK
C.EL...ZAN....581....IYHE C QNGTMWSDYRP C SDDGS C VLNSIDMQ C K C
NN GY RGDGYN C T
C.EL...ZAN....627..........DINE C VETPGI C GHGQ C VNTPGSYH C T C
DD FW LGDN C NTYKPRR
Mammalian Uromodulin
Human//Umod .28.//..........EARW
C SEC HSNAT C TEDEAVTT C T C
QE GF TGDGLT C V
Rat//..Umod .30.//..........EARR
C SEC HDNAT C VLDGVVTT C S C
QA GF TGDGLV C E
Bovine Umod .30.//..........SAKS C SEC HSNAT C TVDGAATT C A C
QE GF TGDGLE C V
Human//Umod .65.//....DLDE
C AIPGAHN C SANSS C VNTPGSFS C V C
PE GF RLSPGLG C T
Rat//..Umod .67.//....DIDE
C ATPWTHN C S-NSI C MNTLGSYE C S C
QD GF RLTPGLG C I
Bovine Umod .67.//....DLDE C AVLGAHN C SATKS
C VNTLGSYT C V C PE GF
LLSSELG C E
Human//Umod 108.//....DVDE
C AEPGLSH C HALAT C VNVVGSYL C V C PA GY RGDGWH C E...
Rat//..Umod 109.//....DVNE
C TEQGLSN C HSLAT C VNTEGSYS C V C PK GY RGDGWY C E...
Bovine Umod 110.//....DVDE C AEPGLSR C HALAT
C INGEGNYS C V C PA GY
LGDGRH C E...
Return-to-Table-of-Contents
Mammalian Thrombomodulin
Human//TMD 241 .//.......gawd
c svengg c eha c naipgapr c q c pagaalqadgrs c tasa
Mouse//TMD 241 .//........awn
c svengg c eyl c nrstnepr c l c prdmdlqadgrs c arpv
Bovine TMD. 17 .//.......gawa c gvergg c qhe
c kgsagasn c l c padaalqadgrs c glpa
Human//TMD 285.//.........TQS
C NDL C EHF C VPNPDQPGSYS C M C
ET GY RLAADQHR C ED
Mouse//TMD 285.//.........VQS
C NEL C EHF C VSNAEVPGSYS C M C
ET GY QLAADGHR C ED
Bovine TMD..61.//.........EHP C HQL C EHF C HLH--GLGNYT C I C
EA GY QLAADQHR C ED
Human//TMD 327.//.........VDD
C ILEPSP C PQR C VNTQGGFE C H C
YP NY DLVDGE C VEP
Mouse//TMD 325.//.........VDD
C KQGPNP C PQL C VNTKGGFE C F C
YD GY ELVDGE C VEL
Bovine TMD 100.//.........VDD C AQLPSP C PQR C VNTEGGFQ C H C
DT GY ELVDGE C VDP
Human//TMD 366.//.........
VDP C FRAN C EYQ C QPLNQTSYL C V C AE GF APIPHEPHR C Q
Mouse//TMD 365.//.........
LDP C FGSN C EFQ C QPVSPTDYR C I C AP GF APKPDEPHK C E
Bovine TMD 140.//......... VDP C FDNN C EYQ
C QPVGRSEHK C I C AE GF
APVPGAPHK C Q
Human//TMD 406.//............
MF C NQTA C PAD C DPNTQAS C E C PE GY
ILDDGFI C TD
Mouse//TMD 405.//............
MF C NETS C PAD C DPNSPTV C E C PE GF
ILDEGSV C TD
Bovine TMD 180.//............ MF C NQTS C PAD
C DPHYPTI C R C PE GY
IIDEGST C TD
Human//TMD 442.//.........ide
c enggf c s-gv c hnlpgtfe c i c gpdsalarhigtd c dsgk...
Mouse//TMD 441.//.........ide
c sqge- c ftse c rnfpgsye c i c gpdtalagqiskd c d...
Bovine TMD 216.//.........ine c dtn-i c -pgq
c hnlpgtye c i c gpdsalsgqigid c dptq...
Return-to-Table-of-Contents
Thrombospondin
1 & 2
Mouse...TSP1 549.........DG
C LSNP C FAGAK C TSYPDGSWK C GA C PP GY SGNGIQ C KD
Xenopus TSP1 551 .......IDG C LSNP C FAGVK
C TSFIDGSWK C GS C PP GY
RGNGIT C KD
Chick...TSP1 555 ......PIDG
C LSNP C FPGAE C NSYPDGSWS C GP C PA GF LGNGTV C ED
Human...TSP2 549 ......PVDG
C LSNP C FPGAQ C SSFPDGSWS C GF C PV GF LGNGTH C ED
Mouse...TSP2 549 ......PIDG
C LSNP C FPGAK C NSFPDGSWS C GS C PV GF LGNGTH C ED
Mouse...TSP1 589.../VDE
C KEVPDA C FNHNGEHR C KNTDPGYN C LP C
PP RF TGSQPFGRGVEHAMANKQV C KP
Xenopus TSP1 592.../IDE C KEVPDA C FTLNGVHR C ENTEPGYN C LP C
PP RF TGTQPFGKGIEEAKANKQV C KP
Mouse...TSP1 597.../LDE
C IAVSDV C FKVNQVHR C VNTNPGFH C LP C
PP RY KGSQPYGVGLEVAKTEKQV C EP
Human...TSP2 591.../LDE
C ALVPDI C FSTSKVPR C VNTQPGFH C LP C PP RY RGNQPVGVGLEAAKTEKQV C EP
Mouse...TSP2 591.../LDE
C AVVTDI C FSTNKAPR C VNTNPGFH C LP C
PP RY KGNQPFGVGLEDARTEKQV C EP
Mouse...TSP1 647....RNP
C TDGTHD C NKNAK C NYLGHYSDPMYR C E C
KP GY AGNGII C GEDTD...
Xenopus TSP1 650....RNP C ADGTHD C HKNAR C IYLGHYSDPMFR C E C
RP GY AGNGII C GEDTD...
Chick...TSP1 655....ENP
C KDKTHS C HKSAE C IYLGHFSDPMYK C E C RT GY AGDGRI C GED...
Human...TSP2 649....ENP
C KDKTHN C HKHAE C IYLGHFSDPMYK C E C
QT GY AGDGLI C GEDSDLD...
Mouse...TSP2 649....ENP
C KDKTHS C HKNAE C IYLGHFSDPMYK C E C
QI GY AGDGLI C GEDSD...
Return-to-Table-of-Contents
Thrombospondin-3
Human...TSP3 317./......INE
C AHADP C FPGSS C INTMPGFH C EA C PR GY K G TQVSGVGIDYARASKQV C ND
Mouse...TSP3 316.......DINE
C AHADP C FPGSS C INTMPGFH C EA C PP GY K G TRVSGVGIDYARASKQV C ND
Human...TSP3 371.//...IDE
C NDGNNGG C DPNSI
C TNTVGSFK C GP C RL GF
LGNQSQG C LP
Mouse...TSP3 371.//...IDE
C NDGNNGG C DPNSI
C TNTVGSFK C GP C RL GF
LGNQSQG C VP
Human...TSP3 415.//...ART
C HSPAHSP C HIHAH C LFERNGAVS C Q C
NV GW AGNGNV C GTDTD...
Mouse...TSP3 415.//...ART
C HSPAHSP C HIHAH C LFERNGAVS C Q C
NV GW AGNGNV C GPDTD...
Return-to-Table-of-Contents
Thrombospondin-4
Human...TSP4 286 .......PPRR C DSNP C FRGVQ
C TDSRDGFQ C GP C PE GY
TGNGIT C ID
Rat.... TSP4 301 ....PTRPTRR
C DSSP C FRGVR C TDTRDGFQ C GP C
PD GY TGNGIT C SD
Xenopus TSP4 281 .......PKPR C DATS C FRGVR C IDTEGGFQ C GP C
PE GY TGNGVI C TD
Human...TSP4 326...//...DVDE
C KYHP C YPGVH C INLSPGFR C DA C PV GF
TGPMVQGVGISFAKSNKQV C TD
Rat.... TSP4 345...//...
VDE C KYHP C YPGVR C TNLAPGFR C DA C PV GF TGPMVQGVGINFAKTNKQV C TD
Xenopus TSP4 322.....//. VDE C RLNP C FLGVR
C INTSPGFK C ES C PP GY
TGSTIQGIGINFAKQNKQV C TD
Human.. TSP4 380......IDE
C ---RNGA C VPNSI
C VNTLGSYR C GP C KP GY
TGDQIRG C KV
Rat.... TSP4 398......VDE
C ---RNGA C VLNSI
C INTLGSYR C GP C KP GY
TGDQTRG C RT
Xenopus.TSP4 374......TNE
C ENGRNGG C TSNSL
C INTMGSFR C GG C KP GY
VGDQIKG C KP
Human.. TSP4 421......ERN
C RNPELNP C SVNAQ C IEERQGDVT C V C
GV GW AGD-GYI C GKDVD
Rat.... TSP4 439......ERS
C RNPEQNP C SVHAQ C IEERQGDVT C V C
GV GW AGRAGYV C GKD...
Xenopus.TSP4 419......EKS
C RH-GQNP C HASAQ C SEEKDGDVT C T C SV GW AGN-GYL C GKDTD...
Return-to-Table-of-Contents
Mammalian Selectin
Proteins (EGF-like Motifs only)
human.. P-SEL 159 .......YTAS
C QDMS C SKQGE C LETIGNYT C S C
YP GF YG PE C EYVRE...
mouse.. P-SEL 159 .......YTAS
C QDMS C SNQGE C IETIGSYT C S C
YP GF YG PE C EYVKE...
bovine..P-SEL 159 .......YRAS
C QDMS C SKQGE C IETIGNYT C S C
YP GF YG PE C EYVRE...
sheep.. P-SEL 159 .......YRAS
C QDMS C SKQGE C IETIGNYT C S C
YP GF YG PE C EYVRE...
rat.... P-SEL 159 .......YTAS
C QDMS C NSQGE R IETIGSYT C S C
YP GF YG PE C EYVQE...
human.. L-SEL 156 .......YTAS
C QPWS C SGHGE C VEIINNYT C N C
DV GY YGPQ C QFVIQ...
mouse.. L-SEL 156 .......YTAS
C QPGS C NGRGE C VETINNHT C I C
DA GY YGPQ C QYVVQ...
bovine..L-SEL 156 .......YTAS
C KPWS C SGHGQ C VEVINNYT C N C
DL GY YGPE C QFVTQ...
rat.... L-SEL 156 .......YTAS
C QPES C NRHGE C VETINNNT C I C
DP GY YGPQ C QYVIQ...
chimp.. L-SEL 156 .......YTAS
C QPWS C SGHGE C VEIINNYT C N C
DV GY YGPQ C QFVIQ...
baboon..L-SEL 156 .......YTAS
C QPWS C SGHGE C VEIINNYT C N C
DV GY YGPQ C QFVIQ...
ape.... L-SEL 156 .......YTAS
C QPWS C SGHGE C VEIINNYT C N C
DV GY YGPQ C QFVIQ...
macaque L-SEL 156 .......YTAS C QPWS C SGHGE C VEIINNYT C N C
DV GY YGPQ C QFVI...
human.. E-SEL 139 .......YTAA
C TNTS C SGHGE C VETINNYT C K C
DP GF SGLK C EQIVN...
mouse.. E-SEL 139 .......YTAS
C TNAS C SGHGE C IETINSYT C K C
HP GF LGPN C EQAVT...
dog.... E-SEL 139 .......YTAA
C TPTS C SGHGE C VETVNNYT C K C
HP GF RGLR C EQVVT...
pig.... E-SEL 140 .......YTAA
C TPTS C SGHGE C IETINSST C Q C
YP GF RGLQ C EQVVE...
bovine..E-SEL 140 .......YKAA
C NPTP C GSHGE C VETINNYT C Q C
HP GF KGLK C EQVVT...
rabbit..E-SEL 141 .......YTAA
C TEAS C SGHGE C IETINNYS C K C
YP GF SGLK C EQVVT...
rat.... E-SEL 139 .......YTAS
C TNTS C SGHGE C VETINSYT C K C
HP GF LGPK C DQVVT...
Return-to-Table-of-Contents
TGF-beta Binding
protein
Rat...TGF-BP...181
..........tkps c vpp c qnggm c lrpqf c v c
kp gt kgka c eitaaqdt...
Human TGF-BP... 73 .........
RVVI C HLP C MNGGQ C SSRDK C Q C
PP NF TGKL C QIPVHGAS...
Rat...TGF-BP...391
......... RVVI C HLP C MNGGQ
C SSRDK C Q C PP NF
TGKL C QIP...
Human TGF-BP...301 .......INE
C QLQGV C PNGE C LNTMGSYR C T C
KI GF GPDPTFSS C VPDPPVISEEK...
Rat...TGF-BP...618
......DINE C QLQGV C PNGE
C LNTMGSYR C S C KM GF
GPDPTFSS C VPD...
Human TGF-BP...546......EINE
C TVNPDI C GAGH C INLPVRYT C I C YE GY
RFSEQQRK C V
Rat...TGF-BP...865
.....EINE C TVNPDI C GAGH C INLPVRYT C I C YE GY KFSEQQRK C
I
Human TGF-BP...588 .....DIDE
C TQVQHL C SQGR C ENTEGSFL C I C
PA GF MASEEGTN C I
Rat...TGF-BP...907......DIDE C AQAQHL C SQGR
C ENTEGSFL C I C PA GF
IASEEGSN C I
Human TGF-BP...630......DVDE
C LRPDV C GEGH C VNTVGAFR C EY C DS GY
RMTQRGR C E
Rat...TGF-BP...949......DVDE C LRPDV C RDGR C INTAGAFR C EY C DS GY RMSRRGH C E
Human TGF-BP...671......DIDE
C LNPST C PDEQ C VNSPGSYQ C VP C TE GF
RGWNGQ C L
Rat...TGF-BP...990......DIDE C LTPST C PEEQ C VNSPGSYQ C VP C TE GF RGWNGQ C L
Human TGF-BP...711.......DVDE
C LEPNV C ANGD C SNLEGSYM C S C
HK GY TRTPDHKH C R
Rat...TGF-BP..1030.......DVDE C LQPKV C TNGS
C TNLEGSYM C S C HK GY
SPTPDHRH C Q
Human TGF-BP...752.......DIDE
C QQGNL C VNGQ C KNTEGSFR C T C
GQ GY QLSAAKDQ C E
Rat...TGF-BP..1071.......DIDE C QQGNL C MNGQ
C KNTDGSFR C T C GQ GY
QLSAAKDQ C E
Human TGF-BP...793.......DIDE
C QHRHL C AHGQ C RNTEGSFQ C V C
DQ GY RASGLGDH C E
Rat...TGF-BP...1112......DIDE C EHRHL C SHGQ
C RNTEGSFQ C L C NQ GY
RASVLGDH C E
Human TGF-BP...834......DINE
C LEDKSV C QRGD C INTAGSYD C T C
PD GF QLDDNKT C Q
Rat...TGF-BP...1153.....dine c ledssv c qggd c intagsyd c t c pd gl
qlndnkg c q
Human TGF-BP...875......DINE
C EHPGL C GPQGE C LNTEGSFH C V C
QQ GF SISADGRT C E
Rat...TGF-BP...1194.....DINE C AQPGL C APHGE
C LNTQGSFH C V C EQ GF
SISADGRT C E
Human TGF-BP...917......DIDE
C VNNTV C DSHGF C DNTAGSFR C L C
YQ GF QAPQDGQG C V
Rat...TGF-BP...1236.....DIDE C VNNTV C DSHGF
C DNTAGSFR C L C YQ GF
QAPQDGQG C V
Human TGF-BP...959......dvne
c ellsgv c geaf c envegsfl c v c ad en qeyspmtgq c rsrtstd...
Rat...TGF-BP...1278.....dvne c ellsgv c geaf c envegsfl c v c ad en
qeyspmtgq c rsrate...
Human TGF-BP...1098 ....ade
c llfgqei c kngf c lntrpgye c y c kq gt yydpvklq
c fd...
Rat...TGF-BP...1415
...dade c llfgeei c kngy c lntqpgye c y c ke
gt yydpvklq c f
Rat...TGF-BP...1458 .....dmde c qdpns c idgq c vntegsyn c f c th pm vldasekr c vqp...
Human TGF-BP...1294......QAEE
C GILNG C ENGR C VRVQEGYT C D C
LD GY HLDTAKMT C F
Rat...TGF-BP...1612......QAEE C GILNG C ENGR
C VRVQEGYT C D C FD GY
HLDMAKMT C V
Human TGF-BP...1335..DVNE
C DELNNRMSL C KNAK C INTDGSYK C L C
LP GY VPSDKPNY C TPLNTAL...
Rat...TGF-BP...1653..DVNE C SELNNRMSL C KNAK
C INTEGSYK C V C LP GY
VPSDKPNY C TPLNTAL...
Return-to-Table-of-Contents
Mammalian
Angiopoietin Receptors/Endothelial Protein Tyrosine Kinase Receptors (TIE/TEK)
Human. TIE-1 220 ........WGPG C TKECPG C LHGGV
C HDHDGE C V C PP GF
TGTR C EQACREGR
Mouse. TIE-1 218 ........WGPG
C VKDCPG C LHGGV C HDHDGE C V C
PP GF TGTR C EQACREGR
Bovine TIE-1 218 ........WGQD C TKECPG C LHGGV C HDQDGE C V C
PP GF TGTR C EQACREGR
Human. TIE-2 215........KWGPE
C NHLCTA C MNNGV C HEDTGE C I C
PP GF MGRT C EKACELHT
Mouse. TIE-2 216........KWGPD
C SRPCTT C KNNGV C HEDTGE C I C
PP GF MGRT C EKACEPHTF
Bovine TIE-2 215........KWGPE C NRICTA C MNNGI C HEDTGE C I C
PP GF MGRT C EKACEPHTF
Human. TIE-1 264 ..
FGQS C QEQ C PGISG C RGLTF C LPDPYG C S C GS
GW RGSQ C QEACA
Mouse. TIE-1 262 ..
FGQS C QEQ C PGTAG C RGLTF C LPDPYG C S C GS
GW RGSQ C QEACA
Bovine TIE-1 262 .. FGQS C QEQ C PGTSG C RGLTF
C LPDPYG C S C GS GW
RGSQ C QEACA
Human. TIE-2 260 ..
FGRT C KER C SGQEG C KSYVF C LPDPYG C S C AT
GW KGLQ C NEACH
Mouse. TIE-2 260 ..
FGRT C KER C SGPEG C KSYVF C LPDPYG C S C AT
GW RGLQ C NEACP
Bovine TIE-2 260 .. FGRT C KER C SEPEG C KSFVF
C LPDPYG C S C AT GW
KGLQ C NEACQ
Human..TIE-1 308 ......
PGHFGAD C RLQCQ C QNGGT C DRFSG C V C
PS GW HGVH C EKS...
Mouse..TIE-1 310 ......
PDHFGAD C RLQCQ C QNGGT C DRFSG C V C
PS GW HGVH C EKS...
Bovine.TIE-1 306 ......
PGRFGAD C HLQCQ C QNGGT C DRFSG C V C
PS GW HGMH C EKS...
Human..TIE-2 304 ......
PGFYGPD C KLRCS C NNGEM C DRFQG C L C
SP GW QGLQ C ERE...
Mouse..TIE-2 304 ......
SGYYGPD C KLRCH C TNEEI C DRFQG C L C SQ GW QGLQ C EKE...
Bovine.TIE-2 304 ......
pgyygpd c klrcs c tngek c drfqg c l c sp gr qglq c eke...
Very Low-Density Lipoprotein
Receptor
human. VLDLR 356. .......
...HINE C LVNNGG C SHI
C KDLVIGYE C D C AA GF
ELIDRKT C GD
mouse. VLDLR 356. .......
...HINE C LVNNGG C SHI
C KDLVIGYE C D C AA GF
ELIDRKT C GD
rat... VLDLR 356. .......
...HINE C LVNNGG C SHI
C KDLVIGYE C D C AA GF
ELIDRKT C GD
chick. VLDLR 374. .......
...NINE C LVNNGG C SHI
C RDLVIGYE C D C PA GF
ELVDRRT C GD
rabbit VLDLR 356. ....... ...HVNE C LVNNGG C SHI C KDSVIGYE
C D C AA GF ELIDRKT
C GD
human. VLDLR 395 . . ....... . IDE C
QNPGI C SQI C INLKGGYK C E C
SR GY QMDLATGV C KAVGKE...
mouse. VLDLR 395 . . ....... . IDE C QNPGI
C SQI C INLKGGYK C E C
SR GY QMDLATGV C KAVGKE...
rat... VLDLR 397 . . ....... . IDE C QNPGI
C SQI C INLKGGYK C E C
SR GY QMDLATGV C KAVGKE...
chick. VLDLR 415 . . ....... . IDE C QNPGI
C SQI C INLKGGYK C E C
SR GY QMDLATGV C KAVGK...
rabbit VLDLR 397 . ....... . . IDE C QNPGI
C SQI C INLKGGYK C E C
SR GY QMDLATGV C KAVGKE...
human. VLDLR 701 ...SGKNW C EDDMENGG
C EYL C LPAPQINDHSPKYT C S C
PN GY NLEENGRE C QSTSTP...
mouse. VLDLR 701 ...SGKNW C EEDMENGG C EYL C LPAPQINDHSPKYT C S C
PS GY NVEENGRD C QSTATT...
rat... VLDLR 701 ...SGKNW C EEDMENGG C EYL C LPAPQINDHSPKYT C S C
PN GY NLEENGRE C QSTST...
chick. VLDLR 722 ....GRNW C EENMVNGG C SYL C LPAPQINEHSPKYT C T C
PA GY FLQEDGLR C GGFNIS...
rabbit VLDLR 702 ....GKNW C EEDMENGG C EYL C LPAPQINEHSPKYT C S C
PN GY HLEENGRE C QSTATT...
Low-Density Lipo-protein
Receptor
Human... LDLR..314...........GTNE C LDNNGG
C SHV C NDLKIGYE C L C
PD GF QLV--AQRR C ED
Mouse... LDLR..315...........KTNE C LDNNGG
C SHI C KDLKIGSE C L C
PS GF RLV--DLHR C ED
Rat..... LDLR..308............MGV C SVLN-- C EYQ
C HQTPFGGE C F C PP GH IINSNDSRT C ID
Rat'.... LDLR..316............TNE C LDNNGG
C SHI C KDLKIGYE C L C
PS GF RLV--DGHQ C ED
Hampster LDLR..315...........RTNE
C LDNNGG C SHV
C KDLKIGYE C L C PN GF
QLV--DQHR C ED
Rabbit...LDLR..301...........ATNE C MRGNGG
C SHT C FDLRIGHE C H C
PK GY RLV--DQRR C ED
Xenopus..LDLR..312...........GENE C LRNNGG
C SHI C NDLKIGYE C L C
NE GY RLV--DQKR C E
Human... LDLR..355
. ....... . IDE C QDPDT C SQL
C VNLEGGYK C Q C EE GF
QLDPHTKA C KAVGSIAY...
Mouse... LDLR..356
. ....... . IDE C QEPDT C SQL
C VNLEGSYK C E C QA GF
HMDPHTRV C KAVGSIG...
Rat..... LDLR..346. ....... . DFDD C QIWGI C DQK
C ENRQGRHQ C L C EE GY
ILERGQ-H C KSSDSF...
Rat'.... LDLR..356
. ....... . IDE C QEPDT C SQL
C VNLEGSFK C E C RA GF
HMDPHTRV C KAVGSIG...
Hampster LDLR..356 .
....... . IDE C QEPDT C DQL C VNLEGSYK
C E C RA GF HMDPHTRV
C KAVGSVA...
Rabbit...LDLR..342
. ....... . INE C EDPDI C SQL
C VNLAGSYK C E C RA GF
QLDPHSQA C KAVD...
Xenopus..LDLR..352........... DINE C ENPNT C TQI
C INLHGGYK C E C RE GY
QMDPVTAS C KSIGTVAY...
Human..LDLR 661..prgvnw c erttls-NGg c qyl c lpapqinphspkft c a c pd gm llardmrs
c lteaeaa...
Mouse..LDLR 663....gvnw c ettallpNGg
c qyl c lpapqigphspkft c a c pd gm llakdmrs c ltevd...
Rat....LDLR 658 ...ATNP C GSN----NGG
C AQV C VLSHRTDNGGLGYR C K C
EF GF ELDDDEHR C VAVKNFL...
Rat'...LDLR 661..prgvnw c eat-vlpNGg
c qym c lpapqisahspkft c a c pd gm llakdmrs c lpevd...
Hamstr.LDLR 661..prgvnw c ert-alpNGg
c qyl c lpapqinphspkft c a c pd gm llakdmrt c ltevap...
Rabbit.LDLR 651.....vnw c ekt-alpNGg
c qyl c lpapqinshspkft c a c pd gt llaadmrs c rtead...
Bovine.LDLR.
81.qprgvnw c ertal-rNGg c qyl c lpapqinprspkft
c a c pd gm llakdmrs c ltesesav..
XenopusLDLR 661....aenw c eshhl-gNGg
c gyl c lpaphvnarspkft c a c pd gm hlgtdmrn c mkep...
Rat'...LDLR 969 ...gsny c sqttha-NGd
c shf c fpvp-----nfqrv c g c py gm klqrdqmt c egd...
Human. . LDLR2 408.......
...INE C HDPSISG C DHN C TDTLTSFY C
S C RP GY KLMSDKRT
C VD
Rat. ... LDLR 3113.......
...INE C LDSSISR C DHN C TDTITSFY C
S C LP GY KLMSDKRS
C VD
Rat'. .. LDLR 1350.......
...NQDS C SHFNGG C THQ C MQGPFGAT C
L C PL GY QLANDTKT
C ED
Human. . LDLR2 450.
....... . IDE C TEMPFV C SQK C ENVIGSYI
C K C AP GY LREPDGKT
C RQNSNIE...
Rat. ... LDLR 3155.......
. . IDE C KESPQL C SQK C ENVVGSYI C
K C AP GY IREPDGKS
C RQNS...
Rat'. .. LDLR 1392. ....
. ...INE C -DIPGF C SQH
C VNMRGSFR C A C DP EY
TLESDGRT C KVTGSE...
Human. . LDLR2 762.........VSNP
C GTNNGG C SHL C LIKPGGKGFT C E C
PD DF RTLQLSGSTY C MP...
Rat. ... LDLR 3467. .......MSNP
C GTNNGG C SHL C LIKAGGRGFT C A C
PD DF QTVQLRDRTL C M...
Rat'. .. LDLR .1701. .. ...SRNP C AS--AS C SHL
C LLSAQAPRHYSC A C PS GW
-NLSDDSVN- C VRGD...
Human. . LDLR2 1264.
. ... ...KERT C AENI C EQN C TQLNEGGFI
C S C TA GF ETNVFDRTS
C L
Rat. ... LDLR..3968... .......DNRT C AENI C EQN
C TQLSSGGFI C S C RP GF
KPSTSDKNS C Q
Rat'. .. LDLR .2019...........SSNG CSNNPNAC QQI
C LPVPGGMFS C A C AS GF
K-LSPDGRS C SPYNS...
Human. . LDLR2 1306.........DINE
C -EQFGT C PQH C RNTKG-SYE C V C
AD GF TSMSDRPGKR C AAEGS...
Rat. ... LDLR .4009.........DINE C -EEFGI C PQS
C RNSKG-SYE C F C VD GF
KSMSTHYGER C AADGSP...
Human. . LDLR2 1628........vpnl
c kqi--- c shl c llrpggys-- c a c pq gs sfiegstte c daaie
Rat. ... LDLR .4331.......svsnp c kqv--- c shl c llrpggys-- c a c pq
gs dfvtgstvq c daase
Rat'. .. LDLR .2343........nnnp c lqsngg c shf c falpelptpr c g c af
gt lgndgks-- c ats...
Rat'. .. LDLR .2653.........snp c dqfngg c shi c apgpngae-- c q c ph
eg nwylandnky c vvd
Human LDLR2 1661 STTE C DAAIELPINLPPP C RCMHGGN
C YFDETDLPK C K C PS GY
TGKY C EMAFSKGI...
Rat. . LDLR 4365 STVQ C DAASELPVTMPPP C RCMHGGN C YFDENELPK C K C
SS GY SGEY C EVGLSR...
Alpha-2
Macroglobulin Receptor
Human A2Mac 111...............LQGN C SRLG-
C QHH C VPTLDGPT C Y C
NS SF QLQADGKT C K
Human A2Mac 150...............DFDE C
SVYGT C SQL C TNTDGSFI C G C
VE GY LLQPDNRS C KAKNE...
Chick A2Mac 152...............DFDE C TVYGT
C SQT C TNTEGSYT C S C
VE GY LLQPDNRS C KAKNE...
Human A2Mac 474........RSHA C ENDQYGKPGG C
SDI C LLANSHKART C R C RS GF
SLGSDGKS C KKPE...
Chick A2Mac 476........RSHA C EPDQFGKPGG C
SDI C LLGNSHKSRT C R C RS GF
SLGSDGKS C KKPE...
Human A2Mac 802.............vgtnk c
rvnngg c ssl c latpgsrq c a c aedqvldadgvt c lanpsy...
Chick A2Mac 802...............snk c rvnngg
c ssl c latprgrq c a c aedqilgadsvt c eanp...
Human A2Mac 1183..............dq c slnngg
c shn c svapgegiv c s c pl gm elgpdnht c qi
Chick A2Mac 1181..............dq c slnngg c
shn c tvapgegiv c s c pl gm elgadnkt c qi
Human A2Mac 1223.............IQSY C
AKHLK C SQK C DQNK-FSVK C S C
YE GW VLEPDGES C RSLD...
Chick A2Mac 1221.............IQSY C AKHLK C
SQK C EQDK-YNVK C S C
YE GW MLEPDGES C RSLDP...
Human A2Mac 1846............GTNP C SVNNGD
C SQL C LPTSETTRS C M C
TA GY SLRSGQQA C EGVG...
Chick A2Mac 1842............GSNP C SVNNGD C
SQL C LPTSETSRS C M C
TA GY SLKSGQQS C EGVGS...
Human A2Mac 2155 ...........gtnv c avangg
c qql c lyrgrgqra c a c ah gm laedgas c reyagy...
Chick A2Mac 2151 ...........gtnv c aqnngg c
qql c lfrgggrrt c a c ah gm lsedgvs c rdyd...
Human A2Mac 2941.............INE C LSRKLSG
C SQD C EDLKIGFK C R C
RP GF RLKDDGRT C AD
Chick A2Mac 2938.............INE C LNKKLSG
C SQE C EDLKIGYK C R C
RP GF RLKDDGKT C ID
Human A2Mac 2983...............VDE C
STTFP C SQR C INTHGSYK C L C
VE GY APRGGDPHS C KAVTDE...
Chick A2Mac 2980...............IDE C STTYP
C SQK C INTLGSYK C L C
IE GY KLKPDNPTS C KAVTDE...
Human A2Mac 3290 ...........PNHP C KVNNGG
C SNL C LLSPGGGHK C A C
PT NF YLGSDGRT C V...
Chick A2Mac 3287............PNHP C KTNNAG C
SNL C LLSPGGGHK C A C
PT NF YLGSDGKT C V...
Human A2Mac 3781...........KLTS C ATNASI
C GDEAR C VRTEKAAY C A C RS GF
HTVPGQPG C QD
Chick A2Mac 3779...........KSYD C MTNTTM C
GDEAQ C IQAQSSTY C T C RR GF
QKVPDKNS C QD
Human A2Mac 3825...............INE C
LRFGT C SQL C NNTKGGHL C S C
AR NF MKTHNT C KAEGSEY...
Chick A2Mac 3823...............VNE C LRFGT
C SQL C NNTKGSHV C S C
AK NF MKTDNM C KAEGSE...
Human A2Mac 4147...............vtnp
c drkk c ewl c llspsgpv c t c pn gk rldngt
c vpvpsptp
Chick A2Mac 4146...............vtnp c drkk
c ewl c llspsgpv c t c pn gk rldngt c vlip
Human A2Mac 4196.............RPGT C
NLQ C FNGGS C FLNARRQPK C R C
QP RY TGDK C E
Chick A2Mac 4194.............TTDT C DLV C LNGGS C FLNARKQAK C R C
QP RY NGER C Q
Human A2Mac 4233............. LDQ C WEH C RNGGT C AASPSGMPT C R C
PT GF TGPK C T
Chick A2Mac 4232............. INQ C SDY C QNGGL C TASPSGMPT C R C
PT GF TGSR C D
Human A2Mac 4269............. QQV C
AGY C ANNST C TVNQGNQPQ C R C
LP GF LGDR C Q
Chick A2Mac 4268............. QQV C TNY C HNNGS C TVNQGNQPN C R C
PP TF IGDR C Q
Human A2Mac 4305............. YRQ C SGY C ENFGT C QMAADGSRQ C R C
TA YF EGSR C E
Chick A2Mac 4304............. YQQ C FNY C ENNGV C QMSRDGVKQ C R C
PP QF EGAQ C QD
Human A2Mac 4341 ............. vnk c
sr c lega c vvnkqsgdvt c n c td gr vaps c l
Chick A2Mac 4341............... nk c sr c qegk
c ninrqsgdvs c i c pd gk iaps c l
Human A2Mac 4376 ............. t c vgh
c sNGgs c tmnsk-mmpe c q c pphmtgpr c ee...
Chick A2Mac 4375 ............. t c dsy c lNGgt c sisdktqlpe c l c plevtgmr c eefivge...
Chicken nel and
nel-related proteins (Chicken 94 K embryonic protein)
Mouse Nel. 397...... ...GYDF C SEKHT C MENSV
C RNLNDRVV C S C RD GF
RALREDNAY C ED
Rat...Nel. 397......
...GYDF C SEKHT C MENSV C RNLNDRAV C S C RD
GF RALREDNAY C ED
Chick Nel. 397 ..... ...GHDF C TEGHN C MEHSV
C RNLDDRAV C S C RD GF
RALREDNAY C ED
Human Nel2 396.........KGYDF C SERHN C MENSI
C RNLNDRAV C S C RD GF
RALREDNAY C ED
Rat...Nel2 .12.....
...PGHNF C AEAPK C GENSE C KNWNTKAT C E C KN
GY ISVQGNSAY C ED
Human Nel . 52 ....... DIDE C AAKMHY
C HANTV C VNLPGLYR C D C VP GY
IRVDDFS C TE
Mouse Nel. 441......... IDE C AEGRHY C RENTM
C VNTPGSFM C V C KT GY
IRIDDYS C TE
Rat.. Nel. 441......... IDE C AEGRHY C RENTM
C VNTPGSFL C I C QT GY
IRIDDYS C TE
Chick Nel. 441......... VDE C AEGQHY C RENTM
C VNTPGSFM C I C KT GY
IRIDDYS C TE
Human Nel2 441......... IDE C AEGRHY C RENTM
C VNTPGSFM C I C KT GY
IRIDDYS C TE
Rat.. Nel2 .57......... IDE C AAKMHY C HANTV
C VNLPGLYR C D C VP GY
IRVDDFS C TE
Human Nel. 105......... HDE C GSGQHN
C DENAI C TNTVQGHS C T C
KP GY VGNGTI C R
Mouse Nel. 483......... HDE C LTTQHN C DENAL C FNTVGGHN C V C
KP GY TGNGTT C K
Rat.. Nel. 483......... HDE C LTNQHN C DENAL C FNTVGGHN C V C
KP GY TGNGTT C K
Chick Nel. 483......... HDE C VTNQHN C DENAL C FNTVGGHN C V C
KL GY TGNGTV C K
Human Nel2 483......... HDE C ITNQHN C DENAL C FNTVGGHN C V C
KP GY TGNGTT C K
Rat.. Nel2 .99......... HDD C GSGQHN C DKNAI C TNTVQGHS C T C
QP GY VGNGTI C K
Human Nel. 135 ............... AF C
EEG C RYGGT C VAPNK C V C PS GF
TGSH C EKD
Mouse Nel. 523 ............... AF C KDG C RNGGA C IAANV C A C
PQ GF TGPS C ETD
Rat.. Nel. 523 ............... AF C KDG C KNGGA C IAANV C A C
PQ GF TGPS C ETD
Chick Nel. 523 ............... AF C KDG C RNGGA C IASNV C A C
PQ GF TGPS C ETD
Human Nel2 523 ............... AF C KDG C RNGGA C IAANV C A C
PQ GF TGPS C ETD
Rat.. Nel2 139 ............... AF C EEG C RYGGT
C VAPNK C V C PS GF
TGSH C EKD
Human Nel. 168......... IDE C SEGIIE
C HNHSR C VNLPGWHH C E C RS GF
HDDGTYSLSGES C ID
Mouse Nel. 556......... IDE C SEGFVQ C DSRAN
C INLPGWYH C E C RD GY
HDNGMFAPGGES C ED
Rat.. Nel. 556......... IDE C SEGFVQ C DSRAN
C INLPGWYH C E C RD GY
HDNGMFAPGGES C ED
Chick Nel. 556......... IDE C SDGFVQ C DSRAN
C INLPGWYH C E C RD GY
HDNG C FHQVENPV
Human Nel2 556......... IDE C SDGFVQ C DSRAN
C INLPGWYH C E C RD GY
HDNGMFSPSGES C ED
Rat.. Nel2 172......... IDE C AEGFVE C HNYSR
C VNLPGWYH C E C RS GF
HDDGTYSLSGES C ID
Human Nel. 215......... ide c alrtht
c wndsa c inlaggfd c l c pcgps c sg-d c phegglkh...
Mouse Nel. 603......... ide c gtgrhs c tndti
c fnldggyd c r c phgkn c tg-d c vhegkvkhtg...
Rat.. Nel. 603......... ide c gtgrhs c andti
c fnldggyd c r c phgkn c tg-d c vhdgkvkhng...
Human Nel2 603..........ide c gtgrhs c andti
c fnldggyd c r c phgkn c tdve c efsilpeneccprc
Rat.. Nel2 219......... ide c alrtht c wndsa
c inlaggfd c l c psgps c sg-d c phegglkhngqv...
Human Nel. 433.... ...EDNAY C EDKMHY
C HANTV C VNLPGLYR C D C VP GY
IRVDDFS C TE...
Chick Nel. 631 ... HMARTAQETVSMKTKSSTMVRFG
C WRTDR C SV C S C QS GY
VM C RR...
human nel2 631 ... PHGKNCTGD C IHDGKVKHNGQIWVLENDR
C SV C S C QN GF
VM C RRMV C DCE...
human nel. 755....ladnitydirkt c ldsygvsrlsgsvwtmagsp
c tt c k c kngrv c c svdfe c lqnn
human nel2 761....QADTIRNDITKT C LDEMNVVRFTGSSWIKHGTE
C TL C Q C KNGHI C C SVDPQ C LQEL
Human transmembrane brain
protein
hTM-EFF...1272........HMP
C PENLNGY C IHGK C EFIYLLRRAS C R C ES GY TGQH C EK
Human Thyroid Peroxidase
Human Poxase.795.
...KDVNE C ADGAHPP C HASAR C RNTKGGFQ C L C
AD PY ELGDDGRT C VDSGR...
Human Hepatocyte Growth
Factor Activator Protease
HGF-AP... 160 .........ALDP
C ASGP C LNGGS C SNTQDPQSYH C S C
PR AF TGKD C GTE...
HGF-AP... 241 .........RHTA C LSSP C LNGGT C HLIVATGTTV C A C
PP GF AGRL C NIEPDER...
Pancreatic
Secretory Granule Protein
Human PSGP .34 . .. . . ...dkk c eka c rpeee
c lalnstwg c f c rqdlnssdvhslqpqld c gpre...
Rat.. PSGP 179. . .. .....apkk c eia c rpeee
c vfqnnswt c v c rqdlnvsdtlslqplld c ganei...
Dog.. PSGP 158. . . .. ...atdk c knl c rpeea
c sflngtwd c f c rsdlnssdvhslqprln c gake...
Mammalian Nidogen
mouse nid .384 . . ...SQQT
C ANNRHQ C SVHAE C RDYATGFC C R C VA NY TGNGRQ C VAEGSPQ...
human nid .386 . . ...SRQT
C ANNRHQ C SVHAE C RDYATGFC C S C VA GY TGNGRQ C VAEGS...
human nid2 481 ...YNAANKET C EHNHRQ C SRHAF
C TDYATGFC C H C QS KF YGNGKH C LPEGAPHRVNGKVSGH...
mouse nid .666
.. ...LQNP C YIGTHG C DSNAA C RPGPGTQFT
C E C SI GF RGDGQT
C Y
human nid .668 .. ...LQNP
C YIGTHG C DTNAA C RPGPRTQFT C E C
SI GF RGDGRT C Y
human nid2 740 . ...TPVNP C YDGSHM C DTTAR
C HPGTGVDYT C E C AS GY QGDGRN C V
mouse nid .708
. .. . DIDE C SEQPSR C GNHAV C NNLPGTFR C E C
VE GY HFSDRGT C VAAEDQR
human nid .710 . .. .
DIDE C SEQPSV C GSHTI C NNHPGTFR C E C
VE GY QFSDEGT C VAVVDQR
human nid2 802 . .. . DENE C ATGFHR C GPNSV
C INLPGSYR C E C RS GY EFADDRHTC ILITP
mouse nid .756
....PINY C ETGLHN C DIPQRAQ C IYMGGSSYT C S C
LP GF SGDGRA C R
human nid .758 ....PINY
C ETGLHN C DIPQRAQ C IYTGGSSYT C S C LP GF SGDGQA C Q
human nid2 849 ....PANP C EDGSHT C APAGQAR
C VHHGGSTFS C A C LP GY AGDGHQ C T
mouse nid .800.
....... DVDE C QHSR C HPDAF C YNTPGSFT C Q C
KP GY QGDGFR C MPG...
human nid .802. .......
DVDE C QPSR C HPDAF C YNTPGSFT C Q C
KP GY QGDGFR C VPGEVE...
human nid2 893. .......
DVDE C SENR C HPAAT C YNTPGSFS C R C QP GY
YGDGFQ C I...
human nid 1208. ...... .ghny c svnngg
c thl c latpgsrt c r c pd nt lgvd c ierk.
Return-to-Table-of-Contents
Mouse Cell-surface antigen
114/A10
Mouse 114/A10 233 . ....PSDL C NPNP C KGTAS
C VKLHSKHF C L C LE GY
YYNSSLSS C VKGTTFPGD...
Mouse 114/A10 385.... ...sinl c dhyg c vgndssk
c qdilq c t c kp gl drlnpqvpf c va
Mouse 114/A10 427. .. . VT C SQP C NAEEKEQ
C LKMDNGVMD C V C MP GY
QRAN-GNRK C EE...
Bone Morphogenic
Protein
Xenopus....BMP 510 ....EVDE
C SRPNNGG C EQR C VNTLGSYK C A C DP GY
ELGQDKKS C EAA...
Mouse..... BMP 552 ....EVDE
C SRPNRGG C EQR C LNTLGSYK C S C DP GY
ELAPDKRR C EAA...
Mouse..... BMP 708 ....DKDE
C SKDN-GG C QQD C VNTFGSYE C Q C RS GF
VLHDNKHD C KEAG...
Human..... BMP 547 ....EVDE
C SRPNRGG C EQR C LNTLGSYK C S C DP GY
ELAPDKRR C EAA...
Sea-urchin BMP 532.....EKDE C AQPDQGG C MDV
C VNTIGSYR C D C RP GY
ELSSDGRR C EVAAEVYS...
Prostaglandin G/H Synthase
sheep...PgG/HS1 32 ...PVNP C CYYP C QHQGI C VRFGLDRYQ C D C
TRT GY SGPN C TIPEIWTWLR...
human.. PgG/HS1 31 ...PVNP C CYYP C QHQGI C VRFGLDRYQ C D C
TRT GY SGPN C TIPGLWTWLR...
mouse.. PgG/HS1 34 ...PVNP C CYYP C QNQGV C VRFGLDNYQ C D C
TRT GY SGPN C TIPEIWTWL...
rat.... PgG/HS2 15..SHAANP C CSNP C QNRGE C MSIGFDQYK C D C
TRT GF YGEN C TTPEF...
human.. PgG/HS2 15..SHTANP C CSHP C QNRGV C MSVGFDQYK C D C
TRT GF YGEN C STPEF...
mouse.. PgG/HS2 18.....ANP
C CSNP C QNRGE C MSTGFDQYK C D C
TRT GF YGEN C TTPEFL...
chick.. PgG/HS2 18.....ANP
C CSLP C QNRGV C MTTGFDRYE C D C
TRT GY YGEN C TTPEFF...
guinpig PgG/HS2 18.....ANP C CSNP C QNRGE C LSVGFDRYK C D C
TRT GY YGEN C TTPEFL...
Mammalian Meprin A (Endopeptidase-2)
Human.. beta 604........vqdl
c sktt c kndgv c tvrdgkae c r c qs ge dwwymger c ekrgstr
Mouse...beta 606.......avhna
c sevv c qnggi c vvqdgrae c k c pa ge dwwymgkr c ekrg
Rat.....beta 606.......tvhna
c seve c qnggi c tlqegrae c k c pa ge dwwymgkr c ekrg
Human alpha 670 ........FRDP C DPNP
C QNDGI C VNVKGMAS C R C IS GH AFFYTGER C QSAEVHG
Mouse alpha 684 ........FRDP C DPNP C QNEGT
C VNVKGMAS C R C VS GH AFFYAGER C QAMHVHG
Rat...alpha 670........YFRDP
C DPNP C QNEGT C VNVKGMAS C R C VS GH AFFYTGER C QAMHVHGSL
Mouse WT Reeler
Mouse wt reeler 271 ......FYLGPG C LDN C GGHGD C LKEQ- C I C DP
GY SGPN C YLTHSLK...
Mouse wt reeler 646 ......VYIGEA C PKL C SGHGY C TTGAV C I C DE
SF QGDD C SVF...
Human Jagged-1
Human Jagged-1 232 ..NRAI C R--QG C SPKHGS C K--LP-G-D C R C
QY GW QGLY C D
Human Jagged-1 264. . . K C IPHPG C --VHGI C -NE-P--WQ C L C
ET NW GGQL C DKD
Human Jagged-1 297. . LNY C GTHQP C -LNGGT C SNTGPDKYQ C S C
PE GY SGPN C EI
Human Jagged-1 336. .AEHA C LSDP- C -HNRGS
C KET-SLGFE C E C SP GW
TGPT C STN
Human Jagged-1 375. . IDD C -SPNN C -SHGGT C -QDLVNGFK C V C
PP QW TGKT C QLD
Human Jagged-1 413. . ANE C EAKP- C -VNAKS
C KN-LIASYY C D C LP GW
MGQN C DIN
Human Jagged-1 451. . IND C LG-Q- C -QNDAS
C R-DLVNGYR C I C PP GY
AGDH C ERD
Human Jagged-1 488. . IDE C ASNP- C -LNGGH C QNE-INRFQ C L C
PT GF SGNL C QLD
Human Jagged-1 526. . IDY C EPNP- C -QNGAQ C YNRA-SDYF C K C
PE DY EGKN C SHL...
Human Jagged-1 591 ...VRY - ISSNV C GP-HGK
C KSQSGGKFT C D C NK GF
TGTY C HEN
Human Jagged-1 630. . IND C ESNP- C -RNGGT C I-DGVNSYK C I C
SD GW EGAY C ETN
Human Jagged-1 668. . IND C SQNP- C -HNGGT C R-DLVNDFY C D C
KN GW KGKT C HS
Human Jagged-1 705. .RDSQ C DEAT- C -NNGGT C YDEGD-AFK C M C
PG GW EGTT C NIAR
Human Jagged-1 745. . NSS C LPNP- C -HNGGT C VVNG-ESFT C V C
KE GW EGPI C AQ
Human Jagged-1 782. .NTND C SPHP- C -YNSGT
C V-DGDNWYR C E C AP GF
AGPD C RIN
Human Jagged-1 821. . INE C QSSP- C AF-GAT
C V-DEINGYR C V C PP GH SGAK C QEVS...
Vertebrate Tenascin
Human TNSN 148 .........LQPAT--GRLDTRPF C SGRGNFSTEGCG
C V C EP GW KGPN
C SE
Chick TNSN 148..........PNSQTAEGRLDTAPY C SGHGNYSTEICG C V C EP
GW KGPN C SE
Human TNSN 188.................PE C
PGN C HLRGR C IDGQ C I C
DD GF TGED C SQ
Chick TNSN 190.................PA C PRN C LNRGL C VRGK C I C EE
GF TGED C SQ
Human TNSN 219.................LA C PSD C NDQGK C VNGV C I C FE
GY A-AD C SR
Chick TNSN 221.................AA C PSD C NDQGK C VDGV C V C FE
GY TGPD C GE
Human TNSN 249................EI C PVP C SEEHGT C VDGL C V C HD
GF AGDD C NK
Chick TNSN 252................EL C PHG C GI-HGR C VGGR C V C HE
GF TGED C NE
Human TNSN 281.................PL C LNN C YNRGR C VENE C V C DE
GF TGED C SE
Chick TNSN 283.................PL C PNN C HNRGR C VDNE C V C DE
GY TGED C GE
Human TNSN 312.................LI C PND C FDRGR C INGT C Y C EE
GF TGED C GK
Chick TNSN 314.................LI C PND C FDRGR C INGT C F C EE
GY TGED C GE
Human TNSN 343.................PT C PHA C HTQGR C EEGQ C V C DE
GF AGVD C SE
Chick TNSN 345.................LT C PNN C NGNGR C ENGL C V C HE
GF VGDD C SQ
Human TNSN 384.................KR C PAD C HNRGR C VDGR C E C DD
GF TGAD C GE
Chick TNSN 376.................KR C PKD C NNRGH C VDGR C V C HE
GY LGED C GE
Human TNSN 405.................LK C PNG C SGHGR C VNGQ C V C DE
GY TGED C SQ
Human TNSN 436.................LR C PND C HSRGR C VEGK C V C EQ
GF KGYD C SD
Chick TNSN 407.................LR C PND C HNRGR C INGQ C V C DE
GF IGED C GE
Human TNSN 467.................MS C PND C HQHGR C VNGM C V C DD
GY TGED C RD
Chick TNSN 438.................LR C PND C HNRGR C VNGQ C E C HE
GF IGED C GE
Human TNSN 508.................RQ C PRD C SNRGL C VDGQ C V C ED
GF TGPD C AE
Chick TNSN 469.................LR C PND C NSHGR C VNGQ C V C DE
GY TGED C GE
Human TNSN 529.................LS C PND C HGQGR C VNGQ C V C HE
GF MGKD C KE
Chick TNSN 510.................LR C PND C HNRGR C VEGR C V C DN
GF MGED C GE
Human TNSN 560.................QR C PSD C HGQGR C VDGQ C I C HE
GF TGLD C GQ
Chick TNSN 531.................LS C PND C HQHGR C VDGR C V C HE
GF TGED C RE
Human TNSN 591.................HS C PSD C NNLGQ
C VSGR C I C NE GY
SGED C SEV...
Chick TNSN 562.................RS C PND C NNVGR
C VEGR C V C EE GY
MGID C SDVSPPT...
Return-to-Table-of-Contents
Verterbrate Aggrecan Cartilage-specific
Core protein
Human AGGRCN 2164.......PARS C AEEP C GAG-T
C KETEGHVI C L C PP GY
TGEH C NID...
Chick AGGRCN 1855.......DTDE C HSSP C LNGAT C VDGIDSFK C L C
LP SY GGDL C EID...
Vertebrate Versican (chondroitin
sulfate proteoglycan core protein)
Human VRSCN 3089........GPDR C KMNP C LNGGT C YPTETSYV C T C
VP GY SGDQ C ELD
Mouse VRSCN 3052........GPDL C KTNP C LNGGT C YPTETSYV C T C
AP GY SGDQ C ELD
Chick VRSCN 3254........GQDP C KSNP C LNGGT C YPRGSFYI C T C
LP GF NGEQ C ELD
Human VRSCN 3128.........FDE C HSNP
C RNGAT C VDGFNTFR C L C
LP SY VGAL C EQDTET...
Mouse VRSCN 3091........ FDE C HSNP C RNGAT C VDGFNTFR C L C
LP SY VGAL C EQDTE...
Chick VRSCN 3293 ....... IDE C QSNP C RNGAT C IDGLNTFT C L C
LP SY IGAL C EQD...
Mammalian Brevican brain
proteoglycan
mouse..BRVCN 622........SSGD
C IPSP C HNGGT C LEEKEGFR C L C
LP GY GGDL C DVG...
bovine BRVCN 647........SSGD C VPSP C HNGGT C LEEEEGVR C L C
LP GY GGDL C DVGLHF...
rat... BRVCN 622........SSGD
C IPSP C HNGGT C LEEKEGFR C L C
VP GY GGDL C DVGLHF...
Rodent Neurocan Core
protein
..Rat NRCN...950.........TDP C ENNP C LHGGT
C RTNGTMYG C S C DQ GY AGEN C EID
Mouse NRCN...960........PTDP
C ENNP C LHGGT C HTNGTVYG C S C DQ GY
AGEN C EID
..Rat NRCN...988.........IDD C LCSP C ENGGT
C IDEVNGFI C L C LP SY GGNL C EKDTE...
Mouse NRCN...999.........IDD
C LCSP C ENGGT C IDEVNGFI C L C LP SY
GGSL C EKDTEG...
Return-to-Table-of-Contents
Agrin Proteins
..Ray .AGRIN...77....ST C E C NRYGSYSKT
C SPSSGQ C S C KPGVGGLK C DR C EP GF
WNFRGIVTDEKSG C T
..Ray .AGRIN..132. ...P C N C YPLGAVRDD
C EQMSGL C S C KAGISGMK C NQ C PN GS KLGPSG
C DQDPSVS
C. el .AGRIN..918............KRLGSP C TRHEE C EKLSAQ C ITRPGRKSV C D C DD GW KSHLGI C IEISKKRKSD
..Rat..AGRIN.1218.............qqpsks
c dsqp c lHGgt c qdqdsgkgft c s c ta gr ggsv c ekv
Chick..AGRIN.1227.............kkpsrp c dshp c lHGgt c ed-d-greft
c r c pa gk ggav c ekpiry
..Ray .AGRIN..608...............vhrp
c dsqp c lHGgt c ed-d-gvsyt c s c pa gr ggav c erti
..Rat .AGRIN.1435.........gvge
c gdhp c lpnp c hggal c qaleagmfl c q c pp gr fgpt c ade
Chick .AGRIN.1441.........GVGE C GNDP C HPNP C HHGAS C HVKEAEMFH
C E C LH SY TGPT C ADE
..Rat .AGRIN.1480.................ksp
c qpnp c hgaap c rvlssggak c e c pl gr sgtf c qtvlet
Chick..AGRIN.1486.................rnp c dptp c hisat c lvlpeggam
c a c pm gr egef c ervteq
..Rat .AGRIN.1706...........SPFADHP
C TQALGNP C LNGGS C VPREATYE C L C
PG GF SGLH C EKGLVEKS
Chick..AGRIN.1711...........STFRAHP C TQKP-NP C QNGGT
C SPRLESYE C A C QR GF
SGAH C EKVIIEKAA
..Ray..AGRIN.1097..............RWHA
C TKTR-NP C QNGGV C SPRLREYD C M C
QR GF SGPQ C EKALEE
Mammalian
Cartilage Oligomeric Matrix Protein
Human COMP..87......PLLH
C APGF C FPGVA C IQTESGGR C GP C PA GF
TGNGSH C TD
Rat.. COMP..85......PVAL C APGS C FPGVV C TETATGAR C GP C PP GY TGNGSH C TD
Human COMP 128.......VNE C NAHP C FPRVR
C INTSPGFR C EA C PP GY
SGPTHQGVGLAFAKANKQV C TD
Rat.. COMP 126.......VNE
C NAHP C FPRVR C INTSPGFH C EA C
PP GF SGPTHEGVGLTFAKTNKQV C TD
Human COMP 181.....INE C ETGQHN C VPNSV
C INTRGSFQ C GP C QP GF
VGDQASG C QRGA
Rat.. COMP 179.....INE
C ETGQHN C VPNSV C VNTRGSFQ C GP C
QP GF VGDQRSG C QRRG
Human COMP 226....QRF C PDGSPSE C HEHAD
C VLERDGSRS C V C RV GW
AGNGIL C GRDT...
Rat.. COMP 224....QHF
C PDGSPSP C HEKAD C ILERDGSRS C V C
AV GW AGNGLL C GRDTDL...
Mouse .CMP.227.......VSDL C ATGDHD C EQL
C VSSPGSYT C A C HE GF
TLNSDGKT C NV...
Human .CMP.223... ...VSDL C ATGDHD C EQV
C ISSPGSYT C A C HE GF
TLNSDGKT C NV...
ChickenCMP 221... ...VSDL C ATGDHD C EQI C ISTPGSYK C A C
KE GF TLNNDGKT C SA...
Human Del-1
hDEL-1 ....24 .......DI
C DPNP C ENGGI C LPGLADGSFS C E C PD GF TDPN C SSVVEVASDEEEPTSA
hDEL-1 ....76 ..GP
C TPNP C HNGGT C EISEAYRGDTFIGYV C K C PR GF NGIH C QHNI
hDEL-1 ...121 .........NE
C EVEP C KNGGI C TDLVANYS C E C PG EF
MGRN C QYK C SGPLGIEG...
Human dJ100N22.1 (novel EGF-like
domain containing protein).
...... 33 ..... ...DVDE
C SEGTDD C HIDAI C QNTPKSYK C L C KP GY KGE--GKQ
C E
...... 74 ..... ...DIDE
C ENDYYNGG C VHE C INIPGNYR C T C FD GF MLAHDGHN C L
......117 ..... ...DVDE
C QDN--NGG C QQI C VNAMGSYE C Q C HS GF FLSDNQHT C IHRSN...
Seven-Transmembrane
Hormone Receptors (Emr1 & Emr2)(Variable Splice
Combinations)
a.k.a. mouse cell-surface glycoprotein F4/80 or human Leukocyte Antigen
CD97
human EMR1 .31.. ...KGNN
C RDSTL C PAYAT C TNTVDSYY C T C KQ GF
LSSNGQNHFKDPGVR C K
mouse EMR1 .32 .. ...VNE
C QDTTT C PAYAT C TDTTDSYY C T C KR GF
LSSNGQTNFQGPGVE C Q
human EMR2. 25.......
...DSRG C ARW C PQDSS C VNATA C R C
NP GF SSFSEIITTPMET C D
Human CD97. 22.......
...DSRG C ARW C PQNSS C VNATA C R C
NP GF SSFSEIITTPTET C D
human EMR1 .80
. . DIDE C SQSPQP C GPNSS C KNLSGRYK C S C
LD GF SSPTGNDWVPGKPGNFS C T
mouse EMR1 .81 . . DVNE
C LQSDSP C GPNSV C TNILGRAK C S C LR GF SSSTGKDWILGSLDNFL C A
human EMR2. 67 .. DINE
C ATLSKVS C GKFSD C WNTEGSYD C V C SP GY EPVSGAKTFKNESENT C Q
Human CD97. 64 .. DINE
C ATPSKVS C GKFSD C WNTEGSYD C V C SP GY EPVSGAKTFKNESENT C Q
human EMR1 132 .. . DINE C LTSRV C PEHSD
C VNSMGSYS C S C QV GF
ISRNST C E
mouse EMR1 133 .. . DVDE C LTIGI C PKYSN C
SNSVGSYS C T C QP GF
VLNGSI C E
human EMR2 119 .. .DVDE C QQNPRL C KSYGT C
VNTLGSYT C Q C LP GF
KLKPEDPKL C T
human EMR1 172 .. . DVNE C ADPRA C PEHAT C NNTVGNYS C F C
NP GF ESSSGHLSCQGLKAS C E
mouse EMR1 173 .. . dede c vtrdv c pehat c
hntlgsyy c t c ns gl essgggpmfqgldes c e
human EMR2 163 ... DVNE C TSGQNP C HSSTH C
LNNVGSYQ C R C RP GW
QPIPGSPNGPNNTV C E
human EMR1 221 . .... DIDE C TEM C PINST
C TNTPGSYF C T C HP GF
APSSGQLNFTDQGVE C RD
mouse EMR1 222 . . DVDE C SRNSTL C GPTFI C
INTLGSYS C S C PA GF
SLPTFQILGHPADGN C TD
human EMR2 212 ... DVDE C SSGQHQ C DSSTV C
FNTVGSYS C R C RP GW KPRHGIPNNQKDTV C EDMTFSTWT...
Human CD97 116 ... DVDE C SSGQHQ C DSSTV C
FNTVGSYS C R C RP GW KPRHGIPNNQKDTV C EDM...
human EMR1 268 . . DIDE C RQDPST C GPNSI C
TNALGSYS C G C IV GH PNPEGS--QKDGNFS C QRV
mouse EMR1 272 . .... DIDE C DDT C PLNSS C
TNTIGSYF C T C HP GF
ASSNGQLNFKDLEVT C E
mouse EMR1 329 .. .DIDE C TQDPLQ C GLNSV C
TNVPGSYI C G C LP DF
QMDPEGSQGYGNFN C KR...
Mammalian Fibulin
1 & 2
mouse Fib2 ..39... ...llen
c ieealepga c cat c vqqg c a c egyqyyd c vqggfvdg...
Human Fib2 .604.
...DQDE C LLLPGEL C QHL C INTVGSYH C
A C FP GF SLQDDGRT
C RPEGHP...
Mouse Fib2 .594. ...DQDE
C LMLPGEL C QHL C INTVGSYR C A C
FP GF ELQGDGRT C RPD...
Human Fib1 .176...
...LNDR C RGGGP C KQQ C RDTGDEVV C S
C FV GY QLLSDGVS
C EDVNE...
Mouse Fib1 .178. . ...LNDR
C RGGGP C KQQ C RDTGDEVI C S C
FV GY QLQSDGVS C ED...
Human Fib2 .679... ...QPNT
C KDNGP C KQV C STVGGSAI C S C
FP GY AIMADGVS C ED...
Mouse Fib2 .669... ...QPNT
C KDNGP C RQV C RVVGDTAM C S C
FP GY AIMADGVS C EDQ...
Human Fib1 .356....DVDE
C APPAEP C GKGHR C VNSPGSFR C E C
KT GY YFDGISRM C V
Mouse Fib1 .359. ...VDE
C APPAEP C GKGHH C LNSPGSFR C E C
KA GF YFDGISRT C V
Human Fib2 .858....DVNE
C ETGVHR C GEGQV C HNLPGSYR C D C KA GF QRDAFGRG C I
Mouse Fib2 .896. ...VNE
C ETGVHR C GEGQL C YNLPGSYR C D C KP GF QRDAFGRT C I
Human Fib1 .399.
.. DVNE C QRYPGRL C GHK C ENTLGSYL C
S C SV GF RLSVDGRS
C E
Mouse Fib1 .401. .. DINE
C QRYPGRL C GHK C ENTPGSFH C S C
SA GF RLSVDGRS C E
Human Fib2 .901. .. DVNE
C WASPGRL C QHT C ENTLGSYR C S C
AS GF LLAADGKR C E
Mouse Fib2 .938. .. DVNE
C WVSPGRL C QHT C ENTPGSYR C S C
AA GF LLAADGKH C E
Human Fib1 .441.
... . DINE C SSSP C SQE C ANVYGSYQ C
Y C RR GY QLSDVDGVT
C E
Mouse Fib1 .443. ...
. DVNE C LNSP C SQE C ANVYGSYQ C Y C RR GY QLSDVDGVT C
E
Human Fib2 .943. .....
DVNE C EAQR C SQE C ANIYGSYQ C Y C RQ GY QLAE-DGHT C
T
Mouse Fib2 .980. ...
. DVNE C ETRR C SQE C ANIYGSYQ C Y C RQ GY QLAE-DGHT C
T
Human Fib1D 481
..DIDE C ALPTGGHI C SYR C INIPGSFQ C
S C PSS GY RLAPNGRN
C QD
Mouse Fib1 .483 ..DIDE
C ALPTGGHI C SYR C INIPGSFQ C S C
PSS GY RLAPNGRN C QD
Human Fib2 .982 ..DIDE
C AQGA-GIL C TFR C LNVPGSYQ C A C
PEQ GY TMTANGRS C KD...
Mouse Fib2 1019 ..DIDE C AQGA-GIL C TFR C VNVPGSYQ C A C
PEQ GY TMMANGRS C K
Fibrillin I (DEFECTIVE IN MARFAN'S SYNDROME) & Fibrillin II
Human. F-I . 76
...ggnq c ivpi c rhs c gdgf c srpnm c t c ps gq iaps c gsr
Mouse. F-I . 76
...ggnq c ivpi c rhs c gdgf c srpnm c t c ps gq isps c gsr
Bovine F-I . 76 ...ggnq c ivpi c rhs c gdgf
c srpnm c t c ps gq iaps c gsr
Human. FII. 106
...ggnq c ivpi c rns c gdgf c srpnm c t c ss gq isst c gsk
Human. F-I. 115 . . . . . SIQH C NIR C MNGGS
C SDDH C L C QK GY
IGTH C G
Mouse. F-I. 116 . . . . . SIQH C NIR C MNGGS
C SDDH C L C QK GY
IGTH C G
Bovine F-I. 115 . ..
. . .SIQH C NIR C MNGGS C SDDH C L C QK GY IGTH C G
human. FII. 145 . . . . . SIQQ C SVR C MNGGT
C ADDH C Q C QK GY
IGTY C G
human. F-I. 147 . . . . . QPV C ESG C LNGGR
C VAPNR C A C TY GF
TGPQ C ERDYRTGP...
Mouse. F-I. 147 . . . . . QPV C ESG C LNGGR
C VAPNR C A C TY GF
TGPQ C ERD...
bovine F-I. 147 . . .
. . QPV C ESG C LNGGR C VAPNR C A C TY GF TGPQ C ERD...
human. FII. 177 . . . . . QPV C ENG C QNGGR
C IA-QP C A C VY GF
TGPQ C ERDY...
Mouse. F-I. 247 . ...VDE C QAIPGM C QGGN C INTVGSFE C K C PA GH KFNEVSQK C E
human. F-I. 288 . . DIDE C STIPGI C EGGE C TNTVSSYF C K C
PP GF YTSPDGTR C IDVRPGY...
Mouse. F-I. 288 . . DIDE C STIPGV C DGGE C TNTVSSYF C K C
PP GF YTSPDGTR C VD...
bovine F-I. 288 ... DIDE
C STIPGI C DGGE C TNTVSSYF C K C PP GF
YTSPDGTR C IDV...
human. FII. 317 ... DIDE C SIIPGI C ETGE C SNTVGSYF C V C
PR GY VTSTDGSR C IDQ...
human. F-I. 449 ....VTDY C QLVRYL C QNGR
C IPTPGSYR C E C NK GF
QLDL-RGE C I
Mouse. F-I. 449 ....VTDY C QLVRYL C QNGR
C IPTPGSYR C E C NK GF
QLDI-RGE C I
bovine F-I. 449 ....VTDY
C QLFRYL C QNGR C IPTPGSYR C E C
NK GF QLDL-RGE C I
human. FII. 493 ....TIDI C KHHANL C LNGR
C IPTVSSYR C E C NM GY
KQDA-NGD C I
human. F-I. 490 . . . DVDE C EKNP C AGGE C INNQGSYT C Q C
RA GY QSTLTRTE C R
Mouse. F-I. 490 . . . DVDE C EKNP C TGGE C INNQGSYT C H C
RA GY QSTLTRTE C R
bovine F-I. 490 . . .
DVDE C EKNP C AGGE C INTQGSYT C Q C
RP GY QSTLTRTE C R
human. FII. 534 . . . DVDE C TSNP C TNGD
C VNTPGSYY C K C HA GF
QRTPTKQA C I
human. F-I. 530 . . DIDE C LQNGRI C NNGR
C INTDGSFH C V C NA GF
HVTRDGKN C E
Mouse. F-I. 530 . . DIDE C LQNGRI C NNGR
C INTDGSFH C V C NA GF
HVSSEGKN C E
bovine F-I. 530 . . DIDE
C LQNGRI C NNGR C INTDGSFH C V C
NA GF HVTRDGKN C E
human. FII. 574 . . DIDE C IQNGVL C KNGR
C VNSDGSFQ C I C NA GF
ELTTDGKN C V
human. F-I. 572 . .. DMDE C SIRNM C LNGM
C INEDGSFK C I C KP GF
QLASDGRY C K...
Mouse. F-I. 572 . .. DMDE C RTPNM C PNGM
C INEDGSFK C I C KP GF
QLASDGRY C K
bovine F-I. 572 . ..
DMDE C SIRNM C LNGM C INEDGSFK C I C KP GF QLASDGRY C
KD...
human. FII. 616 . .. DHDE C TTTNM C LNGM
C INEDGSFK C I C KP GF
VLAPNGRY C T
Mouse. F-I. 613 . . .dine c etpgi c mngr c vntdgsyr c e c fp gl
avgldgrw c vdth...
human. FII. 657 . . .dvde c qtpgi c mngh c insegsfr c d c pp gl
avgmdgrv c vdth...
human. F-I. 723 ....DINE C ALDPDI C PNGI
C ENLRGTYK C I C NS GY
EVDSTGKN C V
bovine F-I. 723 ....DINE
C ALDPDI C PNGI C ENLRGTYK C I C
NS GY EVDSTGKN C V
Mouse. F-I. 721...GTDINE
C ALDPDI C PNGI C ENLRGTYK C I C
NS GY EVDITGKN C V
human. FII. 767 ....DINE C ALDPDI C ANGI
C ENLRGSYR C N C NS GY
EPDASGRN C I
human. F-I. 765 . . DINE C VLNSLL C DNGQ
C RNTPGSFV C T C PK GF
IYKPDLKT C EDIDE...
bovine F-I. 765 . . DINE
C VLNSLL C DNGQ C RNTPGSFV C T C
PK GF IYKPELKT C ED...
Mouse. F-I. 765 . . DINE C VLNSLL C DNGQ
C RNTPGSFV C T C PK GF
VYKPDLKT C E...
human. FII. 809 . . DIDE C LVNRLL C DNGL
C RNTPGSYS C T C PP GY
VFRTETET C E...
Mouse. F-I. 807 . . ..dide c essp c ingv c knspgsfi c e c sp es
tldptkti c ietikgtcwqtv...
human. FII. 851 . ... dine c esnp c vnga c rnnlgsfn c e c sp gs
klsstgli c id...
human. F-I. 911 . ...ide c evfpgv c kngl c vntrgsfk c q c ps gm
tldatgri c ldi...
Mouse. F-I. 911 . ...ine c evfpgv c kngl c vnsrgsfk c e c pn gm
tldatgri c ldir...
Bovine F-I. 911 . ...ide
c evfpgv c kngl c vnskgsfk c q c ps gm tldatgri c ldir...
Human. F-I. 955 . ...vne c evfpgv c pngr c vnskgsfh c e c pe gl
tldgtgrv c ldirme...
human. F-I 1028
...DINE C KMIPSL C THGK C RNTIGSFK C
R C DS GF ALDSEERN
C T
Mouse. F-I 1029 . ...INE
C KMIPSL C THGK C RNTIGSFK C R C
DS GF ALDSEERN C T
bovine F-I 1028 ...DINE C KMIPNL C THGK
C RNTIGSFK C R C DS GF
ALDSEERN C T
human. FII 1072 ...DINE
C KAFPGM C TYGK C RNTIGSFK C R C
NS GF ALDMEERN C T
human. F-I 1070
. . DIDE C RISPDL C GRGQ C VNTPGDFE
C K C DE GY ESGFMMMKN
C MDIDE...
Mouse. F-I 1070 . . DIDE
C RISPDL C GRGQ C VNTPGDFE C K C
DE GY ESGFMMMKN C M
bovine F-I 1070 . . DIDE C RISPDL C GRGQ C VNTPGDFE C K C
DE GY ESGFMMMKN C MD...
human. FII 1114 . . DIDE
C RISPDL C GSGI C VNTPGSFE C E C FE GY
ESGFMMMKN C M
Mouse. F-I 1113
. . DIDE C QRDPLL C RGGI C HNTEGTYR C E C PP GH QLSPNISA C I
human. FII 1157 . . DIDG
C ERNPLL C RGGT C VNTEGSFQ C D C PL GH ELSPSRED C V
human. F-I 1155 . . DINE
C ELSAHL C PNGR C VNLIGKYQ C A C
NP GY HSTPDRLF C V
Mouse. F-I 1155 . . DINE
C ELSANL C PHGR C VNLIGKYQ C A C
NP GY HPTHDRLF C V
bovine F-I 1155 . . DINE C ELSAHL C PHGR C VNLIGKYQ C A C
NP GY HSTPDRLF C V
human. FII 1199 . . DINE
C SLSDNL C RNGK C VNMIGTYQ C S C
NP GY QATPDRQG C T
human. F-I 1197
. . DIDE C SIMNGG C ET-F C TNSEGSYE C S C
QP GF ALMPDQRS C T
Mouse. F-I 1197 . . DIDE
C SIMNGG C ET-F C TNSDGSYE C S C QP GF
ALMPDQRS C T
bovine F-I 1197 . . DIDE C SIMNGG C ET-F C
TNSEGSYE C S C QP GF
ALMPDQRS C T
human. FII 1241 . . DIDE
C MIMNGG C DT-Q C TNSEGSYE C S C SE GY
ALMPDGRS C A
human. F-I 1238
. . DIDE C EDNPNI C DGGQ C TNIPGEYR C L C
YD GF MASEDMKT C V
Mouse. F-I 1238 . . DIDQ
C EDNPNI C DGGQ C TNIPGEYR C L C YD GF
MASEDMKT C V
bovine F-I 1238 . . DIDE C EDNPNI C DGGQ C
TNIPGEYR C L C YD GF
MASEDMKT C V
human. FII 1282 . . DIDE
C ENNPDI C DGGQ C TNIPGEYR C L C YD GF
MASMDMKT C I
human. F-I 1280 . . DVNE
C DLNPNI C LSGT C ENTKGSFI C H C DM GY
SGKKGKTG C T
Mouse. F-I 1280 . . DVNE
C DLNPNI C LSGT C ENTKGSFI C H C DM GY
SGKKGKTG C T
bovine F-I 1280 . . DVNE C DLNPNI C LSGT C
ENTKGSFI C H C DM GY
SGKKGKTG C T
human. FII 1324 . . DVNE
C DLNSNI C MFGE C ENTKGSFI C H C
QL GY SVKKGTTG C T
human. F-I 1322
. .DINE C EIGAHN C GKHAV C TNTAGSFK
C S C SP GW IGDGIK
C T
Mouse. F-I 1222 . .DINE
C EIGAHN C GRHAV C TNTAGSFK C S C
SP GW IGDGIK C T
bovine F-I 1322 . .DINE C EIGAHN C DRHAV C TNTAGSFK C S C
SP GW IGDGIK C T
human. FII 1366 . .DVDE
C EIGAHN C DMHAS C LNIPGSFK C S C
RE GW IGNGIK C I
human. F-I 1363
. .DLDE C SNGTHM C SQHAD C KNTMGSYR
C L C KE GY TGDGFT
C T
Mouse. F-I 1363 . .DLDE
C SNGTHM C SQHAD C KNTMGSYR C L C
KD GY TGDGFT C T
bovine F-I 1363 . .DLDE C SNGTHM C SQHAD C KNTMGSYR C L C
KE GY TGDGFT C T
human. FII 1407 . .DLDE
C SNGTHQ C SINAQ C VNTPGSYR C A C
SE GF TGDGFT C S
human. F-I 1404
. ..DLDE C SENLNL C GNGQ C LNAPGGYR
C E C DM GF VPSADGKA
C E
Mouse. F-I 1404 . . DLDE
C SENLNL C GNGQ C LNAPGGYR C E C
DM GF VPSADGKA C E
bovine F-I 1404 . . DLDE C SENLNL C GNGQ C LNAPGGYR C E C
DM GF VPSADGKA C E
human. FII 1448 . . DVDE
C AENINL C ENGQ C LNVPGAYR C E C
EM GF TPASDSRS C Q
human. F-I 1446
.. . DIDE C SLPNI C VFGT C HNLPGLFR
C E C EI GY ELDRSGGN
C T
Mouse. F-I 1446 .. .
DIDE C SLPNI C VFGT C HNLPGLFR C E C EI GY ELDRSGGN C
T
bovine F-I 1446 . .. DIDE C SLPNI C VFGT C HNLPGLFR C E C
EI GY ELDRSGGN C T
human. FII 1490 . ..
DIDE C SFQNI C VSGT C NNLPGMFH C I C
DD GY ELDRTGGN C T
human. F-I 1487
.. . DVNE C LDPTT C ISGN C VNTPGSYI C D C
PP DF ELNPTRVG C VDTRSGN...
Mouse. F-I 1487 .. .
DVNE C LDPTT C ISGN C VNTPGSYT C D C
SP DF ELNPTRVG C VDTR...
bovine F-I 1487 .. . DVNE C LDPTT C ISGN C
VNTPGSYT C D C PP DF
ELNPTRVG C VDT...
human. FII 1531 .. .
DIDE C ADPIN C VNGL C VNTPGRYE C N C PP DF QLNPTGVG C
VDNRV...
human. F-I 1606 . .
DIDE C QELPGL C QGGK C INTFGSFQ C R C PT GY YLNEDTRV C D
Mouse. F-I 1606 . . DIDE
C QELPGL C QGGK C INTFGSFQ C R C PT GY
YLNEDTRV C D
bovine F-I 1606 . . DIDE C QELPGL C QGGK C
INTFGSFQ C R C PT GY
YLNEDTRV C D
human. FII 1649 . . DIDE
C QELPGL C QGGN C INTFGSFQ C E C PQ GY
YLSEDTRI C E
human. F-I 1648
. . DVNE C E-TPGI C GPGT C YNTVGNYT C I C
PP DY MQVNGGNN C MDMRRSL...
Mouse. F-I 1648 . . DVNE
C E-TPGI C GPGT C YNTVGNYT C I C PP DY MQVNGGNN C MDM...
bovine F-I 1648 . . DVNE C E-TPGI C GPGT C
YNTVGNYT C I C PP DY
MQVNGGNN C MDM...
human. FII 1691 .. .DIDE
C FAHPGV C GPGT C YNTLGNYT C I C PP EY MQVNGGHN C MDMR...
human. F-I 1766
. . DIDE C REIPGV C ENGV C INMVGSFR
C E C PV GF FYNDKLLV
C E
Mouse. F-I 1766 . . DIDE
C REIPGV C ENGV C INMVGSFR C E C
PV GF FYNDKLLV C E
bovine F-I 1766 . . DIDE C REIPGV C ENGV C INMVGSFR C E C
PV GF FYNDKLLV C E
human. FII 1807 . . DIDE
C KEIPGI C ANGV C INQIGSFR C E C
PT GF SYNDLLLV C E
human. F-I 1808
. .DIDE C QNG-PV C QRNAE C INTAGSYR
C D C KP GY RFTSTGQ
C N
Mouse. F-I 1808 . .DIDE
C QNG-PV C LRNAE C INTAGSYR C D C
KP GY RLTSTGQ C N
bovine F-I 1808 . .DIDE C QNG-PV C QRNAE C INTAGSYR C D C
KP GY RFTSTGQ C N
human. FII 1849 . .DIDE
C SNGDNL C QRNAD C INSPGSYR C E C
AA GF KLSPNGA C V
human. F-I 1849
. . DRNE C QEIPNI C SHGQ C IDTVGSFY
C L C HT GF KTNDDQTM
C L
Mouse. F-I 1850 . . DRNE
C QEIPNI C SHGQ C IDTVGSFY C L C
HT GF KTNEDQTM C L
bovine F-I 1849 . . DRNE C QEIPNI C SHGQ C IDTVGSFY C L C
HT GF KTNADQTM C L
human. FII 1891 . . DRNE
C LEIPNV C SHGL C VDLQGSYQ C I C
HN GF KASQDQTM C M
human. F-I 1891 ..
.. DINE C ERDA C GNGT C RNTIGSFN C R
C NH GF ILSHNND
C I
Mouse. F-I 1891 . ...
DINE C ERDA C GNGT C RNTIGSFN C R C NH GF ILSHNND C I
bovine F-I 1891 . ... DINE C ERDA C GNGT C RNTIGSFN C R C
NH GF ILSHNND C I
human. FII 1933 . ...
DVDE C ERHP C GNGT C KNTVGSYN C L C YP GF ELTHNND C L
human. F-I 1930
.. DVDE C ASGNGNL C RNGQ C INTVGSFQ
C Q C NE GY EVAPDGRT
C V
Mouse. F-I 1930 .. DVDE
C ATGNGNL C RNGQ C VNTVGSFQ C R C
NE GY EVAPDGRT C V
bovine F-I 1930 .. DVDE C ATGNGNL C RNGQ C INTVGSFQ C Q C
NE GY EVAPDGRT C V
human. FII 1972 .. DIDE
C SSFFGQV C RNGR C FNEIGSFK C L C
NE GY ELTPDGKN C I
human. F-I 1973
. . DINE C LLEPRK C APGT C QNLDGSYR C I C
PP GY SLQNEK C E
Mouse. F-I 1973 . . DINE
C VLDPGK C APGT C QNLDGSYR C I C PP GY
SLQNDK C E
bovine F-I 1973 . . DINE C LLDPRK C APGT C
QNLDGSYR C I C PP GY
SLQNDK C E
human. FII 2015 . . DTNE
C VALPGS C SPGT C QNLEGSFR C I C PP GY
EVKSEN C I
human. F-I 2013
. . DIDE C VEEPEI C ALGT C SNTEGSFK C L C
PE GF SLSSSGRR C QDLRMSY...
Mouse. F-I 2013 . . DIDE
C VEEPEI C ALGT C SNTEGSFK C L C PE GF
SWSSSGRR C QDL...
bovine F-I 2013 . . DIDE C VEEPEI C ALGT C
SNTEGSFK C L C PD GF
SLSSTGRR C QDL...
human. FII 2055 . . DINE
C DEDPNI C LFGS C TNTPGGFQ C L C
PP GF VLSDNGRR C F...
human. F-I 2127
....DMDE C KE-PDV C KHGQ C INTDGSYR
C E C PF GY TLA--GNE
C V
Mouse. F-I 2127 ....DMDE
C KE-PDV C RHGQ C INTDGSYR C E C
PF GY ILE--GNE C V
bovine F-I 2127 ....DMDE C KE-PDV C KHGQ C INTDGSYR C E C
PF GY ILQ--GNE C V
human. FII 2170 ....DVNE
C LESPGI C SNGQ C INTDGSFR C E C
PM GY NLDYTGVR C V
human. F-I 2166
. .. DTDE C SVGNP C GNGT C KNVIGGFE
C T C EE GF EPGPMMT
C E
Mouse. F-I 2166 . ..
DTDE C SVGNP C GNGT C KNVIGGFE C T C EE GF EPGPMMT C E
bovine F-I 2166 . .. DTDE C SVGNP C GNGT C KNVIGGFE C T C
EE GF EPGPMMT C E
human. FII 2212 . ..
DTDE C SIGNP C GNGT C TNVIGSFE C N C NE GF EPGPMMN C E
human. F-I 2206
.. . DINE C AQNPLL C AFR C VNTYGSYE
C K C PV GY VLREDRRM
C K
Mouse. F-I 2206 .. .
DINE C AQNPLL C AFR C VNTYGSYE C K C PV GY VLREDRRM C
K
bovine F-I 2206 .. . DINE C AQNPLL C AFR C VNTYGSYE C K C
PA GY VLREDRRM C K
human. FII 2252 .. .
DINE C AQNPLL C ALR C MNTFGSYE C T C
PI GY ALREDQKM C K
human. F-I 2247
. DEDE C EEGKHD C TEKQME C KNLIGTYM
C I C GP GY QRRPDGEG
C V
Mouse. F-I 2247 . DEDE
C AEGKHD C TEKQME C KNLIGTYM C I C
GP GY QRRPDGEG C I
bovine F-I 2247 . DEDE C EEGKHD C AEKQME C KNLIGTYL C I C
GP GY QRRPDGEG C V
human. FII 2293 . dlde
c aeglhd c esrgmm c knligtfm c I c pp gm arrpdgeg c v
human. F-I 2291
. . DENE C QTKPGI C ENGR C LNTRGSYT
C E C ND GF TASPNQDE
C LDNREGY...
Mouse. F-I 2291 . . DENE
C QTKPGI C ENGR C LNTLGSYT C E C
ND GF TASPTQDE C LDA...
bovine F-I 2291 . . DENE C QTKPGI C ENGR C LNTRGSYT C E C
ND GF TASPNQDE C LDN...
human. FII 2337 . . DENE
C RTKPGI C ENGR C VNIIGSYR C E C
NE GF QSSSSGTE C LDN...
human. F-I 2402
. . DIDE C KVIHDV C RNGE C VNDRGSYH
C I C KT GY TPDITGTS
C V
Mouse. F-I 2402 . . DVDE
C KVIHDV C RNGE C VNDRGSYH C I C
KT GY TPDITGTS C V
bovine F-I 2402 . . DIDE C KVIHDV C RNGE C VNDRGSYH C I C
KT GY TPDITGTA C V
human. FII 2448 . . DIDE
C KVMPNL C TNGQ C INTMGSFR C F C
KV GY TTDISGTS C I
human. F-I 2444
. .. DLNE C NQAPKP C NFI C KNTEGSYQ
C S C PK GY ILQEDGRS
C K
Mouse. F-I 2444 . ..
DLNE C NQAPKP C NFI C KNTEGSYQ C S C PN GY ILQEDGRS C
K
bovine F-I 2444 . . .DLNE C NQAPKP C NFI C KNTEGSYQ C S C
PK GY ILQEDGRS C K
human. FII 2490 . . .DLDE
C SQSPKP C NYI C KNTEGSYQ C S C
PR GY VLQEDGKT C K
human. F-I 2485
. . .DLDE C ATKQHN C QFL C VNTIGGFT
C K C PP GF TQHHTS
C I
Mouse. F-I 2485 . . .DLDE
C ATKQHN C QFL C VNTIGGFT C K C
PP GF TQHHTA C I
bovine F-I 2485 . . .DLDE C ATKQHN C QFL C VNTIGSFT C K C
PP GF TQHHTA C I
human. FII 2531 . . .DLDE
C QTKQHN C QFL C VNTLGGFT C K C
PP GF TQHHTA C I
human. F-I 2524
. .DNNE C TSDINL C GSKGI C QNTPGSFT
C E C QR GF SLDQTGSS
C E
Mouse. F-I 2524 . .DNNE
C TSDINL C GSKGI C QNTPGSFT C E C
QR GF SLDQSGAS C E
bovine F-I 2524 . .DNNE C TSDINL C GSKGI C QNTPGSFT C E C
QR GF SLDPTGAS C E
human. FII 2570 . .DNNE
C GSQPLL C GGKGI C QNTPGSFS C E C
QR GF SLDATGLN C E
human. F-I 2567
. ... DVDE C EGNHR C QHG C QNIIGGYR
C S C PQ GY LQHYQWNQ
C V
Mouse. F-I 2567 .. .
.DVDE C EGNHR C QHG C QNIIGGYR C S C PQ GY LQHYQWNQ C
V
bovine F-I 2567 . .. .DVDE C EGNHR C QHG C QNIIGGYR C S C
PQ GY LQHYQWNQ C V
human. FII 2613 . . ..DVDE
C DGNHR C QHG C QNILGGYR C G C
PQ GY IQHYQWNQ C V
human. F-I 2607
. .. DENE C LSAHI C GGAS C HNTLGSYK C M C
PA GF QYEQFSGG C Q
Mouse. F-I 2607 . . .DENE
C LSAHV C GGAS C HNTLGSYK C M C PA GF
QYEQFSGG C Q
bovine F-I 2607 . . .DENE C LSAHI C GGAS C
HNTLGSYK C M C PA GF
QYEQFSGG C Q
human. FII 2653 . . .DENE
C SNPNA C GSAS C YNTLGSYK C A C PS GF
SFDQFSSA C H
human. F-I 2648
. . .DINE C GSAQAP C SYG C SNTEGGYL
C G C PP GY FRIGQGH
C VSGM...
Mouse. F-I 2648 . . .DINE
C GSSQAP C SYG C SNTEGGYL C G C
PP GY FRIGQGH C LSGM
bovine F-I 2648 . . .DINE C GSAQAP C SYG C SNTEGGYL C A C
PP GY FRIGQGH C VSGM...
human. FII 2694 . . .DVNE
C SSSKNP C NYG C SNTEGGYL C G C
PP GY YRVGQGH C VSGMG...
Mamalian MEGFs (multi-EGF motif proteins of unknown function)
hMEGF1 .169 .......C
LHSDY C SQNT C LNGGK C SWTHGAGYV C K C PP QF
SGKH C EQG
hMEGF1 .211 ..........
.REN C TFAP C LEGGT C ILSPKGAS C N C PH PY TGDR C EME...
hMEGF2 . .1 . ..................
P C PPHAD C RDLWQTFS C T C QP GY
YGPG C V
hMEGF2 . 30 ....... .
DA C LLNP C QNQGS C RHLPGAPHGYT C D C VG GY FGHH C EHRM
rMEGF3 .453 ...DDNI
C LREP C ENYMR C VSVLRFDSSAPF
rMEGF3 .481........................IASSSVLFRPIHPVGGLR
C R C PP GF TGDY
C ET
rMEGF3 .512 ...........EVDL
C YSRP C GPHGH C RSREGGYT C L C
RD GY TGEH C EVSA
rMEGF3 .553 .........
RSGR C TPGV C KNGGT C VNLLVGGFK C D C PS GD
FEKPF C QVTTR...
rMEGF3 .800 ........
...KNV C DSNT C HNGGT C VNQWDAFS C E C PL GF GGKS C AQEMAN..
rMEGF3 1021 .......... ...P C DSNP C PTNSY
C SNDWDSYS C S C DP GY
YGDN C T
rMEGF3 1055 ......... NV C DLNP C EHQSA C TRKPSAPHGYI
C E C LP NY LGPY C ETR...
hMEGF4 .131 .............L
C LSSP C QNQGT C HNDPLEVYR C A C PS GY
KGRD C EV
rMEGF4 .927 ............DP
C LSSP C QNQGT C HNDPLEVYR C T C PS GY
KGRN C EV
hMEGF4 .167 ........SLDS
C SSGP C ENGGT C HAQEGEDAPFT C S C
PT GF EGPT C GV
rMEGF4 .964 ........SLDS
C SSNP C GNGGT C HAQEGEDAGFT C S C
PS GF EGLT C GM
hMEGF4 .208 ...........NTDD
C VDHA C ANGGV C VDGVGNYT C Q C
PL QY EGKA C EQ
rMEGF4 1005 ...........NTDD C VKHD C VNGGV C VDGIGNYT C Q C
PL QY TGRA C EQ
hMEGF4. 246 ...........LVDL
CSPDLNPC QHEAQ C VGTPDGPR C E C MP GY
AGDN C SE
rMEGF4 1043 ...........LVDF CSPDLNPC QHEAQ
C VGTPEGPR C E C VP GY
TGDN C SK
hMEGF4 .286 ...........NQDD
C RDHR C QNGAQ C MDEVNSYS C L C
AE GY SGQL C EIPPHLPA
rMEGF4 1083........... NQDD C KDHQ C QNGAQ C VDEINSYA C L C
AE GY SGQL C EIPPAP
hMEGF4 .330 ...........PKSP
C EGTE C QNGAN C VDQGNRPV C Q C
LP GF GGPE C EKLL...
rMEGF4 1025 ............RNS C EGTE C QNGAN C VDQGSRPV C Q C
LP GF GGPE C EKLLSVNF...
hMEGF4 .544 ..............P
C RKLY C LHGI C QPNATPGPM C H C
EA GW VGLH C DQP
rMEGF4 1331 .......GVVPGCEP C RKLY C LHGI C QPNATPGPV C H C
EA GW GGLH C DQP
hMEGF4 .580 ...........ADGP
C HGHK C VHGQ C VPLDALSYS C Q C
QD GY SGAL C NQAGA
rMEGF4 1374 ...........VDGP C HGHK C VHGK C VPLDALAYS C Q C
QD GY SGAL C NQVGA
hMEGF4 .621 ...........LAEP
C RGLQ C LHGH C QASGTKGAH C V C
DP GF SGEL C EQGQGPPS
rMEGF4 1415 ...........VAEP C GGLQ C LHGH C QASATRGAH C V C
SP GF SGEL C EQES
hMEGF5 135........... ...A C LSSP C KNNGT C TQDPVELYR C A C
PY SY KGKD C TVP
hMEGF5 171...........INT C IQNP C QHGGT
C HLSDSHKDGFS C S C PL GF
EGQR C EINPD
hMEGF5 214 ...............D C EDND C ENNAT
C VDGINNYV C I C PP NY
TGEL C DEV
hMEGF5 250 ...........IDH C VPELNL C QHEAK
C IPLDKGFS C E C VP GY
SGKL C ETDND
hMEGF5 293 ...............D C VAHK C RHGAQ C VDTINGYT C T C
PQ GF SGPF C EHPPPM
hMEGF5 332 .........VLLQTSP C DQYE C QNGAQ C IVVQQEPT C R C
PP GF AGPR C EKLITVN...
hMEGF5 544 .........VSPG C KS C TV C KHGL C RSVEKDSVV C E C
RP GW TGPL C DQE
hMEGF5 484............. ARDP C LGHR C HHGK C VATGTSYM C K C
AE GY GGDL C DNKND
hMEGF5 524 ............SANA C SAFK C HHGQ C HISDQGEPY C L C
QP GF SGEH C Q...
hMEGF6 ..1...................
GL C W C QHGAP C DPISGR C L C
PA GF HGHF C E
hMEGF6..31.. RG
C EPGSFGEG C HQR C D C DGGAP C DPVTGL C L C PP GR SGAT C N
hMEGF6 .74...LD
C RRGQFGPS C TLH C D C GGGAD C DPVSGQ C H C
VD GY MGPT C REGGPLRLPEN...
hMEGF7 ..25 ...........GEEN
C NVNNGG C AQK C QMVRGAVQ C T C
HT GY RLTEDGHT C Q
hMEGF7 ..66 ............DVNE
C AEEGY C SQG C TNSEGAFQ C W C ET GY
ELRPDRRS C K...
hMEGF7 .369............GKNR
C GDNNGG C THL C LPSGQNYT C A C
PT GF RKISSHA C A...
rMEGF7 ..91 ...........GKNR
C GDNNGG C THL C LPSGQNYT C A C
PT GF RKINSHA C AQSL...
hMEGF7 .673 .........VSTP
C AMENGG C SHL C LRSPNPSGFS C T C PT GI NLLSDGKT
C SPGMNS...
rMEGF7 .395......
...VTTP C AVENGG C SHL C LRSPSPSGFS C T C PT
GI NLLLDGKT C SPGM...
hMEGF7 .980........
...GFNK C GSRNGG C SHL C LPRPSGFS C A C PT
GI QLKGDGKT C D...
rMEGF7 .702........
...GFNK C GSRNGG C SHL C LPRPSGFS C A C PT
GI QLKGDGKT C DPSP...
hMEGF7 1285 ...........GTNA C GVNNGG
C THL C FARASDFV C A C PD EP DSQP C S...
rMEGF7 1006 ...........GTNA C GVNNGG
C SHL C FARASDFV C A C PD EP DSHP C SLVP...
hMEGF8 ...1..........DGISH
C NRTCLED C GHGV C SGPPDFT C V C
DL GW TSDLPPPTPAPGPPAPR C S...
hMEGF8 .296 ......SAR
C GSGGPGS C PVPQE C VPQDGAAGAGL C R C PQ GW AGPH C R
hMEGF8 .333 ............MAL
C PEN C NAHTGAGT C NQSLGV C I C AE GF
GGPD C A...
hMEGF8 1071 ..........EDE C ANGHHD
C NETQN C HDQPHGYE C S C KT GY
TMDNMTGL C
rMEGF8 .206 .........PEDE
C ANGHHD C NETQN C HDQPHGYE C S C
KT GY TMDNVTGV C R
hMEGF8 1112................. RPV C AQG C VNGS C VEPDH C R C HF
GF VGRN C S...
rMEGF8 .250 ..................PV
C AQG C VNGS C VEPDH C R C
HF GF VGRN C ST...
Drosophila Delta Protein
and Delta-like (DLK) Homologues
Human Small-cell Lung Carcinoma; Neuroendocrine Tumor Homeotic Protein;
Mouse Preadipocyte factor 1
(Mouse-DLK; Mouse SCP-1; Mouse PREF-1)
Drosophila 227 ...........HIPK C AKG
C E--HGH C DKPNQ C V C
QL GW KGAL C N
human dlk ..21 ..........TYGAE
C FPA C NPQNGF C EDDNV C R C
HV GW QGPL C D
mouse dlk ..21 ..........TYGAE
C DPP C DPQYGF C EADNV C R C HV GW
EGPL C D
Drosophila 259 ..............E C VLEPN
C IHGT C NKPWT C I C
NE GW GGLY C NQ
human dlk ..56 ..............Q
C VTSPG C LHGL C GEPGQ C I C
TD GW DGEL C DRD
mouse dlk ..56 ..............K
C VTAPG C VNGV C KEPWQ C I C
KD QW DGKF C EID
Drosophila 291 ......DLNY C TNHRP C
KNGGT C FNTGEGLYT C K C
AP GY SGDD C EN
human dlk ..89 .......VRA
C S-SAP C ANNGT C VSLDGGLYE C S C
AP GY SGKD C QKK
mouse dlk ..89 .......VRA
C T-STP C ANNGT C VDLEKGQYE C S C
TP GF SGKD C QHK
Drosophila 331 ...EIYS C DADVNP C QNGGT C IDEPHTKTGYK C H C
RN GW SGKM C EE
human dlk .127 ....DGP
C VINGSP C QHGGT C VDDEGRASHAS C L C
PP GF SGNF C EIV
mouse dlk .128 ....AGP
C VINGSP C QHGGA C VDDEGQASHAS C L C
PP GF SGNF C EIV
Drosophila 372 ..KVLT C SDKP C HQGI C RNVRPGLGSKGQGYQ C E C PI GY SGPN C DL
human dlk .171 .........ANS
C TPNP C ENDGV C TDIGGDFR C R C PA GF
IDKT C SRP
mouse dlk .171 .......AATNS
C TPNP C ENDGV C TDIGGDFR C R C PA GF
VDKT C SRP
Drosophila 411 ...........QLDN C SPNP C INGGS C QPSGK C I C PS GF SGTR C ET
Drosophila 453 ........NIDD C LGHQ C
ENGGT C IDMVNQYR C Q C
VP GF HGTH C SS
Drosophila 491 ........KVDL C LIRP C ANGGT C LNLNNDYQ C T C
RA GF TGKD C SV
Drosophila 529 ........DIDE C SSGP C HNGGT C MNRVNSFE C V C
AN GF RGKQ C DEESYD...
human dlk .209 ........VTN
C ASSP C QNGGT C LQHTQVSYE C L C
KP EF TGLT C VKKRAL
mouse dlk .211 ........VSN
C ASGP C QNGGT C LQHTQVSFE C L C
KP PF MGPT C AKKR
Return-to-Table-of-Contents
Verterbrate Delta Homologues
Z-fish ....166.....
...FGEA C SDY C RPRDDTLGHYT C DENGNKE C LV GW
QGDY C SD
Mouse .....223 .............PI
C LPG C DDQHGY C DKPGE C K C
RV GW QGRY C D
Rat .......223 .............PI
C LPG C DDQHGY C DKPGE C K C
RV GW QGRY C D
Chicken ...230 ............EPI
C LPG C DEQHGF C DKPGE C K C
RV GW QGRY C D
Z-fish ....206 .............PI
C SSD C SERHGY C ESPGE C K C
RL GW QGPS C S
Mouse .....255 ..............E
C IRYPG C LHGT C QQPWQ C N C
QE GW GGLF C NQD
Rat .......255 ..............E
C IRYPG C LHGT C QQPWQ C N C
QE GW GGLF C NQD
Chicken ...263 ..............E
C IRYPG C LHGT C QQPWQ C N C
QE GW GGLF C NQD
Z-fish ....238 ..............E
C VHYPG C LHGT C SQPWQ C V C
KE GW GGLF C NQD
Mouse .....287 .......LNY
C THHKP C RNGAT C TNTGQGSYT C S C
RP GY TGAN C ELE
Rat .......288 .......LNY
C THHKP C RNGAT C TNTGQGSYT C S C
RP GY TGAN C ELE
Chicken... 296 .......LNY
C THHKP C KNGAT C TNTGQGSYT C S C
RP GY TGSS C EIE
Z-fish ....271 .......LNY
C TNHKP C ANGAT C TNTGQGSYT C T C
RP GF GGTN C ELE
Mouse .....328 .........VDE
C APSP C KNGAS C TDLEDSFS C T C
PP GF YGKV C ELSA
Rat ...v...328 .........VDE
C APSP C RNGGS C TDLEDSYS C T C
PP GF YGKV C ELSA
Chicken ...336 .........INE
C DANP C KNGGS C TDLENSYS C T C
PP GF YGKN C ELS
Z-fish ....311 .........INE
C DCNP C KNGGS C NDLENDYS C T C
PQ GF YGKN C EII
Mouse .....367 .........MT
C ADGP C FNGGR C SDNPDGGYT C H C
PL GF SGFN C EKK
Rat .......367 .........MT
C ADGP C FNGGR C SDNPDGGYT C H C
PA GF SGFN C EKK
Chicken ...374 ........AMT
C ADGP C FNGGR C TDNPDGGYS C R C
PL GY SGFN C EKK
Z-fish ....349 ........AMT
C ADDP C FNGGT C EEKFTGGYV C R C
PP TF TGSN C EKR
Mouse .....405 .........MDL
C GSSP C SNGAK C VDLGNSYL C R C
QA GF SGRY C EDN
Rat .......405 .........IDL
C SSSP C SNGAK C VDLGNSYL C R C
QT GF SGRY C EDN
Chicken ...413 .........IDY
C SSSP C ANGAQ C VDLGNSYI C Q C
QA GF TGRH C DDN
Z-fish ....388 .........LDR
C SHKP C ANGGE C VDLGASAL C R C
RP GF SGSR C ETN
Mouse .....443 .........VDD
C ASSP C ANGGT C RDSVNDFS C T C
PP GY TGKN C SAP
Rat .......443 .........VDD
C ASSP C ANGGT C RDSVNDFS C T C
PP GY TGRN C SAP
Chicken ...451 .........VDD
C ASFP C VNGGT C QDGVNDYS C T C
PP GY NGKN C STP
Z-fish ....426 .........IDD
C ARYP C QNAGT C QDGINDYT C T C TL GF
TGKN C SLR
Mouse .....481 .........VSR
C EHAP C HNGAT C HQRGQRYM C E C
AQ GY GGPN C QFLLP...
Rat .......481 .........VSR
C EHAP C HNGAT C HQRGQRYM C E C
AQ GY GGAN C QFLLP...
Chicken ...489 .........VSR
C EHNP C HNGAT C HERSNRYV C E C
AR GY GGLN C QFLLPEP...
Z-fish ....465 .........ADA
C LTNP C LHGGT C FTHFSGPV C Q C
VP GF MGST C EF...
Homologues to Drosophila
Notch Protein
Human .......20 .......RGPR
C SQPGET C LNGGK C EAANGTEA- C V C
GG AF VGPR C Q
Mouse .......20 .......RGLR
C SQPSGT C LNGGR C EVASGTEA- C V A SG SF VGQR C QD
Rat .........20 .......RGLR
C SQPSGT C LNGGR C EVANGTEA- C V C
SG AF VGQR C QDP
Zebrafish ...21 .......QGQR
C SE---Y C QNGGI C EYKPSGEAS C R C
PA DF VGAQ C QPN
Xenopus .....17 .......QGLR
C TQTAEM C LNGGR C EMTPGGTGV C L C
GN LY FGER C QF
Drosophila ..57 ......LVAAS
C TS-VG- C QNGGT C VTQLNGKTY C A C
DS HY VGDY C EH
Human .......59 ......DPNP
C LSTP C KNAGT C HVVDRRGVADYA C S C AL GF SGPL C LTP
Mouse .......60 .......PNP
C LSTR C KNAGT C YVVDHGGIVDYA C S C PL GF SGPL C LTPL
Rat ......./.61 ........SP
C LSTP C KNAGT C YVVDHGGIVDYA C S C PL GF SGPL C LTP
Zebrafish ...61 .........P
C NPSP C RNGGV C RPQMQGNEVGVK C D C
VL GF SDRL C LTP
Xenopus .....59 .......PNP
CTIKNQ C MNFGT C EPVLQGNAIDFI C H C PV GF TDKV C LT
Drosophila ..97 .......RNP
C NSMR C QNGGT C QVTFRNGHPGIS C K C
PL GF DESL C EI
Human ......102 ........LDNA
C LTNP C RNGGT C DLLTL-TEYK C R C
PP GW SGKS C Q
Mouse ......103 .........DKP
C LANP C RNGGT C DLLTL-TEYK C R C
SP GW SGKS C QQ
Rat ........102 ........LANA
C LANP C RNGGT C DLLTL-TEYK C R C
PP GW SGKS C Q
Zebrafish ..101 ........VNHA
C MNSP C RNGGT C SLLTLDT-FT C R C
QP GW SGKT C Q
Xenopus ....101 .......PVDNA
C VNNP C RNGGT C ELLNSVTEYK C R C
PP GW TGDS C QQ
Drosophila .138 .......AVPNA
C DHVT C LNGGT C QLKTLE-EYT C A C
AN GY TGER C ET
Human ......140 ........QADP
C ASNP C ANGGQ C LPFE--ASYI C H C
PP SF HGPT C RQ
Mouse ......141 .........ADP
C ASNP C ANGGQ C LPFE--SSYI C R C
PP GF HGPT C RQD
Rat ........140 ........QADP
C ASNP C ANGGQ C LPFE--SSYI C G C
PP GF HGPT C RQDVN
Zebrafish ..139 ........LADP
C ASNP C ANGGQ C SAFE--SHYI C T C
PP NF HGQT C RQ
Xenopus ....142 .........ADP
C ASNP C ANGGK C LPFE--IQYI C K C
PP GF HGAT C KQDI
Drosophila .178 .........KNL
C ASSP C RNGAT C TALAGSSSFT C S C
PP GF TGDT C SYD
Human ......178 ........DVNE
C GQKPRL C RHGGT C HNEVGSYR C V C RA TH TGPN
C ER
Mouse ......179 .........VNE
C SQNPGL C RHGGH C HNEIGSYR C A C CA TH TGPH
C ELP
Rat ........181 ...........E
C SQNPGL C RHGGT C HNEIGSYR C A C RA TH TGPH
C EL
Zebrafish ..177 ........DVNE
C AVSPSP C RNGGT C INEVGSYL C R C
PP EY TGPH C QR
Xenopus ....181 ..........NE
C SQNP-- C KNGGQ C INEFGSYR C T C
QN RF TGRN C DEP
Drosophila .218 .........IEE
C QSNP-- C KYGGT C VNTHGSYQ C M C PT GY TGKD C DTK
Human ......218 .........PYVP
C SPSP C QNGGT C RPTGDVTHE C A C
LP GF TGQN C EE
Mouse ......219 ..........YVP
C SPSP C QNGAT C RPTGDTTHE C A C
LP GF AGQN C EEN
Rat ........218 .........PYVP
C SPSP C QNGGT C RPTGDTTHE C A C
LP GF AGQN C EE
Zebrafish ..217 .........LYQP
C LPSP C RSGGT C VQTSDTTHT C S C LP GF
TGQT C EH
Xenopus ....218 ..........YVP
C NPSP C LNGGT C RQTDDTSYD C T C
LP GF SGQN C EEN
Drosophila .256 ..........YKP
C SPSP C QNGGI C RSNG-LSYE C K C
PK GF EGKN C EQN
Human ......257 ..........NIDD
C PGNN C KNGGA C VDGVNTYN C P C
PP EW TGQY C TE
Mouse ......258 ...........vdd
c pgnn c kNGga c vdgvntyn c r c pp ev tgqy
c ted
Rat ........257 ..........NVDD
C PGNN C KNGGA C VDGVNTYN C R C
PP EW TGQY C TED
Zebrafish ..256 ..........NVDD
C TQHA C ENGGP C IDGINTYN C H C
DK HW TGQY C TE
Xenopus ....257 ...........IDD
C PSNN C RNGGT C VDGVNTYN C Q C
PP DW TGQY C TED
Drosophila .294 ...........YDD
C LGHL C QNGGT C IDGISDYT C R C
PP NF TGRF C QDD
Human ......295 .......DVDE
C -QLMPNA C QNGGT C HNTHGGYN C V C
VN GW TGED C SE
Mouse ......296 ........VDE
C -QLMPNA C QNAGT C HNTHGGYN C V C VN GW TGED C SEN
Rat ........296 ........VDE
C -QLMPNA C QNAGT C HNSHGGYN C V C VN GW TGED C SD
Zebrafish ..294 .......DVDE
C -ELSPNA C QNGGT C HNTIGGFH C V C
VN GW TGDD C SE
Xenopus ....295 ........VDE
C -QLMPNA C QNGGT C HNTYGGYN C V C
VN GW TGED C SEN
Drosophila .332 ........VDE
C AQRDHPV C QNGAT C TNTHGSYS C I C
VN GW AGLD C SNN
Human ......335 ..........nidd
c asaa c fhgat c hdrvasfy c e c ph gr tgll c h
Mouse ......336 ...........idd
c asaa c fqgat c hdrvasfy c e c ph gr tgll c hl
Rat ........335 ..........nidd
c asaa c fqgat c hdrvasfy c e c ph gr tgll c h
Zebrafish ..334 ..........nidd
c asaa c shgat c hdrvasff c e c ph gr tgll c h
Drosophila .373 ...........tdd
c kqaa c fygat c idgvgsfy c q c tk gk tgll c h
Human ......372 ........LNDA
C ISNP C NEGSN C DTNPVNGKAI C T C PS GY TGPA C SQ
Mouse ......373 .........KHA
C ISNP C NEGSN C DTNPVNGKRI C T C PS GY TGPA C SQD
Rat ........372 ........LNDA
C ISNP C NEGSN C DTNPVNGKAI C T C PR GY TGPA C SQ
Zebrafish ..371 ........LDDA
C ISNP C QKGSN C DTNPVSGKAI C T C PP GY TGSA C NQ
Xenopus ....372 .........DNA
C ISNP C NEGSN C DTNPVNGKAI C T C PP GY TGPA C NND
Drosophila .410 .........DDA
C TSNP C HADAI C DTSPINGSYA C S C AT GY KGVD C SED
Human ......412 ........DVDE
C SLGANP C EHAGK C INTLGSFE C Q C LQ GY TGPR C EI
Mouse ......413 .........VDE
C DLGANR C EHAGK C LNTLGSFE C Q C LQ GY TGPG C EID
Rat ........412 ........DVDE
C ALGANP C EHAGK C LNTLGSFE C Q C LQ GY TGPR C EI
Zebrafish ..411 ........DIDE
C SLGANP C EHGGR C LNTKGSFQ C K C
LQ GY EGPR C EM
Xenopus ....412 .........VDE
C SLGANP C EHGGR C TNTLGSFQ C N C
PQ GY AGPR C EI
Drosophila .450 .........IDE
C DQG-SP C EHNGI C VNTPGSYR C N C
SQ GF TGPR C ETN
Human ......452 ..........DVNE
C VSNP C QNDAT C LDQIGEFQ C M C MP GY
EGVH C EV
Mouse ......453 ...........VNE
C ISNP C QNDAT C LDQIGEFQ C I C MP GY
EGVY C EIN
Rat ........452...........DVNE
C ISNP C QNDAT C LDQIGEFQ C I C MP GY
EGVY C EI
Zebrafish ..451 ..........DVNE
C KSNP C QNDAT C LDQIGGFH C I C MP GY
EGVF C QI
Xenopus ....451 ..........DVNE
C LSNP C QNDST C LDQIGEFQ C I C MP GY
EGLY C ETN
Drosophila .489 ...........INE
C ESHP C QNEGS C LDDPGTFR C V C MP GF
TGTQ C EID
Human ......490 ..........NTDE
C ASSP C LHNGR C LDKINEFQ C E C
PT GF TGHL C QY
Mouse ......491 ...........TDE
C ASSP C LHNGH C MDKIHEFQ C Q C
PK GF NGHL C QYD
Rat ........490 ..........NTDE
C ASSP C LHNGR C VDKINEFL C Q C
PK GF SGHL C QY
Zebrafish ..489 ..........NSDD
C ASQP C L-NGK C IDKINSFH C E C
PK GF SGSL C QVD
Xenopus ....490 ...........IDE
C ASNP C LHNGK C IDKINEFR C D C
PT GF SGNL C QHD
Drosophila .527 ...........IDE
C QSNP C LNDGT C HDKINGFK C S C AL GF
TGAR C QIN
Human ......528 ..........DVDE
C ASTP C KNGAK C LDGPNTYT C V C
TE GY TGTH C EV
Mouse ......529 ...........VDE
C ASTP C KNGAK C LDGPNTYT C V C
TE GY TGTH C EVD
Rat ........528 ..........DVDE
C ASTP C KNGAK C LDGPNTYT C V C
TE GY TGTH C EV
Zebrafish ..527 ...........VDE
C ASTP C KNGAK C TDGPNKYT C E C
TP GF SGIH C EL
Xenopus ....528 ...........FDE
C TSTP C KNGAK C LDGPNSYT C Q C
TE GF TGRH C EQD
Drosophila .565 ...........IDD
C QSQP C RNRGI C HDSIAGYS C E C PP GY
TGTS C EIN
Human ......566 ...........DIDE
C DPDP C HYGS C KDGVATFT C L C RP GY
TGHH C ET
Mouse ......567 ............IDE
C DPDP C HYGS C KDGVATFT C L C QP GY
TGHH C ETN
Rat ........566 ...........DIDE
C DPDP C HIGL C KDGVATFT C L C QP GY
TGHH C ET
Zebrafish ..564 ...........DINE
C ASSP C HYGV C RDGVASFT C D C RP GY
TGRL C ET
Xenopus ....566 ............INE
C IPDP C HYGT C KDGIATFT C L C RP GY
TGRL C DND
Drosophila .603 ............IND
C DSNP C HRGK C IDDVNSFK C L C DP GY
TGYI C QKQ
Human ......603 ..........nine
c ssqp c rlrgt c qdpdnayl c f c lk gt tgpn c ei
Mouse ......604 ...........ine
c hsqp c rHGgt c qdrdnsyl c l c lk gt tgpn
c ein
Rat ........603 ..........nine
c hsqp c rHGgt c qdrdnyyl c l c lk gt tgpn
c ei
Zebrafish ..601 ..........nine
c lsqp c rNGgt c qdrenayi c t c pk gt tgvn
c ei
Xenopus ....603 ...........ine
c lskp c lNGgq c tdrengyi c t c pk gt tgvn
c etk
Drosophila .640 ...........ine
c esnp c qfdgh c qdrvgsyy c q c qa gt sgkn c e
Human ......641 ..........NLDD
C ASSP C DSG-T C LDKIDGYE C A C EP GY
TGSM C NS
Mouse ......642 ...........LDD
C ASNP C DSG-T C LDKIDGYE C A C EP GY
TGSM C NVN
Rat ........641 ..........NLDD
C ASNP C DSG-T C LDKIDGYE C A C EP GY
TGSM C NV
Zebrafish ..639 ..........NIDD
C KRKP C DYG-K C IDKINGYE C V C EP GY
SGSM C NI
Xenopus ....641 ...........IDD
C ASNL C DNG-K C IDKIDGYE C T C
EP GY TGKL C NIN
Drosophila .678 ...........VNE
C HSNP C NNGAT C IDGINSYK C Q C
VP GF TGQH C EKN
Human ......678 ..........NIDE
C AGNP C HNGGT C EDGINGFT C R C
PE GY HDPT C LS
Mouse ......679 ...........IDE
C AGSP C HNGGT C EDGIAGFT C R C
PE GY HDPT C LSE
Rat ........678 ..........NIDE
C AGSP C HNGGT C EDGIAGFT C R C
PE GY HDPT C LS
Zebrafish ..676 ..........NIDD
C ALNP C HNGGT C IDGVNSFT C L C
PD GF RDAT C LS
Xenopus ....678 ...........INE
C DSNP C RNGGT C KDQINGFT C V C
PD GY HDHM C LSE
Drosophila .716 ...........VDE
C ISSP C ANNGV C IDQVNGYK C E C
PR GF YDAH C LSD
Human ......716 ..........EVNE
C NSNP C VHG-A C RDSLNGYK C D C
DP GW SGTN C DI
Mouse ......717 ...........VNE
C NSNP C IHG-A C RDGLNGYK C D C
AP GW SGTN C DI
Rat ........716 ..........EVNE
C NSNP C IHG-A C RDGLNGYK C D C
AP GW SGTN C DI
Zebrafish ..714 ..........QHNE
C SSNP C IHG-S C LDQINSYR C V C
EA GW MGRN C DI
Xenopus ....716 ...........VNE
C NSNP C IHG-A C HDGVNGYK C D C
EA GW SGSN C DIN
Drosophila .754 ...........VDE
C ASNP C VNEGR C EDGINEFI C H C PP GY
TGKR C ELD
Human ......753 ..........NNNE
C ESNP C VNGGT C KDMTSGIV C T C
RE GF SGPN C QT
Mouse ......753 ..........NNNE
C ESNP C VNGGT C KDMTSGYV C T C
RE GF SGPN C QTN
Rat ........753 ..........NNNE
C ESNP C VNGGT C KDMTSGYV C T C
RE GF SGPN C QT
Zebrafish ..751 ..........NINE
C LSNP C VNGGT C KDMTSGYL C T C
RA GF SGPN C QM
Xenopus ....753 ...........NNE
C ESNP C MNGGT C KDMTGAYI C T C
KA GF SGPN C QTN
Drosophila .792 ..........vIDE
C SSNP C QHGGT C YDKLNAFS C Q C
MP GY TGQK C ETN
Human ......791 ..........NINE
C ASNP C LNKGT C IDDVAGYK C N C LL PY
TGAT C EV
Mouse ......792 ...........INE
C ASNP C LNQGT C IDDVAGYK C N C PL PY
TGAT C EVV
Rat ........791 ..........NINE
C ASNP C LNQGT C IDDVAGYK C N C PL PY
TGAT C EV
Zebrafish ..789 ..........NINE
C ASNP C LNQGS C IDDVAGFK C N C ML PY
TGEV C EN
Xenopus ....791 ...........INE
C SSNP C LNHGT C IDDVAGYK C N C
ML PY TGAI C EAV
Drosophila .830 ...........IDD
C VTNP C GNGGT C IDKVNGYK C V C
KV PF TGRD C ESK
Human ......830 .........lap
c apsp c rNGge c rqsedyesfs c v c pt ag akgqt
c evd
Mouse ......830 .........LAP
C ATSP C KNSGV C KESEDYESFS C V C PT GW QGQT C EVD
Rat ........829 ........VLAP
C ATSP C KNSGV C KESEDYESFS C V C PT GW QGQT C EI
Zebrafish ..827 ........VLAP
C SPRP C KNGGV C RESEDFQSFS C N C
PA GW QGQT C EV
Xenopus ....829 .........LAP
C AGSP C KNGGR C KESEDFETFS C E C
PP GW QGQT C EID
Drosophila .868 .........MDP
C ASNR C KNEAK C TPSSNFLDFS C T C KL GY TGRY C DED
Human ......870 ..........DINE
C VLSP C RHGAS C QNTHGXYR C H C
QA GY SGRN C ET
Mouse ......870 ...........INE
C VKSP C RHGAS C QNTNGSYR C L C
QA GY TGRN C ESD
Rat ........869 ..........DINE
C VKSP C RHGAS C QNTNGSYR C L C
QA GY TGRN C ES
Zebrafish ..867 ..........DINE
C VRNP C TNGGV C ENLRGGFQ C R C
NP GF TGAL C EN
Xenopus ....869 ...........MNE
C VNRP C RNGAT C QNTNGSYK C N C
KP GY TGRN C EMD
Drosophila .947 ...........TDD
C ASFP C QNGGT C LDGIGDYS C L C
VD GF DGKH C ETD
Drosophila .908 ..........IDE
C SLSSP C RNGAS C LNVPGSYR C L C
TK GY EGRD C AIN
Human ......908 ..........DIDD
C RPNP C HNGGS C TDGINTAF C D C
LP GF RGTF C EE
Mouse ......908 ...........IDD
C RPNP C HNGGS C TDGINTAF C D C
LP GF QGAF C EEDINECA
Rat ........907 ..........DIDD
C RPNP C HNGGS C TDGVNAAF C D C
LP GF QGAF C EE
Zebrafish ..905 ..........DIDD
C EPNP C SNGGV C QDRVNGFV C V C
LA GF RGER C AE
Xenopus ....907 ...........IDD
C QPNP C HNGGS C SDGINMFF C N C
PA GF RGPK C EED
Drosophila .983 ...........INE
C LSQP C QNGAT C SQYVNSYT C T C
PL GF SGIN C QTN
Human ......946 ..........DINE
C ASDP C RNGAN C TDCVDSYT C T C
PA GF SGIH C EN
Mouse ......946 ...........INE
C ASNP C QNGAN C TDCVDSYT C T C
PV GF NGIH C ENN
Rat ........945 ..........DINE
C ATNP C QNGAN C TDCVDSYT C T C
PT GF NGIH C EN
Zebrafish ..943 ..........DIDE
C VSAP C RNGGN C TDCVNSYT C S C
PA GF SGIN C EI
Xenopus ....945 ...........INE
C ASNP C KNGAN C TDCVNSYT C T C
QP GF SGIH C ESNT
Drosophila 1061 ...........LNK C DSNP C LNGAT C HEQNNEYT C H C
PS GF TGKQ C SEY
Human ......984 ..........NTPD
C TESS C FNGGT C VDGINSFT C L C
PP GF TGSY C QH
Mouse ......984 ...........TPD
C TESS C FNGGT C VDGINSFT C L C
PP GF TGSY C QYD
Rat ........983 ..........NTPD
C TESS C FNGGT C VDGINSFT C L C
PP GF TGSY C QY
Zebrafish ..981 ..........NTPD
C TESS C FNGGT C VDGISSFS C V C
LP GF TGNY C QH
Xenopus ....983 ............PD
C TESS C FNGGT C IDGINTFT C Q C
PP GF TGSY C QHD
Drosophila 1023 ...........DED C TESS C LNGGS C IDGINGYN C S C
LA GY SGAN C QYK
Human......1022 ..........VVNE
C DSRP C LLGGT C QDGRGLHR C T C PQ GY
TGPN C QN
Mouse .....1022 ...........VNE
C DSRP C LHGGT C QDSYGTYK C T C
PQ GY TGLN C QNL
Rat .......1021 ..........DVNE
C DSRP C LHGGT C QDSYGTYK C T C
PQ GY TGLN C QN
Zebrafish .1019 ..........DVNE
C DSRP C QNGGS C QDGYGTYK C T C
PH GY TGLN C QS
Xenopus ...1021 ...........INE
C DSKP C LNGGT C QDSYGTYK C T C
PQ GY TGLN C QNL
Drosophila 1061 ...........LNK C DSNP C LNGAT C HEQNNEYT C H C
PS GF TGKQ C SEY
Human .....1060 ..........LVHW
C DSSP C KNGGK C WQTHTQYR C E C
PS GW TGLY C DV
Mouse .....1060 ...........VRW
C DSAP C KNGGR C WQTNTQYH C E C
RS GW TGVN C DVL
Rat .......1059 ..........LVRW
C DSAP C KNGGK C WQTNTQYH C E C
RS GW TGFN C DV
Zebrafish .1057 ..........LVRW
C DSSP C KNGGS C WQQGASFT C Q C
AS GW TGIY C DV
Xenopus ...1059 ...........VRW
C DSSP C KNGGK C WQTNNFYR C E C
KS GW TGVY C DVP
Drosophila 1099 ...........VDW C GQSP C ENGAT C SQMKHQFS C K C
SA GW TGKL C DVQ
Human .....1098 PSVS C EVAAQRQGVDVARL
C QHGGL C VDAGNTHH C R C
QA GY TGSY C ED
Mouse .....1098 .SVS
C EVAAQKRGIDVTLL C QHGGL C VDEGDKHY C H C QA GY TGSY C EDE
Rat .......1097 LSVS C EVAAQKRGIDVTLL C QHGGL C VDEEDKHY C H C
QA GY TGSY C ED
Zebrafish .1095 PSVS C EVAARQQGVSVAVL C RHAGQ
C VDAGNTHL C R C QA GY
TGSY C QE
Xenopus ...1097 .SVS
C EVAAKQQGVDIVHL C RNSGM C VDTGNTHF C R C QA
GY TGSY C EEQ
Drosophila 1137 .TIS C QDAADRKGLSLRQL C NNG-T C KDYGNSHV C Y C
SQ GY AGSY C QKE
Human .....1146 ..........LVDE
C SPSP C QNGAT C TDYLGGYS C K C
VA GY HGVN C SE
Mouse .....1146 ...........VDE
C SPNP C QNGAT C TDYLGGFS C K C
VA GY HGSN C SEE
Rat .......1145 ..........EVDE
C SPNP C QNGAT C TDYLGGFS C K C
VA GY HGSN C SE
Zebrafish .1143 ..........QVDE
C QPNP C QNGAT C TDYLGGYS C E C
VP GY HGMN C SK
Xenopus ...1145 ...........VDE
C SPNP C QNGAT C TDYLGGYS C E C
VA GY HGVN C SEE
Drosophila 1184 ...........IDE C QSQP C QNGGT C RDLIGAYE C Q C
RQ GF QGQN C ELN...
Human .....1184 ..........eide
c lshp c qNGgt c ldlpntyk c s c pr gt qgvh
c ei
Mouse .....1184 ...........ine
c lsqp c qNGgt c idltnsyk c s c pr gt qgvh
c ei
Rat .......1183 ..........eine
c lsqp c qNGgt c idltntyk c s c pr gt qgvh
c ei
Zebrafish .1181 ..........eine
c lsqp c qNGgt c idlvntyk c s c pr gt qgvh
c ei
Xenopus ...1182 ...........ine
c lshp c qNGgt c idlintyk c s c pr gt qgvh
c ein
Drosophila 1221 ...........idd c apnp c qNGgt c hdrvmnfs c s c pp gt mgii c ein
Human .....1222 ..NVDD
C NPPVDPVSRSPK C FNNGT C VDQVGGYS C T C PP GF VGER C EG
Mouse .....1221 ..NVDD
C HPPLDPASRSPK C FNNGT C VDQVGGYT C T C PP GF VGER C EG
Rat .......1221 ..NVDD
C HPPLDPASRSPK C FNNGT C VDQVGGYT C T C PP GF VGER C EG
Zebrafish .1219 ..DIDD
C SPSVDPLTGEPR C FNGGR C VDRVGGYG C V C PA
GF VGER C EG
Xenopus ...1221 ...VDD
C TPFYDSFTLEPK C FNNGK C IDRVGGYN C I C PP GF VGER C EG
Drosophila 1260 ...KDD C KP------G--A C HNNGS C IDRVGGFE C V C
QP GF VGAR C EG
Human .....1268 ........DVNE
C LSNP C DARGTQN C VQRVNDFH C E C RA GH TGRR C ES
Mouse .....1267 ........DVNE
C LSNP C DPRGTQN C VQRVNDFH C E C RA GH TGRR C ES
Rat .......1267 ........DVNE
C LSNP C DPRGTQN C VQRVNDFH C E C RA GH TGRR C ES
Zebrafish .1265 ........DVNE
C LSDP C DPSGSYN C VQLINDFR C E C RT GY TGKR
C ET
Xenopus ...1266 ........DVNE
C LSNP C DSRGTQN C IQLVNDYR C E C RQ GF TGRR
C ES
Drosophila 1297 ........DINE C LSNP C SNAGTLD
C VQLVNNYH C N C RP GH MGRH C EHK
Human .....1308 .......VING
C KGKP C KNGGT C AVASNTARGFI C K C
PA GF EGAT C EN
Mouse .....1307 .......VING
C RGKP C KNGGV C AVASNTARGFI C R C
PA GF EGAT C EN
Rat .......1307 .......VING
C RGKP C RNGGV C AVASNTARGFI C R C
PA RF EGAT C EN
Zebrafish .1305 .......VFNG
C KDTP C KNGGT C AVASNTKHGYI C K C
QP GY SGSS C EY
Xenopus ...1306 .......VVDG
C KGMP C RNGGT C AVASNTERGFI C K C
PP GF DGAT C EYD
Human .....1349 ..........DART
C GSLR C LNGGT C ISGPRSPT C L C
LG PF TGPE C QFP
Mouse .....1348 ..........DART
C GSLR C LNGGT C ISGPRSPT C L C
LG SF TGPE C QFP
Rat .......1348 ..........DART
C GSLR C LNGGT C ISGPRSPT C L C
LG SF TGPE C QFP
Zebrafish .1346 ..........DSQS
C GSLR C RNGAT C VSGHLSPR C L C
AP GF SGHE C QTR
Xenopus ...1348 ...........SRT
C SNLR C QNGGT C ISVLTSSK C V C
SE GY TGAT C QYP
Drosophila 1338 ...........VDF C AQSP C QNGGN C NIRQSGHH C I C
NN GF YGKN C ELS...
Drosophila 1376 ..........gqd c dsnp c rvgn
c vvadegfgyr c e c pr gt lgeh c eidt
Human .....1388 .......ASSP
C LGGNP C YNQGT C EPTSESPFYR C L C PA KF NGLL C HILD...
Mouse .....1387 .......ASSP
C VGSNP C YNQGT C EPTSENPFYR C L C PA KF NGLL C HIL...
Rat .......1387 .......ASSP
C VGSNP C YNQGT C EPTSESPFYR C L C PA KF NGLL C HILDYSFT...
Zebrafish .1385 .......MDSP
C LV-NP C YNGGT C QPISDAPFYR C S C
PA NF NGLL C HILDYSFS...
Xenopus ...1386 .......VISP
C A-SHP C YNGGT C QFFAEEPFFQ C F C
PK NF NGLF C HILDYEF....
Drosophila 1415 .......TLDE C SP-NP C AQGAA
C EDLL-G-DYE C L C PS KW
KGKR C DIY...
NOTCH 2 Proteins
Human ........1 ...................................YVNSYT
C K C QA GF DGVH
C ENN
Drosophila ...1 ...................................YVNSYT
C K C QA GF DGVH
C ENN
Human ........22 ...........INE
C TESS C FNGGT C VDGINSFS C L C
PV GF TGSF C LHE
Drosophila ...22 ...........INE
C TESS C FNGGT C VDGINSFS C L C
PV GF TGSF C LHE
Human ........60 ...........INE
C SSHP C LNDGT C VDGLGTYR C S C PL GY
TGKN C QTL
Drosophila ...60 ...........INE
C SSHP C LNDGT C VDGLGTYR C S C PL GY
TGKN C QTL
Human ........98 ...........VNL
C SRSP C KNKGT C VQKKASPQ C L C PS GW
VGAY C DVP
Drosophila ...98 ...........VNL
C SRSP C KNKGT C VQKKASPQ C L C PS GW
VGAY C DVP
Human ........136 NVS C DIAASRRGVLVEHL
C QHSGV C INAGNTHY C Q C PV GY
TGSY C EEQ
Drosophila ...136 NVS C DIAASRRGVLVEHL C QHSGV
C INAGNTHY C Q C PV GY
TGSY C EEQ
Human ........184 ..........LDE
C ASNP C QHGAT C SDFIGGYR C E C
VP GY QGVN C EYE
Drosophila ...184 ..........LDE
C ASNP C QHGAT C SDFIGGYR C E C
VP GY QGVN C EYE
Human ........222 ..........vde
c qnqp c qNGgt c idlvnhfk c s c pp gt rgll
c eeniddc
Drosophila ...222 ..........vde
c qnqp c qNGgt c idlvnhfk c s c pp gt rgll
c eeniddc
Mouse Notch-3
Mouse Not3 ....38 .....AAAPP
CLDGSP C ANGGR C THQQP--S-LEAA C L C
LP GW VGER C QL
Mouse Not3 ....80 .......EDP
C HSGP C AGRGV C QSSVVAGT-ARFS C R C LR GF QGPD C SQ
Mouse Not3 ...121 .......PDP
C VSRP C VHGAP C SVG-PDG---RFA C A C
PP GY QGQS C QSD
Mouse Not3 ...160 .......IDE
C RSGTTC RHGGT C LNTP--GS---FR C Q C
PL GY TGLL C ENP
Mouse Not3 ...199 .......VVP
C APSP C RNGGT C RQSS-D---VTYD C A C
LP GF EGQN C EVN
Mouse Not3 ...238 .......VDD
C PGHR C LNGGT C VDGV-----NTYN C Q C
PP EW TGQF C TED
Mouse Not3 ...276 .......VDE
CQLQPNAC HNGGT C FNLL--G---GHS C V C
VN GW TGES C SQN
Mouse Not3 ...316 .......idd
c atav c fHGat c hdr-va----sfy c a c pm gk
tgll c hl
Mouse Not3 ...353 .......DDA
C VSNP C HEDAI C DTNPVSG-R--AI C T C PP GF TGGA C DQD
Mouse Not3 ...393 .......VDE
CSIGANPC EHLGR C VNTQ--G---SFL C Q C GR GY TGPR C ETD
Mouse Not3 ...433 .......VNE
C LSGP C RNQAT C LDRI--G---QFT C I C MA GF TGTY C EVD
Mouse Not3 ...471 .......IDE
C QSSP C VNGGV C KDRVN-G----FS C T C
PS GF SGSM C QLD
Mouse Not3 ...509 .......VDE
C ASTP C RNGAK C VDQPD-G----YE C R C
AE GF EGTL C ERN
Mouse Not3 ...547 .......VDD
C SPDP C HHG-R C VDGIA-----SFS C A C
AP GY TGIR C ESQ
Mouse Not3 ...584 .......vde
c rsqp c ryggk c ldlvd---k--yl c r c pp gt tgvn c evn
Mouse Not3 ...622 .......IDD
C ASNP C TFG-V C RDGIN---R--YD C V C QP GF TGPL C NVE
Mouse Not3....659 .......ine
c assp c geggs c vdgen-g----fh c l c pp gs lppl c lpa
Mouse Not3 ...697 .......NHP
C AHKP C SHG-V C HDAPG-G----FR C V C
EP GW SGPR C SQSLA
Mouse Not3 ...736 .......PDA
C ESQP C QAGGT C TSDGI-G----FR C T C AP GF QGHQ C EV
Mouse Not3 ...773 .......LSP
C TPSL C EHGGH C ESDPD---R-LTV C S C
PP GW QGPR C QQD
Mouse Not3 ...812 .......VDE
CAGASP C GPHGT C TNLP--G---NFR C I C
HR GY TGPF C DQD
Mouse Not3 ...851 .......IDD
C DPNP C LHGGS C QDGV--G---SFS C S C
LD GF AGPR C ARD
Mouse Not3 ...889 .......VDE
C LSSP C GP-GT C TDHVA-----SFT C A C PP GY GGFH C EID
Mouse Not3 ...926 .......LPD
C SPSS C FNGGT C VDGVS-----SFS C L C
RP GY TGTH C QYE
Mouse Not3 ...964 ........DP
C FSRP C LHGGI C NPTHP-G----FE C T C
RE GF TGSQ C QNP
Mouse Not3 ..1002 .......VDW
C SQAP C QNGGR C VQT---G---AY- C I C
PP GW SGRL C DIQslp
Mouse Not3 ..1038 .c
teaaaqmgvrleql c qeggk c idk---g-r-shy c v c pe gr tgsh c ehe
Mouse Not3 ..1086.....
VDP C TA--QP C QHGGT C RGYMG-G----YV C E C PA GY AGDS C EDN
Mouse Not3 ..1124 .....ide
c as--qp c qNGgs c idlva---r--yl c s c pp gt
lgvl c ein
Mouse Not3 ..1162 EDD C DLGPSLDSGVQ C LHNGT C VDLVG-G----FR C N C
PP GY TGLH C EAD
Mouse Not3 ..1207 .....INE
C RPGA C HAAHTRD C LQDPG-G---HFR C V C HP GF TGPR C QIA
Mouse Not3 ..1248 .....LSP
C ES--QP C QHGGQ C RHSLGRGGGLTFT C H C VP PF WGLR C ERV
Mouse Not3 ..1291 .....ars
c re--lq c pvgip c qqtar-g----pr c a c pp gl sgps c rvsraspsga
Mouse Not3 ..1336 ....TNAS
C AS--AP C LHGGS C LPVQSVP---FFR C V C AP GW GGPR C ETPSAAP
Mouse Not3 ..1381 ..........EVPEEPR
C PRAA C QAKRGDQN C DRE C NTPG C GW DGGD C
SL
Mouse Notch-4
Mouse Not4 ....21.......RELL C GGSP-EP C ANGGT C -LRLSQGQGI-- C Q C AP GF LGET C QF
Mouse Not4 ....72
......KNGG - SCQALLP
- TPPSS - RSPTSPLTPHFS C T C PS GF
TGDR C QTHLEEL
Mouse Not4 ...116
.......EEL C ---PPSF
C SNGGH C YVQA---SGRPQ
C S C EP GW TGEQ C QL
Mouse Not4 ...154
.......RDF C SAN---P
C ANGGV C -LATYPQIQ---
C R C PP GF EGHT C ERD
Mouse Not4 ...232
.......KGA C --P-PGS
C LNGGT C QLVPEGHSTFHL
C L C PP GF TGLD C EMN
Mouse Not4 ...274
.......PDD C VRHQ---
C QNGAT C LD----GLDTYT
C L C PK TW KGWD
C SE D
Mouse Not4 ...312
.......IDE C EARGPPR
C RNGGT C QN----TAGSFH
C V C VS GW GGAG C EEN
Mouse Not4 ...354
........dd c aaat---
c apgst c idrv---g-sfs c l c pp gr tgll c hle
Mouse Not4 ...390
.......EDM C L---SQP
C HVNAQ C -STNPLTGST-L C I C QP GY SGST C HQD
Mouse Not4 ...430
.....LDE C QMAQQGPSP
C EHGGS C INTP--G--SFN
C L C LP GY TGSR C EAD
Mouse Not4 ...473
.......hne c l--s-qp
c hpgst c ld----llatfh c l c pp gl egrl c eve
Mouse Not4 ...549
.......MDE C -S-S-TP
C ANGGR C RDQP----GAFY
C E C LP GF EGPH C EK
Mouse Not4 ...511
.......VNE C TS-N--P
C LNQAA C HDLL----NGFQ C L C LP GF TGAR C EKD
Mouse Not4 ...586
......EVDE C LS-D--P
C PVGAS C LD--LPGA--FF C L C RP GF TGQL C EV
Mouse Not4 ...624
........pl c -tpnm--
c qpgqq c qg----qehrap c l c pd gs pg-- c vpa
Mouse Not4 ...659
.......EDN C ------P
C HHGH- C QR-------S-L
C V C DE GW TGPE C ETE
Mouse Not4 ...689
.......LGG C I---STP
C AHGGT C HPQP----SGYN
C T C PA GY MGLT C SEE
Mouse Not4 ...727
.......VTA C HS-G--P
C LNGGS C SIRPE----GYS
C T C LP SH TGRH
C QTA
Mouse Not4 ...765
.......VDH C VSA-S--
C LNGGT C VNKP--G--TFF
C L C AT GF QGLH C EEK
Mouse Not4 ...803
......TNPS C AD---SP
C RNKAT C -QDTPRGAR--- C L C SP GY TGSS C QTL
Mouse Not4 ...842
.......IDL C A---RKP
C PHTAR C LQS---GP-SFQ C L C LQ GW TGAL C DF
Mouse Not4 ...878
PLS C QMAAMSQGIEISGL C QNGGL C IDT---GS-SYF C R C PP GF
QGKL C QDN
Mouse Not4 ...927
.......MNP C ---EPNP
C HHGST C VPQP---SG-YV
C Q C AP GY EGQN C SK
Mouse Not4 ...964
......VLEA C -Q--SQP
C HNHGT C TSRP-GG---FH
C A C PP GF VGLR C E G
Mouse Not4 ..1002
......DVDE C LDRPCHP
- SGTAA C HSLA-N---AFY C Q C LP GH TGQR C EVE
Mouse Not4 ..1043
.......MDL C -Q--SQP
C SNGGS C EITT-GPPPGFT
C H C PK GF EGPT C SHK
Mouse Not4 ..1084
.......ALS C ---GIHH
C HNGGL C LPSPKPGSPP-L
C A C LS GF GGPD C LTPP
Mouse Not4 ..1127
.......ppg c --gppsp
c lhNGt c tetpglgnpgfq
c t c pp ds pgpr c qrpg...
Drosophila
Spitz protein
DM spitz ...71
......ITFPTYK
C PETFDAWY C LNDAH C FAVKIADLPVYS C E C AI GF
MGQR C EYKEIDNTY...
Drosophila Gurken
DM gurken .178
........QMLP C
SEAYNTSF C LNGGH
C FQHPMVNNTVFHS C L C
VN DY DGER C AYKSWNGD...
Drosphila Dorsal-Ventral Talloid protein
DM-DVT ....581
.................DVDE
C KFTDHG C QHL
C INTLGSYQ C G C
RA GY ELQANGKT
C EDA...
DM-DVT ....741
...............VIDVDE
C SMNNGG C QHR
C RNTFGSYQ C S C
RN GY TLAENGHN
C TETR...
Drosophila Giant Lens Protein (ARGOS)
DM argos ..366...YLFA
C SPLTRLR C QRKQP C KLFTVRKRQEFLDEVNINSL C Q C PK GH R C PSHHTQSGVIAGESFLE...
Drosophila
Serrate Protein
DM serrate 284.............EEAI C KAG- C DPVHGK C DRPGE C E C RP GW
RGPL C N
DM serrate 318................E C MVYPG C KHGS C NGSAWK C V C DT NW
GGIL C DQDLNF...
DM serrate 353...........nf c gthep c kHGgt c entapdkyr c t c ae gl sgeq c eiv
DM serrate 451...............LNGSSSSGLVSLGSLQLQQQLAPDFT
C D C AA GW TGPT C EIN
DM serrate 491 ..........NIDE C AGGP C EHGGT C IDLIGGFR C E C PP EW
HGDV C QVD
DM serrate 564 ...........VASTSLAIGP C INAKE C RNQPGSFA C I C KE GW GGVT C AEN
DM serrate 620 ............LDD C VGQ C RNGAT C IDLVNDYR C A C AS GF
TGRD C ETD
DM serrate 649 ...........IDE C ATSP C RNGGE C VDMVGKFN C I C PL GY
SGSL C EEA
DM serrate 687 ...........ken c tpsp c leg-h c lntpegyy c h c pp dr agkh c eql
DM serrate 761 ............KMAKPSGLP C SGHGS C EMSDVGTF C K C HV GH TGTF C EHN
DM serrate 799 ..........NLNE C SPNP C RNGGI C LDGDGDFT C E C MS GW
TGKR C SE
DM serrate 837 ......RATG
C YAGQ C QNGGT
C MPGAPDKALQPH C R C
AP GW TGLF C AE
DM serrate 879 ..........AIDQ C RGQP C HNGGT C ESGAGWFR C V C AQ GF
SGPD C RI...
DM serrate 918 ..........nvne c spqp c qggat c idgiggys c i c pp gr hglr c eillsdpk
Drosophila
Slit protein
HU Slit ...337
...DINIVAK C NACLSSP C KNNGT C TQDPVELYR C A C PY SY
KGKD C TV
MO Slit ...408
...DITIQAK C NPCLSNP C KNDGT C NNDPVDFYR C T C PY GF KGQD C DV
DM Slit ...909
........NA C F---EQP
C QNQAQ C VALPQREYQ C L C QP GY
HGKH C EF
HU Slit ...382 .......PINT
C IQNP C QHGGT
C HLSDSHKDGFS C S C
PL GF EGQR C EI
MO Slit ...453
.......PIHA C
ISNP C KHGGT C
HLKEGENAGFW C T C
AD GF EGEN C EV
DM Slit ...946
.......MIDA C
YGNP C RNNAT C TVLEE--GRFS C Q C AP GY
TGAR C ET
HU Slit ...423 .........NPDD C ED-ND C ENNAT C VDGINNYV C I C PP NY TGEL C DE
MO Slit ...494
.........NIDD
C ED-ND C ENNST C VDGINNYT C L C PP EY
TGEL C EE
DM Slit ...985
.........NIDD
C LGEIK C QNNAT C IDGVESYK C E C QP GF
SGEF C DT
HU Slit ...461 ........VIDH
C VPELNL C QHEAK C IPLDKGFS C E C VP GY
SGKL C
MO Slit ...532
........KLDF C
AQDLNP C QHDSK C ILTPKGFK C D C TP GY
IGEH C
DM Slit ..1024
........KIQF C
SPEFNP C ANGAK
C MDHFTHYS C D C
QA GF HGTN C TD
HU Slit ...499 ........ETDNDD
C VAHK C RHGAQ
C VDTINGYT C T C
PQ GF SGPF C
MO Slit ...570
........DIDFDD
C QDNK C KNGAH
C TDAVNGYT C V C
PE GY SGLF C EFSPPM
DM Slit ..1064
..........NIDD
C QNHM C QNGGT
C VDGINDYQ C R C
PD DY TGKY C EGHNMI...
MO Slit ...614 .......VLPRTSP C DNFD C QNGAQ C IIRINEPI C Q C LP GY LGEK C
HU Slit ...654 ......GVSPG
C KSCT--V C KHGL
C RSVEKDSVV C E C
RP GW TGPL C
MO Slit ...822
....QTGILPG C
EPCHKKV C AHGM
C QPSSQSGFT C E C
EE GW MGPL C
HU Slit ...692 .......DQEARDP
C LGHR C HHGK
C VATG-TSYM C K C
AE GY GGDL C
MO Slit ...864
.......DQRTNDP
C LGNK C VHGT
C LPINAFSYS C K C
LE GH GGVL C
HU Slit ...730 .....DNKNDSANA
C SAFK C HHGQ
C HISDQGEPY C L C
QP GF SGEH C
MO Slit ...903
.....DEEEDLFNP
C QMIK C KHGK
C RLSGVGQPY C E C
NS GF TGDS C
DM Slit ..1111 .......QTSP
C QNHE C KHGV-
C FQPNAQGSDYL C R C
HP GY TGKW C EYLTSIS
DM Slit ..1353
.......pvdp c
lenk c rrgsr c vpnsnardgyq c k c kh gq rgry c dqgegstep...
Drosophila
Crumbs protein
DM crumbs .266
.......LPDNF C
LNDP C MGHGT C
SSSPE-G-YE C R C
TA RY SGKN C QKDN
DM crumbs .306
..........SP C
AKNP C ENGGS C
LENSE-GNYQ C F C DP NH SGQH C ETEVNI
DM crumbs .347......... HPL C QTNP C LNNGA C VVIGGSGALT C E C PK GY AGAR C EVD
DM crumbs .387
.........TDE C
ASQP C QNNGS C
IDRI-NG-FS C D CSGT
GY TGAF C QTN
DM crumbs .426
.........VDE C
DKNP C LNGGR C
L-H-TYGWYT C Q C
LD GW GGEI C DR
DM crumbs .463
.........PMT C
QTQQ C FNGGT C
LDKP--IGFQ C L C
PP EY TGEL C QIAPS
DM crumbs .502
.........aps c
-aqq c pidse c vg------gk c v c kp gs sgyn c qtstgdgas
DM crumbs .541
....ALALTPIN CNATNGKC
LNGGT C SMNG----TH
C Y C AV GY SGDR C EKAEN
DM crumbs .583
.........aen c
spln c qepmv c vq------nq c l c pe nk v c
DM crumbs .611
..........NQ C
ATQP C QNGGE C
VDLPNGD-YE C K C
TR GW TGRT C GND
DM crumbs .647
.........VDE CTLHPKIC
GNG-I C -KNEKGS-YK
C Y C TP GF TGVH C DSD
DM crumbs .686
.........VDE C
LSFP C LNGAT C
-HN-KINAYE C V C
QP GY EGEN C EVD
DM crumbs .724
.........ide c
gsnp c sNGst c
idrin-n-ft c n c ip gm rgri c did
DM crumbs .764
.........IDD C
VGDP C LNGGQ C
IDQLG-G-FR C D CSGT
GY EGEN C ELN
DM crumbs .803
.........IDE C
LSNP C TNGAK C
LD--RVKDYF C D C
HN GY KGKN C EQD
DM crumbs .841
.........ine c
esnp c qyNGn c
le-.. -gye c v
c vp gi igkn c ein
..................................-rsnitlyqmsritdlpkvfsqpfsfenas-
DM crumbs .905
..........INE
C DS-NP C SKHGN
C NDGIGTYT C E C
EP GF EGTH C EIN
DM crumbs .943
..........IDE
C DRYNP C -QRGT C YDQIDDYD C D C DA NY
GGKN C SVL
DM crumbs .981
......LKG C DQNP
C LNGGA C LPYLINEVTHLYT
C T C EN GF QGDK C EKTTTLSMVA...
DM crumbs.1206
.........PRTEQ
C KPNP C HSNVE C TDLWHTFA C H C PR PF
FGHT C QHNMTAAT...
DM crumbs.1481
..........SDDL
C RKNA C LHNAE C RNTWNDYT C K C PN GY
KGKK C ARR...
DM crumbs.1761
............IL
C FQSD C KNDGF C QSPSDEYA C T C QP GF
EGDD C GTD
DM crumbs.1798
...........IDE
C LNTE C LNNGT
C INQVAAFF C Q C
QP GF EGQH C EQN
DM crumbs.1836
...........IDE
C ADQP C HNGGN
C TDLIASYV C D C
PE DY MGPQ C DVL
DM crumbs.1874
.....KQMT C ENEP
C RNGST C QNGFNASTGNNFT
C T C VP GF EGPL C D
DM crumbs.1916
...........IPF
C EITP C DNGGL
C LTTGAVPM C K C
SL GY TGRL C EQD
DM crumbs.1954
...........ine
c esnp c qNGgq
c kdlvgrye c d c ra ri rgir c en d
DM crumbs.1992
.........IDE C
NMEGDY C GGLGR C FNKPGSFQ C I C QK PY
CGAY C NF
DM crumbs.2037
.........TDL C
L-NGGR C V-E-S C GAKP-DYY C E C PE GF
AGKN C TAPITAKE...
Drosophila
Cadherin-Related Tumor Supressor (FAT)
DM FAT ..3961
...ENGGVCSATMRLLDAHSFVIQDSPALVLSGPRVVHDYS C Q C TS GF SGEQ C SRRQDP
DM FAT ..4013
.........rqdp
c lpnp c hsqvq c rr-l-gsdfq c m c pa nr dgkh c eker
DM FAT ..4051
........ERSDV
C YSKP C RNGGS
C QRSPDGSSYF C L C
RP GF RGNQ C ESVSDS
DM FAT ..4092
.........vsds
c rpnp c lhggl c vslkpg--yk c n c tp gr ygrh c erf
DM FAT ..4321
......nrqa c qpalaaer
c ggfagq c idrwsssl c q c gg hl qspd c sdslepitl
Drosophila
Vitillogenin Receptor
DM-vitR ..306
..........SKPD
C DAKK-- CALGAKC HMMPASGAE C F C PK GF
RLA-KFEDK C ED
DM-vitR ..349
...........VDE
C KEQDDL C SQG C ENTS-GGYR C V C DA GY
LL-DKDNRT C RAV...
DM-vitR ..659
.........nsphg
c -e-nat c sHl cllaepeigghsc a c pd gm rla-pdhrr c mlmek
DM-vitR ..984
..........ESHP
C QQQNGG C SHI C VGEGPYHSI C L C PA GF
VYRDAGNRT C VE...
DM-vitR .1375
...........ATA
CRSASGRQVC QHK C RATP-AGAV C S C FD GY
RL-DADQKS C LD
DM-vitR .1419
...........IDE
C QEQQP- C AQL C ENTL-GGYQ C Q C HA DF
MLR-QDRVS C KSLQ...
Fibropellin
I (SEA URCHIN SUSEGFI)
.19 ..........GQGE C DSDP C ENGST C QEGEGSYI C Q C PM GY DGQN C DRFTGS...
176 ..........DGDD
C DPNL C QNGAA
C TDLVNDYA C T C
PP GF TGRN C EI
214 ..........DIDE
C ASDP C QNGGA
C VDGVNGYV C N C
VP GF DGDE C EN
252 ..........NINE
C ASSP C LNGGI
C VDGVNMFE C T C
LA GF TGVR C EV
290 ..........NIDE
C ASAP C QNGGI
C IDGINGYT C S C
PL GF SGDN C EN
328 ..........NDDE
C SSIP C LNGGT
C VDLVNAYM C V C
AP GW TGPT C AD
366 ..........NIDE
C ASAP C QNGGV
C IDGVNGYM C D C
QP GY TGTH C ET
404 ..........DIDE
C ARPP C QNGGD
C VDGVNGYV C I C
AP GF DGLN C EN
442 ..........NIDE
C ASRP C QNGAV
C VDGVNGFV C T C
SA GY TGVL C ET
480 ..........DINE
C ASMP C LNGGV
C TDLVNGYI C T C
AA GF EGTN C ET
518 ..........DTDE
C ASFP C QNGAT
C TDQVNGYV C T C
VP GY TGVL C ET
557 ..........dine
c asfp c lNGgt
c ndqvngyv c v c aq dt svst c et
594 ..........DRDE
C ASAP C LNGGA
C MDVVNGFV C T C
LP GW EGTN C EI
632 ..........NTDE
C ASSP C MNGGL
C VDQVNSYV C F C
LP GF TGIH C GT
670 ..........EIDE
C ASSP C LNGGQ
C IDRVDSYE C V C
AA GY TAVR C QI
708 ..........NIDE
C ASAP C QNGGV
C VDGVNGYV C N C
AP GY TGDN C ET
746 ..........EIDE
C ASMP C LNGGA
C IEMVNGYT C Q C
VA GY TGVI C ET
784 ..........DIDE
C ASAP C QNGGV
C TDTINGYI C A C
VP GF TGSN C ET
822 ..........NIDE
C ASDP C LNGGI
C VDGVNGFV C Q C
PP NY SGTY C EI
860 ..........SLDA
C RSMP C QNGAT
C VNVGADYV C E C
VP GY AGQN C EI
898 ..........DINE
C ASLP C QNGGL
C IDGIAGYT C Q C
RL GY IGVN C EEVGF...
Fibropellin
II (SEA URCHIN)
.48 ...TKGQ C
ESDTNK C NNHGT
C -IE--GRW-GTYY C K C
EM PF RVGIPDSS
C YPPPE...
107 ...SENR C LSDTSN C DGHGI C QLSTFGRNE-RYI C F C AL GF
RNNN-YGG C SPYTPRE...
178 ...SLGR C KSDTHN C DEAGQ C VTKTYGRYAGEYI C V C NH GY RNNA-YGG C SPMTTRE...
252 ...SLSE C SQGTND C NENGE C -VEEDGK----YW C E C GE GY
EENE-DGG C SPIVTRAT...
Fibropellin III (SEA
URCHIN)
.18 .........YGQGE C GSNP C ENGSV C RDGEGTYI C E C QM GY DGQN C DRFTGA
176 ..........DGDD
C TPNP C LNGAT
C VDQVNDYQ C I C
AP GF TGDN C ET
214 ..........DIDE
C ASAP C RNGGA
C VDQVNGYT C N C
IP GF NGVN C EN
252 ..........NINE
C ASIP C LNGGI
C VDGINQFA C T C
LP GY TGIL C ET
290 ..........DINE
C ASSP C QNGGS
C TDAVNRYT C D C
RA GF TGSN C ET
328 ..........NINE
C ASSP C LNGGS
C LDGVDGYV C Q C
LP NY TGTH C EI
366 ..........SLDA
C ASLP C QNGGV
C TNVGGDYV C E C
LP GY TGIN C EI
404 ..........DINE
C ASLP C QNGGE
C INGIAMYI C Q C
RQ GY AGVN C EEVG...
64K
Sperm Flagellar Membrane protein (Sea
Urchin)
.42...........PDP C ASNP C TIASTH
C VAAGESHT C E C
RP GY FETNGN C
TVAQQFAG...
202 .. ....dfde
c asaddnd c dpnan c tntagsft c e c dt el ydnspnteepgrv c i
251 . ......AP
C DPGL C TRPNEI C NNGGTIEDDNL C K C IE GY
DYTQYGD C DPMARSTDF...
Exogastrula-inducing
polypeptide (Sea Urchin)
.48 .......TKGG C ERATNN C NGHGD
C ---VQGR-WGQYY C K C
TL PY RVGGSESS
C YMPKDKEEDVEIETKD
107 .......TVAR
C ERDTKN C DGHGT C QLSTFGRRTGQYI C F C DA GY
RKPNSYGG C SPSSARELEYLSYVARDVE
173 MEMLARDSVYQ C NRDTNS C DGFGK C EKSTFGRTTGQYI C N C DD GY R-NNAYGG C SPRTEREIEYLSMIARDQE
245 LEMQARDSLPQ C NRDTNY C DGFGQ C VKSTFGRTTGQYI C S C ND GY E-NNLYGG C SPK...
Mussel
Mytilus galloprovincialis ADHESIVE PLAQUE MATRIX PROTEIN 2 PRECURSOR
.47
... ....NP C LK-KP
C KYNGV C KPR--GGSYK
C F C KG GY YGYN C NLK
.84 ........NA C KP-NQ C KNKSR
C VPVGKT--FK C V C
RN GN FGRL C EK
120 ........NV
C SP-NP C KNNGK
C SPLGKTG-YK C T C
SG GY TGPR C EV
157 ........HA
C KP-NP C KNKGR C FPDGKTG-YK C R C VD GY
SGPT C QE
194........ NA
C KP-NP C SNGGT
C SADKF-GDYS C E C
RP GY FGPE C ER
231 ........YV
C AP-NP C KNGGI
C SSDGSGG-YR C R C
KG GY SGPT C KV
268 ........NV
C KP-TP C KNSGR C VN--KGSSYN C I C KG GY
SGPT C GE
304 ........NV
C KP-NP C QNRGR C YPDNSDDGFK C R C VG GY
KGPT C ED
342 ......KPNP
C NT-KP C KNGGK
C NY--NGKIYT C K C
AY GW RGRH C TDKA
382 .....YKPNP
C VVSKP C KNRGK C IW--NGKAYR C K C AY GY
GGRH C TKKS
424 .....YKKNP
C AS-RP C KNRGK C TDKGNG--YV C K C AR GY
SGRY C SLKSPPSYDDDEY
Caenorhabditis elegans EGF-like Proteins (Scroll down to view)
C. elegans
Lin-3
.150
.............KEAK
C KDY C HHNAT C HVEVIFREDRVSAVVPS C H C PQ GW EGTR C DRHYVQA...
C. elegans
Extracellular Mechanosensation response gene product MEC-9L
.157
........LSPQIA
C ----DH C DLRTSF C --KSNS-----KFNYT C E C RS GY EKNQYGE C ID
.200 ...........IDE C RGYKAV C
DRNA-W C -V--NEIG-----SYK C E C MA SY RGDGKH C TYVGLGRSS
.248 ............ID C --K-D- C
SMHA-T C -M--N--GV------- C Q C KE GY EGDGFN C TD
.280 ...........VNE C LRRPEM C
NKNA-E C -I--NREG-----SFI C T C LE GY AGNGYN C TVSKNS...
.545....GDIYKPNLTDT C LAKNP- C
KNNG-T C -I-F----VWKKDTHY
C K C QP GF HGN--N
C DKVV
.591 ..........DYDP C AEK-P- C
L-NGAT C QIKYNDDDVDEKPTFE
C F C AA GF GRPK--
C DQ
.637 ............RP C -ESNP- C
LNNG-T C --RTTK-G-Y--STYF
C E C AN GF GG-K-N
C DVS...
C. elegans
TM cell-adhesion receptor MUA-3 muscle attachment protein
.378
.............. ...DIR
C GN---KT C GLHES CQKNSESKY-E C I C RE GF TIFEGT C REL
.419 ............. ....IDE C AQG-KHD
C HPEAR C VDALIGY-E C L C RE GY LDTSIDPKARPGRK C RKL
.469 ............ .....INE C TNALMND
C SQNAR C LDKPIGY-T C R C QD DY VDVSREGARKPGRN C TQA
.520 ........... ......INE C ASN-LHN
C DTHAI C QDQPVGY-S C R C PF GF IDSSPTALEPGRK C VQAN
.570 .......... ............... NEAATTSTTTSQ
C IKEKNGETV C K C LL GY KNVGTKTHLN C QMEKR
.615 ......... ........ANP C QDYSLHD
C DPVAE C FSEQPGYFQ C Q C PK GF TDSSADKRFPGRK C VRA
.666 ........ .........VDE C ALG-RHT
C DPHAD C IDTHQGY-T C K C RS GW SDTSLDPLRSPGRS C KK
.715 ....... ..........ADM C SN---ID
C AAEAE C RETPIGP-M C Q C VS GY VDVSRQHGRPAGRV C RAV
.763 ...... ...........VNE C AEG-RHD
C SSHAT C IDTADGF-T C R C KD SY RDESSDTLKHPGKN C VRTVQPD
.817 ..... ............PPE C DVSDPMS
CDPAKREVC IFVENTY-K C R C AN GY SRLPDGR C VV
.862 .... .............INE C AEPRLNT
C GKNAE C IDLAEGY-T C Q C RS GY ADISPVSQPGRI C RAR...
.911 ... ..............VNE CSNKEKYNVDC
SENAI C ADTEHSY-S C R C RP GF ADVSAAFNKLPGRR C IEA
.964 .. ...............VNE C ASPSLND
C SKNAF C EDAKEGY-I C T C RP GY VDNSPNAARHPGRI C TKP
1015 . .VEKIKTDLKDTSFSTDDG
C DPK-NPK C GANEA C VQ-RHGQHN C E C VE TA FRYTDGS C RV
1072 ..................YSA C SKR--NT C DKNAI C LNRFDSY-T C Q C RP GY IDLSADLTNAPGRI C KEL
1121 ..................INE C ASS-DNE C SPYAR C IDATNGY-A C Q C LD GF IDVSSRYNKPPGRQ C TNS
1171 ..................NNE C SEKSLNT C DENAD C VDTPDGY-T C Q C YG GF VDVSSNANLPPGRV C TV...
1422 ..................EDV C NPRTQTG CDRSLNEHC AVEN-GRPR C V C PE GF TRHPFTRV C G
1467 ..................GDL C NPQLITS C IFPEE C QITPYKNFR C S C PE GY NRDYRSGF C VSVKEVQISPQH
1522 ..................DAN C HNG-GVR C SENER C TND-GSDWF C E C LP GF ERIRNGQ C AY
1564 ..................PGS C NPNDPMS CDVRKRQQC LPR-GNIYT C Q C GR NE KRHPITDI C L
1609 ..................KNE C LTG-EHD C DRSAR C IDT-DESYI C A C QS GF IDHSPNPSERPGRV C VAL
1659 ..................QNE C LD-GSNR C SPNAL C TDT-EEGYV C R C KS GF VDYSPNPQTFPGMV C KEL
1709 ..................VNE C TNPRLNQ C DRNAH C IDT-IEGYS C I C KP GF VDMDEFGNPGRR C EQIKT
1760 ..................NDK C SP-GKND C DRNAR C IQIGDDDYS C A C PP GF KDKSPSSSRPGRL C IPV
1810 ..................IPE C DNPTLND CDSPDRAVC TDT-DDGYM C R C RQ GF LDISPSISVKPGRL C KPL
1863 ..................QNE C AL-GIDD C ARDGGIC EDN-PDSFT C R C AM NY LDVSFDRVTRPGRK C KRL
1914 ..................INE C QT-GQND C SEEAT C TDT-EDSYI C A C PQ SH IDLSPDTVNRPGRR C
LMR
1964 ..................INE C TS-NRHD C SPNAD C IDT-PESYK C R C RD DF VDESPDSSRRPGRI C RPAL
2015 ..................VDE C RT-GKHD C HVNAI C QDL-PQGYT C Q C SA DF VDVSPHRASHPGRL C QPRPTPP
2069 ..................PPE C RLDGGNQ CKVHLNEVC RLM-GGEPK C S C PV NY QRDSSGS C SI
2114 ..................INE C LFTQLND C HTAAD C IDQ-VQGYT C Q C RD GF KDIGDRRRPGRM C KPM
2163 ..................VNE C QYPHLND C HQNAA C IDL-EEGYE C K C NQ GF MDHSHGRPGRI C KQL
2211 ..................TNE C LRPSLNS C DRNAR C IDK-EEGYE C E C RD GF IDVSPSPTLKGRA C REL
2261 ..................VNE C ANSRLND C DKNAR C KDT-MDSYE C D C PV NS KDISPSPSFPGRV C LMF
2311 ..................INE C ES-GVHD C DPSAT C RDN-EQSFT C E C PS GF VDRSPNKHARPGRV C VKL
2361 ..................VDE C RE-GRHT C SSHAD C RDL-EEGYT C E C RD GY VDRSPNLASQPGRV C SA
2410 ..................PEV C PP--NHD C SSAAV C EPLGGMKYQ C V C IQ GY VDQSPGSQKGRV C VR
2457 ..................NNA C HDPRLNT C SRNAI C YDE-PRGYR C E C KR GF MDRSPDSSQRGRV C EPPPPPSPPP
2514 ..................RHP C QDPERND C HPAGT C RATGAQSYT C E C LS GY ADRSPDPRNKPGRL C VLT
2566 ..................EPV C LDPEQND C HAAAI CSEVNGPEKYT C K C RD GY TDESPDPLRRPGRI C KGL
2619 ..................INE C LDRSLND C HSLAV C KDL-PNGYT C Q C PI NA KDQSPDPRKPGRI C SLS
2669 ..................VNE C ANPSLNS C SAFAD C FDE-ENGYR C R C RN GY HDDDPAHPGHR C SFM
2717 ..................INE C DSSNLND C DRNAN C IDT-AGGYD C A C KA PY RDEGPPQSPGRI C R
2764 ...............L...NE
C LNPNRNT C DRNAD C RDL-DYGYT C T C RH GF YDQSPNPQEPGRI C IEFQQEEHIE
2821 .....RVKVTTVQSEPRREFP
C GR---DD CIKARGEVC IS----GEY C G C KP GE GRSASTGK C QEVQETPFE
3010 ..................WGN C GG---MS CKEHLKEVC IA----GHI C G C PD GM KRRDANSE C RVVESWNVPLWVV
3177 ..................FNP C FKN---D C DPHGK C IEISKYAYK C E C GV GY RDINPQSPGKK C LPVHG
3125 ..................FNE C ERKEDNE C SENAR C ID-LEHLYK C E C LP SY YDTSPVGSVPGSL C V
3173 ..................LDY C SDV--NF C PTNTT C KN-MEQQAE C K C DA GF MDIRKSEKRTALMLGDDTL C MHVRD
3329 ..................VDE C ALG-LNN C SGVAH C ID-RAVGYT C K C PD GY IDGNPDEPGRV C G
3375 ..................ALL C D-----L C NAHGD CVHNTATNNIT C V C TD GW TGPQ C QVAPSNASLVLL
C.elegans APX-1
(Polarity)
....173 .............SNPI C --AGG
C SNRGR C -V-----APNQ C S C AD GF NGTR C E
....206 ................Q C LPRAG
C VNG-D C -V-N-E-TPNT C K C RD GF IGDR C DI
....240............. DIKI C SLEKP
C ANGGI C SIDSSSSTGYK C H C PF EF VGSQ C KTPL
....284 .............SKVR C SAEHV
C KNGGA C -I-SMDDTNIQ C K C RR GF SGKF C EI
C. elegans
Lin-12
....19
..........LHIGS
C LGLI C GRNGH C HAGPVNGTQTSYW C R C DE GF GGEY C E...
....114 ...........TQGW C YPSV C MNGGQ
C -IG-A-GN--RAK C A C PD GF KGER C ELD
....155 ............VNE CEENKNAC GNRST
C -MNTL-GT---YI C V C PQ GF LPPD C LKPGNTSTVEF
....201 KQPV C
FLEISADHPDGRSMY C QNGGF C -------DKASSK C Q C PP GY HGST C ELLEK
....251 ............EDS C ASNP C SHG-V
C -IS-FSGG---FQ C I C DD GY SGSY C QE
....287 ...........gkdn c vnnk c eagsk
c -in---g-vnsyf c d c pp er tgpy c ek
....325 .............MD CSAIPDIC NHG-T
C IDSPLS-EKA-FE C Q C EP GY EGIL C EQD
....366 ............KNE C LSENMC LNNGT
C VNLP--G---SFR C D C AR GF GGKW C DEP
....405 ............LNM C QDFH C ENDGT
C MHTS---DH-SPV C Q C KN GF IGKR C EKECPIGFGGVRCDLR
....457 ..........LEIGI CSRQGGKC FNGGK
C LS----G----F- C V C PP DF TGNQ C EVNRKNGKSSL
....497 ...........SENL C LSDP C MNNAT
C IDVD---AHIGYA C I C KQ GF EGDI C ERH
....544 ............KDL C LENP C SNGGV
C HQH-----RESFS C D C PP GF YGNG C EQE
....582 ...........KMFR C LKST C QNGGV
C INEEE---KGR-K C E C SY GF SGAR C EEK...
C.elegans SPE-9
....476...GNGR
C V-----P C ESDADDLMPL C NDNDNRQ-GFR C L C EA GY LPPF C KV
....519 ..Htnp c yq---nl c qns-----at
c hidp-kqrsyd c q c vn gt rgsl c en
....560 ..VDDS C DAFGNKI C V--HG----T
C INDEYFHRGFS C E C DD GF EGLD C N
C.elegans LAG-2
protein
....171
....DPRK C S C
ENDGI C VSSMIHPSQPNQTSSNEQLI C E C TN GF TGTR C EIFGFNQFQLTA
....228 ...........PRPDA C SVKDA C
LNGAK C FPNGPKVF C S C AV GF IGEF C EISLT...
C. elegans
GLP-1
.....18
......LMGGE C
GREG-A C SVNG-K C Y-NGKLIETYW C R C KK GF GGAF C E
....118 ........VNP C DSD-P- C N-NG-L
C YP---FYGGFQ C I C NN GY GGSY C EEG
....155 ........idh c a-q-ne c a-egst
c vns--vy-nyy c d c pi gk sgry c er
....192 .........TE C ALMGNI C N-HG-R
C IPNRDEDKNFR C V C DS GY EGEF C NKD
....233 ........KNE C LIE-ET C VNNST
C FNL----HGDFT C T C KP GY AGKY C EEA
....272 ........IDM C K-DY-V C QNDGY
C AHDS---NQMPI C Y C EQ GF TGQR C EIECPSGFGGIHCDLPLQ
....326 ........RPH C SRSNGT C YNDGR
C I------NG-F- C V C EP DY IGDR C EINRKDFKFPD
....370 ........IQS C K-YNP- C VNNAT
C IDLK---NSGYS C H C PL GF YGLN C EQ
....408 ........HLL C TPT--T C ANGGT
C EGV----NGVIR C N C PN GF SGDY C EIKD
....447 ........RQL C SR-HP- C KNGGV
C KN-T----G-Y- C E C QY GY TGPT C EEVLVIEKSKETVI...
C. elegans
Hemicentin
....4948
.........VNTD
C AGTINENGD C VDKDGKTHNLKILTGENH C PE GF AMNPHTRI C ED
....4997 .........LDE C AFYQP C -D-FE
C INYDGGFQ C -N-- C PL GY EL-AEEG- C RD
....5035 .........VNE C ES-VR C EDGKA
C FNQLGGYE C IDDP C PA NY SL-VDDRY C EP
C. elegans
EAT-20 (A & B)
.... 220
...........PPSP
C ANHE C HNNGT C LVSQEGAAT C L C RN GF TGDR C EL
.... 259 .............DV C SSVP C QNGGV
C RSNN-GIAY C E C PP AF TGLL C ESAHTDES
.....301 ...........VAPI C R-PE C SNG-Q
C VFKD-GQAQ C E C RQ GF TGAN C NVLDV...
C. elegans
LDLR
... .299
...............QNL
C PSLG C QAG C HPSPHGGE C T C PS GY KLDDRFHRT C SD
... .339 ..............ine c aefgy
c dql c anhrpgft c s c lg dc ftlqmehgpgkdnltmrgy c vsn...
... .669 ........ienp c tnad c egm
c ilskdnggfgvgyk c a c pi gq klvngkr c ids...
... .997 .......REHP C RASQ C TQL C
FATPSESHPNELEAK C A C RQ GF MINKENNHS C QKDPAE...
....1398 ..............rdl c sadragc
sfk c hnspngpi c s c pf ge qlvnktk c e
....1437 .............PENE C LDSSS
C SQR C KDEKHGFT C S C DE GY ELDVDKRT C KVADN...
....1747 .............knnp c stnp c
shl c llnnkntft c k c pm ge kldasgkk c iddak...
....2080 ...........SGHP C HINNGN C
DHI C IPLMFAQRT C T C AN GY VKDGQTS C K...
....2396 .......assp c qitdnlrkspc
tql c fatpgtqtpt c s c ar gv lkgrt c eepd...
....2728 .....TSQL C KTDNGG C DQL C
TVVADDIGLAASKVQ C S C ND TY ELVQEPGKDYPTQ C VLRGSNSE...
....3224 .........IDE C SLAEK-PL C
EQK C MDMKI-G-YK C D C FE GF AIDISDQKS C HNVNECYEG
....3266 ........NVNE C YEG--ISG C
SQK C DD-KI-GSYK C G C VD GY QLSSDDHS C KRTEM...
....3582 ........YPNP C --GDNNGG C
SHL C LIGAGGNGYT C S C PD QF VLLSDQKT C EPN...
....4066 ......HGGKTS C EAFGNNGG C
KHI C TDVR--DGFY C H C RD GF RPDPQSPKE C ID
....4132 ........DIDE C -AG-NN-T C
TQL C LNTK--GSYL C R C HE DY ENNVVVGSMTGKD C RAKGD...
....4477 ............tvsg c eraq c
shl c vslpstg-fa c l c pd gi vpqldgs c atqhve
....4521 ...............ALTMPKQ C K
C TNGGK C RLDGS C E C TS DF EGDQ C EKESSVS...
C.elegans Hypothetical
Metaloproteinases
F42A10.8 320 ...MNKIY C SAV C PSKLP C QRGGYTDPRRCDR C R C PD GF TGQY C EQVMPGYGAT...
.K04E7.3 328 ....ntaf c sni c tnrin c qhggyadpnncgq
c t c pt gl egty c erlqtsn...
...R15.1 243 .................KIQMK C SNCGITDSRNCNQ
C K C PR YF TGAS C DSLPSGT...
..R151.5 320 ...MNKIY
C SAV C PSKLP C QRGGYTDPRRCDR C R C PD GF TGQY C EQVMP
C. elegans
Hypothetical protein F33C8.1 from Chromosome X
........
62 ..............SLSS
C DKP C Y-NGV C LNKA C V C SK GW YGSQ C DH...
........201........... ...ESNR C AYN
C SNHGS C LNGK C D C ED GY KGLN C EYQV
C. elegans Hypothetical protein F58E6.3 (CAA94773)
........211
...TNQ C KASTHN C HWLAA C IDLPDENHKKMYS C K C KP GF VGNGFH C VD
........257 ............A C EGF C LNGGS
C LKTGRGETK C L C AS GF A--GKR C QATE
C.elegans Kunits/BPTI
domains
....155
.........IFNKTESGHGNGL
C QD C DPLYGT C LDGK C G C MK GF RSLGKV C ID
....199 .............LNE C DNGAV C
GPNAR C VNEIGSFQ C V C DA GF STDGD C KIGQE...
C. elegans
Hypothetical protein K08C7.3
....421...........PQP C KV C D C
DPDKHTG-A C AEETGK C E C LP RF VGED C DQ...
....464.........YDAPK C KP C E C
NVNGTIGDV C LPEDGQ C P C KA GF GGTF C ET...
....511..... ...NVTAG C VE C V C
DATGSEHGN C SASTGQ C E C KP AY AGLS C DK...
....557..........FGDD C KF C N C
DPMGTEGGV C DQTTGQ C L C KE GF AGDK C DR...
....601 .......FYGYPN C KA C A C DGAGITSPE
C DATSGQ C P C NG NF TGRT C DK...
....647 .......FYNYPD C RG C E C LLSGAKGQT
C DSN-GQ C Y C KG NF EGER C DR...
...1871 ...NGSPYD
C MA C A CPFAPTNNFAKSC DVSEEGQLLQ C N C KP GY TGDR C DR...
C. elegans CAB63409
.... 81 .........VDQ C GD-DP CGEPEYFLC
TSKI----ESHT C A C QA GY TGAD C TSE
....121 ........LGTA C AT-SP C RSGAT
C VSSNSSATTEYS C I C TD QQ FGTH C EY
....163 .........DNL C AS-NP C QNSGN
C TMVLEN--SNYL C T C SP DW QGRR C EVAD...
....312 .........ANA CTDNSTLCNANLGQGIC
INLSSDVTNGYQ C I C GP LW TDTD C ETPVA
....361 ..........SA C TP-SP C WNG-I
C VLNEQ--YNTYT C A C DD GY FGDL C QY
....398 .........PDV C TS-ST C LYGGT
C TETSSG---GYT C S C LS QY FGTN C EE
....436 .........INR C NYADP C VNG-D
C QTTVDGITTNYT C T C DS GW TGEN C DTM
....478 .........IDY C IP-NP C SYNST
C SPYFK----GFN C T C IT GL TGAN C STSRSNYSTPDQLGLQ
C. elegans CAB63408
....251 .........ENA C DNSTFCSADLGQGLC
INWQSDFTDGYA C I C KP LW TGRN C ENPV
....298......... PEA C TP-SP C LNG-T
C VLNDQYN--KFT C V C DD GF FGDK C QY
....336 .........SDI C TS-AT C LYGGT
C TELNNG---DYK C D C LL QY FGKN C EV
....374 .........INR C DYGKP C NNG-K
C ASTIDGITTNYT C T C DD GW TGTN C DTM
....416 .........IDF C IP-NP C SYNST
C KPKFKG----YD C T C IT GL TGVN C STI
....454 .........IDL CVPYKDSTGKWIKTPCNSKDDMANCTKGINTLTCSC
GD KW TDTL C DLN
C. elegans CAB11570
.... 65 ...........IVDQ C GNDP C GTPDKFN
C TSKIESYD C T C QA GF TGEK C DSE
....106 ...........IGSA C ATSQ C REGST
CEASDNSTTGYT C R C TD KQ YGTY C E
....146 ...........NDNL C ADWP C QNGGT
C SMVLSNSNWI C N C TE GW QGRR C ETKNSVLSSP...
....295 ........DPNA C EDNSTLCGAELGHGMCINWQSDVTDGFA
C I C EP LW TGPD C ENP
....343 ........VASA C TP-NP C VNG-T
C VLNEQYN--KYK C D C DD GF SGDN C E
....381 ........YSKT C TA-AT C LYGGT
C TETNNG---GFK C D C LN LY SGNR C EEI
....421 ..........NR C NYGDP C VNG-K
C ETTVDGITTNYT C T C DD GW TGAN C ET
....461 ........MIDF C TP-DP C KYNST
C TPKFKG----YD C T C LT GL TGAN C SE
....499 ........IIDLCVPYWDNEAWKWIKSPCNSKDDMANCTKKINGFTCSC
GE KY TDTL C DLNV...
C. elegans Sequences from Chromosome III
(LOCUS AAC24388)
....1670 .........PID C SA-KM C ENNAE
C SVF--MHRAQ C H C KP GY VGDR C EML
....1708 .........EDV C ST-QP C YNGGK
C EQV-G-TTYK C T C PK MF NGAR C QFE
....1746 .........SDE C NG-VK C PNGGV
C HDLPGVKSTT C L C RT GF AGPQ C EEI
....1786 .........TDI C STNNP C RNGAR
C IGEKLG-RFK C Q C VP GW EGPN C DKNIG
(LOCUS 861291)
.... 11 ....VLLSAIAANSLINPSNLTDDY
C KNGGS----IVNGK C E C TL RY EGPQ C ER
.... 57 .......................Er
c lNGgrrhsakgtvr c h c py gl sgdr c ek
.... 88 ......................VTY
C EPGKG---KLVEGK C E C FE RW TGLF C N
....116 ......................MRT
C FNGIP-T-GGLDGF C L C DV GY TGPF C DA
....146 ......................PLI
C ENGGSVTQVTTENE C A C TA GY TGDH C EQCAIGY...
(LOCUS 1065960)
....131 .......NSDLPP C SSEFDGI C
GTNGI C LMDGSRQI C H C DI GY MGET C DKVLMGAYD...
(LOCUS 1065514)
....361 .......Y C FGLYSGST C SQML
C ANGGFLPTPTSDR C Q C PE GF SGFH C QNIL...
....855 .......F C TPEFTGTY C QNII
C YNGG---TASGDH C V C PP GY AGES C EMAR...
...1490 .......V C TDYWTGSR C TVPI
C VNGGTR-NPDEAT C S C PD GY EGPN C QFEV...
(LOCUS 1495334)
.... 99...SDFVSS
C AM--- C FTRGTDF C ETTVDADGNYAYQ C H C KP KY SMST C WYT
....145... . PDA C TPTT- C NGHG--K
C -----YDYVEDVK C D C YW GY EGEH C EVNKDR...
(LOCUS 1065455)
.....91........... ...ailnns c anget
c --ig-gsv c dldt-l-r c m c py gt tpkldtls c ess
....298 .................gas c ksnei
c --vg-gsi c tlpi-g-i c l c pg dl earege c vlpaastisv
....344 ............qkvgigal c sdlae
c --dh-gst c ---vmg-r c t c vs pl vqhegk c vlrqqqkiv
....392............... gpgel c dsget
c --gk-gsi c d-sv-ipv c v c pa qt dlsnge c isvpaptsqpv
....535 .............QAGVGVR C SLNTD
C --MI-GAY C NGNTNPPS C Q C LS TH VNIEGR C EKVIYP
....583................ gqvg c rsdlq
c haahsgth c ----idri c v c pe gq rakgn c qpiqfygafk
....630............... ninnq c sskdr
c -ag--gsk c -kd-a--l c q c vd ga veltgk c kqf
....669 .......................pggh
c sngem c sggss c ylgk c r c dp sr tldnqr c vqtavs
....710 .......................igst
c rrgqq c vngaa c rfgm c m c vs kt vavlgr c vsgeaavats...
....770 ........................GIP
C SLQDF C LGGSN C QDGF C L C DD DW IQ--DNDK C VLPTTQETSSTKNE
....818 ..................IDDALEETD
C KSDDV C PVNGK C VNGM C L C LP GF KLN-GEV- C EKE...
....870 ....................katpgsq
c ttsse c sfrtk c segv c r c kk ge tiidst c rsaihh
....914 .....................vlpglt
cdpsngydc vgesi c qygv c k c kr rl v sdgqk c vpihlalm
....961 .....................vipgks
c asgep c gggsy cakdgi c r c pn de vadvnkk c vkknsvisvfn...
...1072 ....................SILAGHQ
C THNSE C PSFSF C FSNS C N C MA GF RATSG-I- C EPAIAV
...1115 .......................vgep
c vtsnq c fdese c vfgi c t c tg pn c kdtkma
...1150 ......................hpged
ctslktv c synsy cslmssvc e c ps gm atkgtk c entfe
...1194 ......................SIGKD
C VTSRN C QKSSY C DNGY C V C KN GH KIGENM C FNSPSEYKSFSILP...
...1331............. .......iafpgey c gtgqv c lgnsv
c enqf c r c lq dv aaengi c pp qvdnlrvlglqplgk
...1385.efrfsegkkiemrrtsslplen c
qneev c ennst c qsilglgri c q c ve nt vlwngn c vivedsyd
...1452 ....................ltpidgn
c dedsm c lsgse c vdgk c l c sd gk rlilgi c vfial
...1495 .......................pets
c engev c ingsv c gdsn c e c te nt ynhngn c vdikldesli...
...1571 ...................rrelasid
c andqe c qpnfk c qeyv c v c dn st en c lksivdlkvs
...1616 .....................vppgsg
c setrk c ggdsi c ykdy c v c sy ed lpendq c vsrdwhi
...1660 ......................glgfq
c stvtr c redlt c lggv c m c kf gd vk c dpsepvts
...1700 ......................ppggs
c snlre c tggsv c regw c i c pd ps mivnrgi c iqsg...
...1791 ......................vpggr
c gpidv c vggsn c iegf c l c pa gq qpsnsgr c ekftttsrq...
...1895 ........................DDE
CTAIGLI C KGNTV C RNKS C Q C PE TY VLHHDG-- C VSPEEAARRKA
...1956 ......................kpges
c tqgqt c vggsa csfrkl c e c pq dk seisqgq c vtprkle
...2001 .....................VVPGAS
C NANTV C TKGST C ESGL C R C QP GY IAVSGN-- C VALPMSTT
...2051 ....................iakples
c enget c eggsn cdydtgic m c pp gq ivfnvq c mppptqpqit
...2101 .....TRVTTPVKAAPVVTPKPIHSTD
C EIDAN C GENKI C VSGK C K C KP GF VDNSGT-- C EPLEDID
...2411 ...................KVAAVGSA
C RPIDI C LGESV C TNGF C H C PE NY IRQNGQ-- C ISKE
...2461 .......................VGET
C KNGEI C AGGSI CDYDRKRC I C AA QH VAIRGI C KQKSAPAFAA
...2508 .......................PGDT
C SMREK C TGGAT C FEGM C T C DD HH FAEDGY C RPIEARSS
(LOCUS 1914441)
....861 ................SLNIK C AHLAY
C SRRGT C -KEGI- C I C PH GY TGFD C SI
....897 ..................PLF C --N--
C SGNGL C NLLNI- C L C ND GW SGSD C SI
....928 ..................P-R C LTN--
C TGHGK C IQPNS- C E C DA GW MGET C SV
....960 ..................TS- C IDSQ-
C -THGH C GTNGL- C K C ED GW QGSR C QI
....992 ..................PL- C --NS-
C SLNGI C TRPGF- C S C FE DY GGSD C SK...
...1029 ...................EA C -DFD-
C -NHGI C EPLTKT C S C AK GW MGGA C DV...
...1161 .................lydg c -epsl
c --qgs c --vgpl c i c pq gk tgif c dviev...
(LOCUS 1753009)
.... 98 .....vspq c s--icsqpgl- c
-nsgq c vpdarfpwqyfy c v c pd ya sg--rf----- c qn
....143 ......EIK C ---KDNS---- C
GKNAD C YV-A---NHQLN C I C KP GY TA--RRNG-RD C DMKVQ...
....515 ......PST C --AEPFPEY-- C
-DQG- C V-D-----G--- C E C DP GY VIDNTVTGSIK C IRLDQ...
....580 ....TIYHE C QNGTMWSDYRP C
SDDGS C VLN----SIDMQ C K C NN GY RGD----G-YN C TD
....626 ......INE C ----VETPGI- C
-GHGQ C VNT----PGSYH C T C DD FW LGD-------N C NTYKP...
(LOCUS 1209406)
.... 47 ...... DL C --LNSP C KNNA-I
C -------ETTSSRKYT C N C TP GF YGVH C ENQ
.... 84 ......IDA C --YGSP C LNNAT-
C -------KVAQAGRFN C Y C NK GF EGDY C EKNIDD...
....128 .....VNSK C ENGGK- C VDLVRF
C SEELKNFQSFQINSYR C D C PM EY EGKH C EDK
....177 ......LEY C TKKLNP C ENNGK-
C --------IPINGSYS C M C SP GF TGNN C ETN
....217 ......IDD C -KNV-E C QNGGS-
C --------VDGILSYD C L C RP GY AGQY C EIPPMM
....258 DMEYQKTDA
C QQ-SA- C GQGE-- C ------VASQNSSDFT C K C HE GF SGPS C DRQMSV...
....477 ...SATVNF
C -AG-ID C GNGK-- C -----TNNALSPKGYM C Q C DS HF SGEH C DEKRIK...
(LOCUS 551647)
....186... ...kgle c ----------ies
c apsqwfnd c s-k---s-- c h c dg gd scdqengr c pngkcspgwigepic
....238 ....dedmde c -------emgidn
c --pneqpd c lntp-gsfl c l c fe yd eaqqk c knsk...
....367 ....VTAPAA C ---AR--------
C ---DQNAK C SN---G--V C T C SE GF TGDGFR C YD
....403 .......VDE C ---EIPG----AV
C --RDHS-I C SNT-IGSFE C T C HG GY RFEDGK- C ED
....444 .......VDE C --RELP-----KI
C GDPNKGTK C INK-DGTFE C L C KD GY EGDPSSE C RD
....489 .......VNE C --KN-P-----DA
C G-PN-SQ- C TNT-QGGYE C E C LA GF ERIAEGAH C TD
....531 .......RDE C -AVE-P-------
C ---HPAAI C SNT-RGSYK C E C RD GF VGDGKT C HETILYPI...
....780 ...grlgepl
c ---d-r------e c aaghygin c ----est-- c h c dg sv acdvitgm c pgalcra...
....835 ......DIDE C -------EMSLVT
C -AV-GSQ- C VNT-RGGYR C D C KG GF APVGKE C KP
....877 .......IDR C -LS--RFSVP---
C -SRN--AE C VESIESNPK C V C RK GY HGDGFR C TTRDNIK
(LOCUS 642185)
....336 ...RDFARF
C ----AVKP---- C HKYAT C TDSKRGPK C S C DT GF QGNG-TY C ED
....378 ......IDE C SFSQDAKEQLGG C
LSGST C RNVPGSYK C D C LP GY QMIGENT C LLLIRV
(LOCUS 2291197)
....11 ..........LIVLVA C VLRKKKP
C INGTPEGDR C Y C IE GW TGTL C HR... ***###
....64 ...IE C
ELGWAGTD C DIID C HGNGMPNYDLTE C T C TV PY SGKY C EIADTK...
(LOCUS 2088683)
.... 23 .......gngl c ipeqkv c nrind
c --------anfadesn c t c ne ne fr c qsga...
....406 .....DNPQGN C SSTRNS C DQTWN
C --------QRHAGFES C S C DD GY HLSPYDKKT C LR
....453 ........SPS C PKA-N- C -SH-F
C I-----D-RRDV-GHQ C F C AP GY ILSENQKD- C RRNDT...
....766 .....KTARHP C SQ-PAR C DN--L
C IPANTPDLTTSSDNFT C M C AQ GF RS-E-GR-S C VSE
...1247 ........GAE C HHFG-V C AQK--
C -WMAF------NHTAR C H C AP GY ARTKNDENE C EPLSKSAE...
...1588 .......qknp c qs-dy- c psntv
c vp--dqd-kngilipk c l c gp gr ffevstkk c mqlr...
...1637 ...EKDQEASQ
C --GDYF C YNNAG C -SP-----Q-----KT C V C PP GF YGRQ C ELYFTSK
...1701 ......QFKNS - DQQQML - SVTAP
- QTTLNPAAQQTFVKDF C E C DK GW TGPH C RHKAD
...1750 ........akv c --ygh- c fsgga
c ----dge---gplnlr c s c gd gl tgnr c q
...1781 ...GNR
C QN C -VG-HE C LNGGF C -SYANSNR---S-LPH C I C PS GF TGDH C EEYL...
...1823 ........eyl c -k-da- c pfgsk
c ----tyditkpmdpit c s c eq na aahntd c s
...1881 ........SPI C QKQPNW C HNGGR
C --LDTP----GY-PGK C K C LP RF AGPR C DV
...1903 ........PVQ C --DDY- C TNNSK
C -TI---T-NGTHFE-- C D C KP GF KGLR C EQETK...
...1941 ........ tk c s-e--- c sneak
c ikk--ps---gtvi-- c q c pq gl ggey c ek
...1974 ....EKITATS C -HQL-E C KNGGY
C LKPD-PSANRTSPT-- C L C SP GF KGLL C SS
...2020 .........NA C --ENF- C LHDGN
C -TLD------EYFEPQ C E C YQ AF IGDR C QYRIHQHA
....
(LOCUS 1707249)
....24 ...MIDSM
C PSE C HSRGY C FRIEGEPR C F C QA GF EGDV C QFIEST...
(LOCUS 1707244)
....64........ ...stvatiedtkhgwgvmdtllli
c i c lv vi gfy c ifmg...
(LOCUS 1938562)
....154 krd c
sdgsdehsm c hvedpktads c eygaamtidgik c y c pk nk flneqg-k c ef
....207 .........vnh c ertknglppv
c sqd c tdnkngtft c s c fd pr ltvvngth c vadkdqtips
....545 ...........KNKGKNI C KAAM
C DDM C IGHINQTSS C F C RD GY IQN-N-TK C SASKD...
....889 ..................ehf c fass
c kgsvg c epvk c g c ad gm kvlkng-k c vkdeewv
...1266 . ...........TFN C TDADDLG
C SHT C RPTPVGPI C S C PD DH YLDKNGVT C SK
...1308 .. .............KDP C RFGQ
C SQY C SPHGSSHF C Y C ED SF KLGADRSS C ISDDPR...
...1624 ... ..........ikns c dgfk
c ngi c lnngknkat c q c tn lq sttddg-- c vdiqqvllvatkngvr...
...1941 .... .........KNVPDE C KK
C PSL C LRSSDKKFQ C V C SQ GF EL-VNG-K C RSP...
...2266 ... ...lfta c snnngg c ehl
c ittpsensptvrke c l c vh st rld-ngs- c gssds...
...2590 ..... ......tsda c adnelk
c edy c rlmangqas c a c ng er rlnsdnrt c tggkf...
(LOCUS 1707269)
.... 27 ...ILDD
C -ADSP- C ALNAT-- C VDLINDY-K---- C E C PT GF SG-KR- C HIK
.... 66 ....ENL C -ASSP- C VHGL---
C IDKL--YSR-Q-- C L C QP GW TGE--N C DQN
....103.... IDE C -AASP- C QNDAK--
C IDEING-----YM C E C AD GY EGV--H C QHL
....141.... VDH C -AKQP- C HNNAT--
C TNMGAT-----YH C D C TL GF DGV--H C EMN
....179.... IDE C AENQ-- C DKLGTES
C RDAVN--D---FK C V C KP GY TGPR-- C DVK
....219 ....QDQ C -ADSP- C LNDAQ--
C VD-MGG----AYK C V C KS GW TGPK-- C EQD
....257 ....ngs c -aakp- c rnNGf--
c vslva--d---yf c v c pp gv sg-k-n c esa
....295.... PNR C -IGQP- C HNGGE--
C --GDFGSHL---E C A C PA SF TG-KG- C EFK
....333 ....NTG C ---K-T C ENGGK--
C AEAAGG--L-Q-K C E C SP GF TGER-- C ETN
....370 ....ide c sta-h- c psgat--
c vdqvnth-i---- c v c pf nl tgv--h c dkmint...
...1151 ....TNQ C ATGEYN C SWHAN--
C IDLPDENDVPSYE C R C KP GY RGNGTH C T
...1196 .....DA C --NDF- C LNDGI--
C KKNNIGN-V---E C I C KD HF SGDR-- C ELRFQASN...
(LOCUS 2429448)
....122 . ...kkgip c --gntf c sielgee
c i--agk--i- c g c pk gq krkdansp--- c raves
....288 ...LNPFSN
C --YHSD C HPDAI-- C K-EVGK-GYT C T C PD GF RDLNPSRPGRN C LSYRGV
....367 .....DIDE C ALGLHN C SAAAI--
C I-DK-KIGYE C Q C QE GY EDGNPSQPGRI C AA
....414 . ......SL C --GL-- C N-GHG-D
C IHDALSSNVT C A C LD GY TG----Q---F C ETAPSNLP
(LOCUS 1226288)
.....53 ...DE
C PKRK C QNNSQ C QFDGSESK C V C QS GY IGEF C EIDQTV...
(LOCUS 1465832)
....312 ...GI
C HCIKGQTGDK C EHFE C VHGLSVGFRFDPESLLFSEP C I C EV GW KGEM C DYR
....366 ............................PAEK
C GNKG---EWKGDR C E C VG SY FGSE C QY
....405 .............................TSK
C VEG---F-LRNGR C I C DV GF EGDY C D
....421 .............................EIV
C VYGSPDFKNRTLS C D C PD KY TGRR C EQ...
(LOCUS 1465823)
....104 ...YGVQ
C KCPDAEQQQLDQH C RQLPA C QNGGYRSQSIG-RR C S C PQ PY FGEY C E
....155 .............................KL
C DQGQVLAGIDGNNY C S C LP FY QGET C SD
....186 .............................LV
C LNG----GHEFRGR C S C PH NF VGYH C EI...
(LOCUS 1125759)
....320..LPYQ
C ILPPVSTTTVPAPTTTAAPLTT C QNGGQVLKDSSGSPY C Y C FG LY TGRD C SQM
....378 .............................L
C ANGGFLPTPTS--EH C E C PE GF TGFH C QN...
....825 .........RAGTML C SAVHPTPPPQHQ
C QNGG-VMNPTNTT-- C F C TP QF TGTY C QNI
....872 .............................V
C YNGGTV---SGGQ-- C V C PP GY AGES C EVPR...
...1462 . .......FGNTFQRLTFGH C SPATIV
C GNGG--IR-QNGQ-- C I C TD YW TGSR C TVP
...1507 .............................I
C VNGGT--KNSDEAT- C T C PD GY AGLN C QFE...
(LOCUS 1519683)
.... 31.....................TEET
C QNDP C LNGGT C TPGKLS C T C AT GW MGRY C HRK
(LOCUS 1458242)
....210 ....................SPQSG
C SGGAA C ICGARGNCM C E C AT DF GYTLASDGKT C QRVRR
....254 ................RLKEK C KTDME
C SAAFSE C SSGG C R C KR GF KRNGDGG C EPLGE...
....488 ...YYGDS
C HISSQ C VYSKSPDAAEEYAEVAKME C ARSI C S C PA GF SYADGQ C KRI...
(LOCUS 1226304)
.... 34 ..............EPQ C G-YTPKA
C TEQ C IMNT C D C KD GF VRNSLGK C VEVSE...
.... 95 ...............AT C EKPNPTV
C TKQ C IVNV C Q C SK GF VRH-GLR C IDKKDCPK...
(LOCUS 1226303)
.... 35 .............ETQ C G-YTPKV
C LSAQ C IENA C D C KK GF VRNSLGK C VD...
.... 96 .............EPT C EKPGPRP
C -TRQ C IVNV C Q C SS GF VRN-GYR C T ELKECPK...
(LOCUS 1226302)
.... 32 .............EPQ C G-FDPTV
C SLE- C KPNA C V C KD GY VRNTKND C VRRLE...
.... 92 .............qpt c ddpypts
c ehdr c irnv c r c lp gl vrnsgt- c tsld...
(LOCUS 1125776)
.... 19 ....NRA C --SRNT C L-NGGT
C TVNDET--RMFQ C E C PK GF SGLL C Q
.... 57 .....DN C --SLH- C L-HG-N
C VKGTFG--EE-T C Q C SE GW MGSL C DNLVTDDDTAQ...
....125 ...NEPS
C --ATHT C QNNG-T C -VAENG--NV-K C A C PP GF VGDH C ETD
....164.... EDE C --KENF C Q-NGAD
C E-NLKG-S--YE C K C LK GF SGKY C EIQD
....203 ....KKQ C --TSDY C HNNG-Q
C -IST-G-SDL-S C K C SP GF DGAF C ELK
....241 ..AEVNE C I----- C ENPAHV
C SLVN-GTSRTTQ C E C PS GF MGAD C KE LQ
....283 ....ARP C --DREP C L-NGGH
C VD--DGQ-NLFT C F C LP SF TGIY C GE
....321 ....PVD C LVNGSD C K-NGGK
C VFAL---AAT-T C Q C PE GF NGSN C EISNSYR
....365 ...SHPT C --SDIR C L-NGGS
C KL--DAEGEPF- C V C EE GF DGPF C E
....403 ...PKSG C --TINP C Q-NGGT
C QDA-DGQ--YF- C H C TS GF GGVH C ET VDEPSTPIP...
....468 ...SKIS
C ---ED- C V-NSSN C -L-DV-ESGP-V C I C DD GY FGQK C DQK
....505 ....HDK C --SKVS C P-SGQT
C SQVNDNVNITAQ C G C EI GH FGQQ C EMVTSATFSAKSLYIHQ
....714... RSEQ C --TRAY C Q-NEGI
C -I-DHWESSS-- C K C KP PY LKPN C VYFLPKTTFGHLDQ...
...1174 ....TDQ C GLY-ST C L-NGAT
C -V-DIWNKRK-- C V C PA GF AGEN C EDN
...1213 ....VND C --KFVD C GKHG-Y
C -L-D-GIDEA-K C I C NN GF HGEH C ELA
...1251 ....KDE C --EGVE C H-NGGK
C -VKNRSEKI--V C Q C GN SW MGDS C NVT
...1290 ...KTTN C --KDSP C QNFGQ-
C -MQKTDTFFE-- C N C MD GY SGEL C EQRD
...1331 ....VNE C --NHYD C -NRGH-
C -VMTVS--GP-A C Q C EM GY TGRF C EKL
...1368 ....LNQ C --SSNT C SSRGA-
C SP--VWNNT--V C N C DN NW RGAH C Q HQ
...1406 ....MDT C --LDFP C NNDGV-
C -RTN-DEN-TFS C E C QK FF MGTR C EI
...1444 ....EGS C --LKAQ C -VHGE-
C -IQLSPETHT-- C S C NI GY EGDA C DKK
...1482 ....IDY C --KAGP C -LNGAN
C ENKLTG----YK C T C AV GF EGAD C EIN
...1520 ....IDE C --ALEF C -KNGAK
C -R---DKINDYE C V CDGT GF EGRN C TTD
...1559 ....INE C -ANPNN C -INGE-
C TNTLG---N-YK C A C RN GF IGPR C SVRNP C TAQIASNNISS
...1611 .....VT C --VHGK C VNPVVQ----IEKNREVAKYE
C A C DR GY TGPT
C SQR...
(LOCUS 1118117)
....118 ...............pdlt c sydnq
c agyplai c hsv c q c vk ga lntgtt c iasstaiqtsvacpagqtyirea
....177 .....gvcmtvqqpgep c qysqq
c salepgay c lkmr c e c vy gm kkssng c tfvnndckerghifiseigec
....244 .......revfppggkg c shnlq
c sgaypdat c fmqt c t c pp nl pvaadgt c grscpnnqvysgvtgec
....306 ........lpekqpgqd c iyssq
c qasfgglv c dknt c r c pn gl vfdglk c shgcpphkrvidkeicve...
....380 ...........VKQVSIGQP C VANAQ
C NFGSF C QSGT C Q C PP GF YVQDEQ C QAIES
....425 ...............EPNES C QNNEK
C TKGSV C YNGK C S C PR NH ELINGH C QQNRAAAHA...
....505 .........dndtvpigsa c vrigvt
c dggsv c vagi c v c pl gk tprngv c iehvaa
....551 .............rpgts c qneee
c vdhsy c spetnk c e c mk as qmvigge c rerlka
....596 ...............hpgyg c tmgem
c vgnsv c vngk c a c vd gk veinki c idqvs
....637 ............akpgdt c gkgii
c eggsy c ntdsgk c a c rr ge nsingi c kgftf
....681 .............vypgdl c tditsr
c tggsy c argr c e c pp rm saidkk c vhqqta
....725 ...............apgep c sekva
c spfsv c ennv c k c vn nm mirdkm c vqrrkv
....761 .........VQRRKVNIGNS C NNEDQ
C LGNST C MDNN C Q C GI GF VASMDV C VLRKTGKP
....811............ NFLTPGYL C NPEDI
C TGQSV C IKGV C Q C QP DY KQMHNI C VKKNIGI
....857 ...............egsp c ssrdd
c geglm c gasgk c s c pe gl fsvngk c rsyvq
....898 ...............LGQT C TSDDR
C AERNAQ C QENY C T C RT GY TNINGQ C AANIVTPAEPETLSQVKCMSYQ
....956 ......FLFLSPSGLLGHI C TSNDH
C KIAHSQ C RRNV C Q C ID GY RIFGSTQ C IPRPGKPKERKTEKESKLV
...1021 ...............ELGDK C DKLSL
C SKGAI C EKGV C S C PE TF FESDGA C VKNVAKIKV
...1067 ..............VPPLSS C LGGEE
C SGNSE C VHGI C F C KE EF TLFEGK C QRLRIIE
...1198 ..............skpghm c dnkth
c tncsv c vngf c r c pe gl vhygdk c vsei
...1239 ................datk c lasnq
c psgaq c vkge c r c kp gl gitrygf c vpitfa
...1281 ...............epgts c aygeh
c qkdsh c edgl c t c ne pl vlkenk c vvsprekrfisdvhrkllrftpk
...1340 ............KLAKLGEY C FRNSH
C ESQRQ C LKNV C K C AS NF VQSSFS C VPRMSVISS
...1388 ..............LALPGES C RKGF
C VGGST C ENFM C K C PD DY FKKGDS C VRYESR
...1431 ................IGAA C GTASG
C SGGAT C TSSF C Q C QD QY DADVDE C YPYEP...
...1521 ..............apgga c edgtil
c tgnsv c annv c i c pg ge tvqngt c vsi
...1561 ..........ntysspgdp c dltnti
c tgnsq c idgi c k c pn nq gaingr c snmg
...1606 ..................NAN C GNIQ
C GTNQI C IQDS C Q C RP GY YQQPGS C LQDR C
...1644 .........N C IQEVESDS C LNRQ
C GMNQV C IQDQ C Q C RS GY LVLQET C ISDR C
...1689 .......N C VQPSVDAISGG C MNQ
C GNNQV C IQDQ C L C RN GY YAQPET C TGDR C
...1736 ........N C VQHVVPDMGN C QRQ
C GNNQV C IQDQ C Q C RN GY YAQTET C VAD...
...1809 ........qfiglpgkm c dlrpnaip
c rndaq c vnny c i c ps nr visgsn c vfylgdal
...1861 ...............PGQS C QNNGMI
C RGGSS C NQNI C Q C AV GF SVDNGR C TPTIEVRFTMLPVTTAI
...1914 pvfiielnpgqtc
dps c ayqpcmqr c sggss c snsi c s c pq gs gvlnnv c spsfpqndnynltrtar
...1981 ................PGDS C DNTIV
C IGGSS C LIGT C L C DS GY EPSSDRSS C VLNDRYNVR
...2027 ............SRSYPKTF C TFDSE
C TGGSI C IDKR C A C RN DH EMINGV C QLAN
...2070 .......lpgsr c htsf c --sk-gae
c -rngy- c v c ak tn ysdstld- c vssinan
...2113 ...................QGSMAYPGSK
C NATTS C QQN S S C FF GY CVTPQDEIDREANIK
...2155 ...........irhiekkk c gsykd
c gksqt c ssdrl c e c tfn t nlvnge c v
(LOCUS 1086823)
.... 53 ...FQGS
C AAEHVL C EQL C EALSPETYE C S C WE GH VLQDDGLS C RVDTDVQ...
....561 ...fqdp
c --stfg c pnk c lahnsmp-v c q c ew pm sgrk c svtses
Plasmodium ookinete surface antigens
Plas24K .11 .....FFIQLAIRYNNAKITVDTI C
KGGKLIQMSNHYE C K C PS GY ALKTENT C EP
Plas25K .21 ...............nakvtvdtv c
kkgfliqmsghle c k c en dl vlvneet c ee
Plas28K .11 ......FIQIAIILTIAAPSDDEP C
KNGYLIEMSNHIE C K C NN DY VLTNRYE C EP
Plas24K .61 .IVK
C DKLENINKV C GEYSI C INQGNFGLEKAFV C M C TN GY MLSQNI C KP
Plas25K .61 KVLK
C DE-TTVNKP C GDFSK C IKID--GSPISYA C K C NP GY DMVNNV C IL
Plas28K .60 .KNK C TSLEDTNKP C ADYAR C
LEDPYKDNKSNFY C L C NR GY IQYEDK C IQ
Plas24K 108 .......kptr c ynye c nagk c ildsinp-nnpv c s c di gk ilq---ngk c tgt
Plas25K 108 .........ne
c knvt c gngk c ildtsnpvktgv c s c ni gk vpnaddknk c skdg
Plas28K 109 .........ae
c nyke c gegk c vwdgih-edgaf c s c ni gk vinpednnk c tkdg
Plas25K 151 .....KDGETK C SLK C LKENET C KAVD-G-I-YK C D C KD GF IIDNESSI C TAFSAY...
Plasmodium Merozoite
surface proteins
PlasMSP-4 201 ...EDEDL C KHNNGD C GDDKL C EYVGNRRVK C K C KE GY KLEGIE C VE...
PlasMSP-5 206 ...QNRKS C AINNGG C SDDQI C ININNIGVK C I C KD GY LL-GTK C I...
Toxoplasma gondii RH micronemal protein MIC6
ToxgMIC6 .36 ...IADN
C SGNP C GGT-AAGT C IN-TPS-GYD C R C EP GY VLGVE-NDQVT C MMPSGVPMANFVQLSE
ToxgMIC6 .96 ...TPAA C SSNP C GP-EAAGT
C KE-TNS-GYI C R C NQ GY RISLDGTGNVT C IVR
ToxgMIC6 144 ...QESG
C EENG C GPPDAVQS C RRLTGTAGRL C V C KE NF IATIDASAHIT C KRVP
Yeast ATP-Dependent Permease
..........57...........FN
C MLPIFE C KQFSE C NSYTGR C E C IE GF AGDD C SLPL
Arabidopsis
.416
........TNE C
LQNNGG C WED--KTTNITA C RDTFRGRV C Q C PI VQGVKFLGDGYTH C EASG
.690 ........ALR C GINNGG C WKQTQMGKTYSA
C RDDH-SKG C K C PP GF ---IGDGLKE C KD
1118..... ...GRK
C M---SD C SGQ-------GV C NHE--FGL C R C FH GF TGED C SQKL
1154..... ...RLD
C NYEKTPEMPYGKWVVSI C SRH C DTTRAM C F C GE GTKYPNRPVPES C GFQIN