Cyanidioschyzon merolae Genome Project

CMS034C (583 aa)
by blastp 2.2.10 [Oct-19-2004] against nr [Apr-30-2005]


structure-specific recognition protein 1 [Dictyostelium discoideum]

Unknown (protein for IMAGE:4405213) [Xenopus laevis]

Ssrp1 protein [Xenopus tropicalis]

DUF87 [Xenopus laevis]

hypothetical protein [Gallus gallus]

structure-specific recognition protein 1 [Gallus gallus]

Structure specific recognition protein 1 [Danio rerio]

SSRP1 protein [Homo sapiens]

transcription factor [Vicia faba]

structure specific recognition protein 1 [Mus musculus]

Structure specific recognition protein 1 [Homo sapiens]

Structure-specific recognition protein 1 (SSRP1) (Recombination signal sequence recognition protein) (T160)

Structure specific recognition protein 1 [Rattus norvegicus]

structure-specific recognition protein 1 (HMG1 DNA-binding protein) [Arabidopsis thaliana]

early drought induced protein [Oryza sativa (indica cultivar-group)]

putative SSRP1 protein [Oryza sativa (japonica cultivar-group)]

Unknown (protein for IMAGE:7140964) [Danio rerio]

GA18454-PA [Drosophila pseudoobscura]

SD06504p [Drosophila melanogaster]

CG4817-PA [Drosophila melanogaster]

HMG1-related DNA-binding protein [Mus sp.]

single-stranded recognition protein

PREDICTED: similar to structure specific recognition protein 1; recombination signal sequence recognition protein; chromatin-specific transcription elongation factor 80 kDa subunit; high mobility group box; facilitates chromatin remodeling 80 kDa subunit; cis... [Pan troglodytes]

HMG protein [Catharanthus roseus]

putative HMG-box with DNAbinding protein [Oryza sativa (japonica cultivar-group)]

unnamed protein product [Tetraodon nigroviridis]

SPBC609.05 [Schizosaccharomyces pombe]

ENSANGP00000009009 [Anopheles gambiae str. PEST]

SSRP1 protein [Zea mays]

high mobility group protein [Arabidopsis thaliana]

high mobility group protein - Arabidopsis thaliana

PREDICTED: similar to structure specific recognition protein 1, partial [Bos taurus]

hypothetical protein FG02518.1 [Gibberella zeae PH-1]

unnamed protein product [Debaryomyces hansenii CBS767]

AER138Cp [Ashbya gossypii ATCC 10895]

hypothetical protein UM04416.1 [Ustilago maydis 521]

Hypothetical protein CBG04066 [Caenorhabditis briggsae]

unnamed protein product [Candida glabrata CBS138]

unnamed protein product [Saccharomyces cerevisiae]

hypothetical protein [Neurospora crassa]

structure-specific recognition protein 1 (SSRP1) (recombination signal sequence recognition protein) [Cryptosporidium parvum]

structure specific recognition protein [Cryptosporidium hominis]

Hypothetical protein CBG09136 [Caenorhabditis briggsae]

Hmg protein 4 [Caenorhabditis elegans]

hypothetical protein AN6687.2 [Aspergillus nidulans FGSC A4]

unnamed protein product [Kluyveromyces lactis NRRL Y-1140]

Hmg protein 3 [Caenorhabditis elegans]

unnamed protein product [Yarrowia lipolytica CLIB99]

Structure-specific recognition protein 1 (SSRP1) (Recombination signal sequence recognition protein) (T160)

structure specific recognition protein, putative [Plasmodium falciparum 3D7]

hypothetical protein CNBL3050 [Cryptococcus neoformans var. neoformans B-3501A]

putative structure specific recognition protein [Plasmodium yoelii yoelii]

structure specific recognition protein, putative [Plasmodium berghei]

hypothetical protein PB300791.00.0 [Plasmodium berghei]

structure-specific recognition protein 1 [Bos taurus]

structure specific recognition protein, putative [Plasmodium chabaudi]

structure specific recognition protein, putative [Entamoeba histolytica HM-1:IMSS]

STRUCTURE-SPECIFIC RECOGNITION PROTEIN [Encephalitozoon cuniculi GB-M1]

SSRP1-like protein [Zygosaccharomyces rouxii]

hypothetical protein PC300891.00.0 [Plasmodium chabaudi]

hypothetical protein MG10128.4 [Magnaporthe grisea 70-15]

structure-specific recognition protein 1 [Plasmodium yoelii yoelii]

RNA polymerase beta' subunit-2 [Huperzia lucidula]

hypothetical protein PF10_0319 [Plasmodium falciparum 3D7]

Hypothetical protein F55F10.1 [Caenorhabditis elegans]


gb|EAL62292.1| structure-specific recognition protein 1 [Dictyostelium discoideum]
(527 aa)

Score: 284 bits (727), Expect: 5.2367e-75
Length: 466, Idn/Pos/Gap = 159/258/57 (34%/55%/12%)

Query:  20 GSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFR-GARGDQLRCTRKDGSTVNFEGFR  78
           G L+ +   + W   N  + T+S +  D+    W R   R  QL  + K G+TV F+GF+
Sbjct:  25 GILKFTTNNITWKSENGKIETVSSS--DIKRANWARVTPRIFQLILSIKGGATVKFDGFK  82

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSG-SETAFELPISD 137
             D   + +Y+ + +     +   E    G N+G+V++ G  + F +   +  FE PIS+
Sbjct:  83 EQDYEVVRKYLSDQYN-VSPLEIIELSSKGCNWGEVKVNGPMIQFTTDHGKVGFEFPISE 141

Query: 138 ISQVV--RSGRNELALEFHLDDTAGKTDECLVEMRFQAPTEEDA---------------- 179
           +SQ V   + +NEL LEFH D      DE +VEMRF  P                     
Sbjct: 142 VSQSVIGANNKNELTLEFHHDKAMDDDDETMVEMRFFTPIRPSKEGEEGGKEKKVGEDGE 201

Query: 180 --------------IALHAELSARVGSASFM----GESLVFFEELPFIVPRGRYDLELFP 221
                         I+   +    + + S M    G+SLV F  + F+ PRGR D+E++P
Sbjct: 202 EDEEDEEDAEKEEEISALEQFQQTIMNKSDMVSNVGKSLVVFSAIQFLTPRGRIDIEMYP 261

Query: 222 TYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRD 281
           T+L +HGK+ DYK+ Y+++ R+F   +PD+ H+  +ISLDPPIRQG T Y HLV+Q + +
Sbjct: 262 TFLKLHGKTHDYKVPYESISRLFQFFRPDQKHIFFIISLDPPIRQGQTKYAHLVIQFQAE 321

Query: 282 DDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAH 341
           ++ I + LN+  +ELQ+++ D+L P ++G    ++ K+L+ L  K +  P NF +  GA+
Sbjct: 322 EN-IHLELNLT-DELQQKFKDQLSPIMNGNANALICKILKALTGKKITIPGNFQSDSGAN 379

Query: 342 ALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKRLEM-------DRRFDLQV 394
           +++ +L AN+GYLYPLE CFFFV+KPPTY+++ED+  +EF R          +R FDL +
Sbjct: 380 SIKCSLKANEGYLYPLERCFFFVHKPPTYIKFEDISNIEFARYGAPSVRGGSNRTFDLSI 439

Query: 395 VMLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGIPPALLRGATTG 440
            + N +++ F N++R E+ +L+ FL+ K++       ++L   TTG
Sbjct: 440 NLKNSTSIQFVNIQREEYPSLFNFLKEKKL-------SILNPVTTG 478


gb|AAH82613.1| Unknown (protein for IMAGE:4405213) [Xenopus laevis]
(458 aa)

Score: 280 bits (717), Expect: 6.78448e-74
Length: 420, Idn/Pos/Gap = 156/250/20 (37%/59%/4%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNF  74
           S+  G LRLS+AGL++     G +  IS A  D+A + W R A G  ++     G    +
Sbjct:  17 SMNDGRLRLSRAGLMYKNNKTGKVENISAA--DIAEVVWRRVALGHGIKLLTNGGHVYKY  74

Query:  75 EGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELP 134
           +GFR  +   L +Y K  ++    +   +  V GWN+G VR  G+ + F  G + AFELP
Sbjct:  75 DGFRETEYDKLFDYFKSHFSVE--LVEKDLCVKGWNWGSVRFGGQLLSFDIGDQPAFELP 132

Query: 135 ISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPTEED-------AIALHAELS 187
           +S++SQ   +G+NE+ LEFH +D    ++  L+E+RF  P  +D       A A +    
Sbjct: 133 LSNVSQCT-TGKNEVTLEFHQND---DSEVSLMEIRFYVPPTQDDGGDSVEAFAQNVLSK 188

Query: 188 ARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLP 247
           A V  A+  G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP
Sbjct: 189 ADVIQAT--GDAVCIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLP 246

Query: 248 KPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPK 307
             D+  +  VISLDPPI+QG T Y  L+L   +D+D + + LNM  EE+++R+  KL   
Sbjct: 247 HKDQRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-MTLTLNMSEEEVERRFEGKLKKS 305

Query: 308 ISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKP 367
           +SG L+++V++V++ LV++ +  P NF+   G+  +  +  A+ G LYPLE  F +V+KP
Sbjct: 306 MSGCLYEMVSRVMKALVNRKITVPGNFLGHSGSQCITCSYKASSGLLYPLERGFIYVHKP 365

Query: 368 PTYLRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
           P ++R++++  V F R     R FD ++    GS   F+++ER E+  L+ F+ +K++ +
Sbjct: 366 PVHIRFDEITCVNFARGTTTTRSFDFEIETKQGSQYTFSSIEREEYGKLFDFVNAKKLSI 425


gb|AAH74541.1| Ssrp1 protein [Xenopus tropicalis]
(629 aa)

Score: 278 bits (710), Expect: 3.97859e-73
Length: 495, Idn/Pos/Gap = 175/276/42 (35%/55%/8%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNF  74
           S+  G LRLS+AGL++     G +  IS A  D+A + W R A G  ++     G    +
Sbjct:  17 SMNDGRLRLSRAGLMYKNNKTGKVENISAA--DIAEVVWRRVALGHGIKLLTNGGHVYKY  74

Query:  75 EGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELP 134
           +GFR  +   L +Y K  +     +   +  V GWN+G VR  G+ + F  G + AFELP
Sbjct:  75 DGFRDTEYDKLFDYFKSHFRVE--LVEKDLCVKGWNWGSVRFGGQLLSFDIGDQPAFELP 132

Query: 135 ISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPTEED-------AIALHAELS 187
           +S++SQ   +G+NE+ LEFH +D    ++  L+E+RF  P  +D       A A +    
Sbjct: 133 LSNVSQCT-TGKNEVTLEFHQND---DSEVSLMEIRFYVPPTQDDGGDPVEAFAQNVLSK 188

Query: 188 ARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLP 247
           A V  A+  G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP
Sbjct: 189 ADVIQAT--GDAVCIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLP 246

Query: 248 KPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPK 307
             D+  +  VISLDPPI+QG T Y  L+L   +D+D + + LNM  EE+++R+  KL   
Sbjct: 247 HKDQRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-MTLTLNMSEEEVERRFEGKLKKN 305

Query: 308 ISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKP 367
           +SG L+++V++V++ LV++ +  P NF    G+  +  +  A+ G LYPLE  F +V+KP
Sbjct: 306 MSGCLYEMVSRVMKALVNRKITVPGNFQGHSGSQCITCSYKASSGLLYPLERGFIYVHKP 365

Query: 368 PTYLRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
           P ++R+++V+ V F R     R FD ++    GS   F+++ER E+  L+ F+ +K    
Sbjct: 366 PVHIRFDEVNCVNFARGTTTTRSFDFEIETKQGSQYTFSSIEREEYGKLFDFVNAK---- 421

Query: 427 VGIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK-RD 485
                         K S++   ++  M  A ++ A  +   ED  +A  +  K +GK R+
Sbjct: 422 --------------KLSIKNRGLKEGMKPAYDDYADSD---EDQHDAYLERMKEEGKIRE 464

Query: 486 PQTFDDEDDEEDEDF 500
               D+  DE DE F
Sbjct: 465 NADSDESGDETDESF 479


dbj|BAA76333.1| DUF87 [Xenopus laevis]
(693 aa)

Score: 275 bits (704), Expect: 2.07592e-72
Length: 495, Idn/Pos/Gap = 174/277/42 (35%/55%/8%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNF  74
           S+  G LRLS+AGL++     G +  IS A  D+A + W R A G  ++     G    +
Sbjct:  17 SMNDGRLRLSRAGLMYKNNKTGKVENISAA--DIAEVVWRRVALGHGIKLLTNGGHVYKY  74

Query:  75 EGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELP 134
           +GFR  +   L +Y K  ++    +   +  V GWN+G VR  G+ + F  G + AFELP
Sbjct:  75 DGFRETEYDKLFDYFKSHFSVE--LVEKDLCVKGWNWGSVRFGGQLLSFDIGDQPAFELP 132

Query: 135 ISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPTEED-------AIALHAELS 187
           +S++SQ   +G+NE+ LEFH +D    ++  L+E+RF  P  +D       A A +    
Sbjct: 133 LSNVSQCT-TGKNEVTLEFHQND---DSEVSLMEIRFYVPPTQDDGGDSVEAFAQNVLSK 188

Query: 188 ARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLP 247
           A V  A+  G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP
Sbjct: 189 ADVIQAT--GDAVCIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLP 246

Query: 248 KPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPK 307
             D+  +  VISLDPPI+QG T Y  L+L   +D+D + + LNM  EE+++R+  KL   
Sbjct: 247 HKDQRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-MTLTLNMSEEEVERRFEGKLKKS 305

Query: 308 ISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKP 367
           +SG L+++V++V++ LV++ +  P NF+   G+  +  +  A+ G LYPLE  F +V+KP
Sbjct: 306 MSGCLYEMVSRVMKALVNRKITVPGNFLGHSGSQCITCSYKASSGLLYPLERGFIYVHKP 365

Query: 368 PTYLRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
           P ++R++++  V F R     R FD ++    GS   F+++ER E+  L+ F+ +K    
Sbjct: 366 PVHIRFDEITCVNFARGTTTTRSFDFEIETKQGSQYTFSSIEREEYGKLFDFVNAK---- 421

Query: 427 VGIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK-RD 485
                         K S++   ++  M  A ++ A  +   ED  +A  +  K +GK R+
Sbjct: 422 --------------KLSIKNRGLKEGMKPAYDDYADSD---EDQHDAYLERMKEEGKIRE 464

Query: 486 PQTFDDEDDEEDEDF 500
               D+  DE DE F
Sbjct: 465 NADSDESGDETDESF 479


emb|CAG31300.1| hypothetical protein [Gallus gallus]
(706 aa)

Score: 273 bits (698), Expect: 9.88171e-72
Length: 493, Idn/Pos/Gap = 165/275/22 (33%/55%/4%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S+  G LRLS+ G+++     G    ++   +LA   W R A G  L+   K+G    ++
Sbjct:  17 SMNDGRLRLSRQGVIFKNSKTGKVD-NIQASELAEGVWRRVALGHGLKLLTKNGHVYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L ++ K  +     +   +  V GWN+G VR  G+ + F  G +  FE+P+
Sbjct:  76 GFRESEFDKLSDFFKAHYRLE--LAEKDLCVKGWNWGTVRFGGQLLSFDIGEQPVFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSARV 190
           S++SQ   +G+NE+ LEFH +D A   +  L+E+RF  P T+ED +    A    + ++ 
Sbjct: 134 SNVSQCT-TGKNEVTLEFHQNDDA---EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
                 G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  D
Sbjct: 190 DVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
           +  +  VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG
Sbjct: 250 QRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP +
Sbjct: 309 SLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVH 368

Query: 371 LRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGI 429
           +R++++ FV F R     R FD ++    G+   F+++ER E+  L+ F+ +K++ +   
Sbjct: 369 IRFDEISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKLFDFVNAKKLNIKN- 427

Query: 430 PPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGKRDPQTF 489
                RG   G          ++         R + +G+  EE  +  +   G+   ++F
Sbjct: 428 -----RGLKEGMKQSYDEYADSDEDQHDAYLERMKEEGKIREENANDSSDGSGEETDESF 482

Query: 490 D--DEDDEEDEDF 500
           +  +EDD+  E+F
Sbjct: 483 NPGEEDDDVAEEF 495


ref|NP_001005796.1| structure-specific recognition protein 1 [Gallus gallus]
(706 aa)

Score: 273 bits (698), Expect: 9.88171e-72
Length: 493, Idn/Pos/Gap = 165/275/22 (33%/55%/4%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S+  G LRLS+ G+++     G    ++   +LA   W R A G  L+   K+G    ++
Sbjct:  17 SMNDGRLRLSRQGVIFKNSKTGKVD-NIQASELAEGVWRRVALGHGLKLLTKNGHVYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L ++ K  +     +   +  V GWN+G VR  G+ + F  G +  FE+P+
Sbjct:  76 GFRESEFDKLSDFFKAHYRLE--LAEKDLCVKGWNWGTVRFGGQLLSFDIGEQPVFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSARV 190
           S++SQ   +G+NE+ LEFH +D A   +  L+E+RF  P T+ED +    A    + ++ 
Sbjct: 134 SNVSQCT-TGKNEVTLEFHQNDDA---EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
                 G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  D
Sbjct: 190 DVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
           +  +  VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG
Sbjct: 250 QRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP +
Sbjct: 309 SLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVH 368

Query: 371 LRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGI 429
           +R++++ FV F R     R FD ++    G+   F+++ER E+  L+ F+ +K++ +   
Sbjct: 369 IRFDEISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKLFDFVNAKKLNIKN- 427

Query: 430 PPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGKRDPQTF 489
                RG   G          ++         R + +G+  EE  +  +   G+   ++F
Sbjct: 428 -----RGLKEGMKQSYDEYADSDEDQHDAYLERMKEEGKIREENANDSSDGSGEETDESF 482

Query: 490 D--DEDDEEDEDF 500
           +  +EDD+  E+F
Sbjct: 483 NPGEEDDDVAEEF 495


gb|AAH56311.1| Structure specific recognition protein 1 [Danio rerio]
ref|NP_997967.1| structure specific recognition protein 1 [Danio rerio]
(518 aa)

Score: 273 bits (698), Expect: 1.14831e-71
Length: 521, Idn/Pos/Gap = 174/271/34 (33%/52%/6%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S   G LR SK  +++     G    S+   DL+  +W R   G  L+     G    ++
Sbjct:  17 SWNDGRLRFSKQTVVYKSHKTGKVD-SIPAPDLSEAQWRRVCLGHGLKLATSTGHIYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GF+  D   +  + K  +     +   +  V GWN+G  +  G  + F       FE+P+
Sbjct:  76 GFKETDYEKISAFFKANYKVE--LEEKDMCVKGWNWGTAKFAGSLLSFDVSDSPVFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP--TEEDAI----ALHAELSAR 189
           S +SQ   +G+NE+ +EFH +D A   +  L+E+RF  P  T +D      A    + ++
Sbjct: 134 SSVSQCA-TGKNEVTVEFHQNDDA---EVSLMEVRFYVPPNTGDDGSDPVEAFAQNILSK 189

Query: 190 VGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKP 249
                  G+++  F+EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  
Sbjct: 190 ADVIQATGDAVCIFKELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHK 249

Query: 250 DEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKIS 309
           D+  +  VISLDPPI+QG T Y  L+L   +D+D I +ALNM  +E++KRY  KL   +S
Sbjct: 250 DQRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLALNMSEDEVEKRYEGKLSKNMS 308

Query: 310 GELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPT 369
           G L+++V++V++ LV++ +  P NF    G+  +  A  A+ G LYPLE  F +V+KPP 
Sbjct: 309 GPLYEIVSRVMKALVNRKITVPGNFQGHSGSQCITCAYKASSGLLYPLERVFIYVHKPPV 368

Query: 370 YLRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQ--VRM 426
           +LR+E++  V F R     R FD ++     +   F+N+ER E+  L+ F+ +K+  ++ 
Sbjct: 369 HLRFEEISCVNFARGTTTTRSFDFEIETKQNNQFTFSNIEREEYGKLFDFVNAKKLTIKN 428

Query: 427 VGIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGKRDP 486
            G     ++GA     S           +  E   R E DG DD E D  ++   G    
Sbjct: 429 RGFKEG-MKGAEDYSDSDEDQHDAYLERMKEEGKIREEGDGSDDSEGDSDESFNPG---- 483

Query: 487 QTFDDEDDEEDEDFAGDVEDESDGAPSDSDAARDDDDDDDD 527
               +EDD        DV +E D   S SD+  DD D +D+
Sbjct: 484 ----EEDD--------DVPEEYDSNASVSDSEGDDGDSEDE 512


gb|AAH91486.1| SSRP1 protein [Homo sapiens]
(633 aa)

Score: 270 bits (691), Expect: 6.51292e-71
Length: 495, Idn/Pos/Gap = 167/273/39 (33%/55%/7%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S+  G LRLS+ G+++     G    ++   +L    W R A G  L+   K+G    ++
Sbjct:  17 SMNDGRLRLSRQGIIFKNSKTGKVD-NIQAGELTEGIWRRVALGHGLKLLTKNGHVYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L ++ K  +     +   +  V GWN+G V+  G+ + F  G +  FE+P+
Sbjct:  76 GFRESEFEKLSDFFKTHYRLE--LMEKDLCVKGWNWGTVKFGGQLLSFDIGDQPVFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSARV 190
           S++SQ   +G+NE+ LEFH +D A   +  L+E+RF  P T+ED +    A    + ++ 
Sbjct: 134 SNVSQCT-TGKNEVTLEFHQNDDA---EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
                 G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  D
Sbjct: 190 DVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
           +  +  VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG
Sbjct: 250 QRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP +
Sbjct: 309 SLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVH 368

Query: 371 LRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGI 429
           +R++++ FV F R     R FD ++    G+   F+++ER E+  L+ F+ +K       
Sbjct: 369 IRFDEISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKLFDFVNAK------- 421

Query: 430 PPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK-RDPQT 488
                      K +++   ++  M  + +E A  +   ED  +A  +  K +GK R+   
Sbjct: 422 -----------KLNIKNRGLKEGMNPSYDEYADSD---EDQHDAYLERMKEEGKIREENA 467

Query: 489 FDDEDD---EEDEDF 500
            D  DD   E DE F
Sbjct: 468 NDSSDDSGEETDESF 482


emb|CAA66480.1| transcription factor [Vicia faba]
pir||T12113 transcription factor - fava bean
(642 aa)

Score: 270 bits (689), Expect: 1.15826e-70
Length: 519, Idn/Pos/Gap = 173/274/47 (33%/52%/9%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFR  78
           PG +++   G+LW  R  G  TI V + D+  + W +  + +QL    KDG    F GFR
Sbjct:  20 PGQIKIYSGGILWK-RQGGGKTIDVDKTDIMGVTWMKVPKTNQLGVQIKDGLLYKFTGFR  78

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPISDI 138
             D  +L  + +  +  T  V   +  V G N+G+V + G  + F+ GS+ AFE+ ++D+
Sbjct:  79 DQDVVSLTNFFQNTFGIT--VEEKQLSVTGRNWGEVDLNGNMLAFMVGSKQAFEVSLADV 136

Query: 139 SQVVRSGRNELALEFHLDDTAGKTD-ECLVEMRFQAPTEEDA-IALHAELSARVGSASFM 196
           SQ    G+N++ LEFH+DDT G  + + L+EM F  P+     +      SA+V     M
Sbjct: 137 SQTNLQGKNDVILEFHVDDTTGANEKDSLMEMSFHIPSSNTQFVGDENRPSAQVFRDKIM 196

Query: 197 G---------ESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLP 247
                     +++V F+ +  + PRGRY +EL  ++L + G++ D+KI Y +V R+FLLP
Sbjct: 197 SMADVGVGGEDAVVTFDGIAILTPRGRYSVELHLSFLRLQGQANDFKIQYSSVVRLFLLP 256

Query: 248 KPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPK 307
           K ++ H  ++ISLDPPIR+G T YPH+V+Q    D  ++  L +  +    +Y DKL   
Sbjct: 257 KSNQPHTFVIISLDPPIRKGQTLYPHIVMQF-ETDTVVDSELAISEDLYNSKYKDKLELS 315

Query: 308 ISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKP 367
             G + +V T VLR L    +  P NF + Q  +A++++L A DG LYPLE  FFF+ KP
Sbjct: 316 YKGLIHEVFTTVLRGLSGGKVTKPGNFRSCQDGYAVKSSLKAEDGILYPLEKSFFFLPKP 375

Query: 368 PTYLRYEDVDFVEFKRLEMD----RRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQ 423
           PT + +E++D+VEF+R          FDL + + +    LF N++R+E+  LY F+ SK 
Sbjct: 376 PTLILHEEIDYVEFERHAAGGSNMHYFDLLIRLKSEQEHLFRNIQRNEYHNLYGFISSKG 435

Query: 424 VRMVGIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK 483
           ++++ I                A A +A   +A      +  + +DD+  D    +    
Sbjct: 436 LKIMNI----------------ADAQQAVGGVA------KVLENDDDDAVDPHLERIRN- 472

Query: 484 RDPQTFDDEDDEEDEDFAGDVEDESDGAPSDSDAARDDD 522
              +   DE DEED DF  D +D   G+P+D   A   D
Sbjct: 473 ---EAGGDESDEEDSDFVIDKDD--GGSPTDDSGADVSD 506


ref|NP_892035.1| structure specific recognition protein 1 [Mus musculus]
gb|AAH42502.1| Structure specific recognition protein 1 [Mus musculus]
(711 aa)

Score: 268 bits (684), Expect: 4.8649e-70
Length: 483, Idn/Pos/Gap = 164/268/27 (33%/55%/5%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S+  G LRLS+ G+++     G    ++   +L    W R A G  L+   K+G    ++
Sbjct:  17 SMNDGRLRLSRQGIIFKNSKTGKVD-NIQAGELTEGIWRRVALGHGLKLLTKNGHVYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L ++ K  +     +   +  V GWN+G V+  G+ + F  G +  FE+P+
Sbjct:  76 GFRESEFEKLSDFFKTHYRLE--LMEKDLCVKGWNWGTVKFGGQLLSFDIGDQPVFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSARV 190
           S++SQ   +G+NE+ LEFH +D A   +  L+E+RF  P T+ED +    A    + ++ 
Sbjct: 134 SNVSQCT-TGKNEVTLEFHQNDDA---EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
                 G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  D
Sbjct: 190 DVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
           +  +  VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG
Sbjct: 250 QRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP +
Sbjct: 309 SLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVH 368

Query: 371 LRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMV-- 427
           +R++++ FV F R     R FD ++    G+   F+++ER E+  L+ F+ +K++ +   
Sbjct: 369 IRFDEISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKLFDFVNAKKLNIKNR 428

Query: 428 ----GIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK 483
               GI P     A + +    A   R +     EE   RE +  D   +DD   + D  
Sbjct: 429 GLKEGINPGYDDYADSDEDQHDAYLERMK-----EEGKIREENAND--SSDDSGEETDES 481

Query: 484 RDP 486
            +P
Sbjct: 482 FNP 484


gb|AAH05116.1| Structure specific recognition protein 1 [Homo sapiens]
sp|Q08945|SSRP_HUMAN Structure-specific recognition protein 1 (SSRP1) (Recombination signal sequence recognition protein) (T160) (Chromatin-specific transcription elongation factor 80 kDa subunit) (FACT 80 kDa subunit)
gb|AAA58660.1| high mobility group box
ref|NP_003137.1| structure specific recognition protein 1 [Homo sapiens]
(709 aa)

Score: 267 bits (683), Expect: 5.33254e-70
Length: 495, Idn/Pos/Gap = 167/273/39 (33%/55%/7%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S+  G LRLS+ G+++     G    ++   +L    W R A G  L+   K+G    ++
Sbjct:  17 SMNDGRLRLSRQGIIFKNSKTGKVD-NIQAGELTEGIWRRVALGHGLKLLTKNGHVYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L ++ K  +     +   +  V GWN+G V+  G+ + F  G +  FE+P+
Sbjct:  76 GFRESEFEKLSDFFKTHYRLE--LMEKDLCVKGWNWGTVKFGGQLLSFDIGDQPVFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSARV 190
           S++SQ   +G+NE+ LEFH +D A   +  L+E+RF  P T+ED +    A    + ++ 
Sbjct: 134 SNVSQCT-TGKNEVTLEFHQNDDA---EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
                 G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  D
Sbjct: 190 DVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
           +  +  VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG
Sbjct: 250 QRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP +
Sbjct: 309 SLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVH 368

Query: 371 LRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGI 429
           +R++++ FV F R     R FD ++    G+   F+++ER E+  L+ F+ +K       
Sbjct: 369 IRFDEISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKLFDFVNAK------- 421

Query: 430 PPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK-RDPQT 488
                      K +++   ++  M  + +E A  +   ED  +A  +  K +GK R+   
Sbjct: 422 -----------KLNIKNRGLKEGMNPSYDEYADSD---EDQHDAYLERMKEEGKIREENA 467

Query: 489 FDDEDD---EEDEDF 500
            D  DD   E DE F
Sbjct: 468 NDSSDDSGEETDESF 482


sp|Q04678|SSRP_CHICK Structure-specific recognition protein 1 (SSRP1) (Recombination signal sequence recognition protein) (T160)
gb|AAA48685.1| HMG box (bp. 1499..1757)
(669 aa)

Score: 265 bits (678), Expect: 1.99283e-69
Length: 486, Idn/Pos/Gap = 164/267/38 (33%/54%/7%)

Query:  42 SVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPR 101
           ++   +LA   W R A G  L+   K+G    ++GFR ++   L ++ K  +     +  
Sbjct:   4 NIQASELAEGVWRRVALGHGLKLLTKNGHVYKYDGFRESEFDKLSDFFKAHYRLE--LAE  61

Query: 102 GEQGVVGWNFGQVRIEGESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGK 161
            +  V GWN+G VR  G+ + F  G +  FE+P+S++SQ   +G+NE+ LEFH +D A  
Sbjct:  62 KDLCVKGWNWGTVRFGGQLLSFDIGEQPVFEIPLSNVSQCT-TGKNEVTLEFHQNDDA-- 118

Query: 162 TDECLVEMRFQAP-TEEDAI----ALHAELSARVGSASFMGESLVFFEELPFIVPRGRYD 216
            +  L+E+RF  P T+ED +    A    + ++       G+++  F EL  + PRGRYD
Sbjct: 119 -EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKADVIQATGDAICIFRELQCLTPRGRYD 177

Query: 217 LELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVL 276
           + ++PT+L +HGK+FDYKI Y TV R+FLLP  D+  +  VISLDPPI+QG T Y  L+L
Sbjct: 178 IRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKDQRQMFFVISLDPPIKQGQTRYHFLIL 237

Query: 277 QLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVT 336
              +D+D I + LNM  EE++KR+  +L   +SG L+++V++V++ LV++ +  P NF  
Sbjct: 238 LFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSGSLYEMVSRVMKALVNRKITVPGNFQG 296

Query: 337 SQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR-LEMDRRFDLQVV 395
             GA  +  +  A+ G LYPLE  F +V+KPP ++R++++ FV F R     R FD ++ 
Sbjct: 297 HSGAQCITCSYKASSGLLYPLERGFIYVHKPPVHIRFDEISFVNFARGTTTTRSFDFEIE 356

Query: 396 MLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGIPPALLRGATTGKTSMRAAAVRAEMAI 455
              G+   F+++ER E+  L+ F+ +K                  K +++   ++  M  
Sbjct: 357 TKQGTQYTFSSIEREEYGKLFDFVNAK------------------KLNIKNRGLKEGMKQ 398

Query: 456 AAEEAARREADGEDDEEADDQDTKYDGKRDPQTFDDEDD----EEDEDFAGDVEDESDGA 511
           + +E A  +   ED  +A  +  K +GK   +  +D  D    E DE F    ED+    
Sbjct: 399 SYDEYADSD---EDQHDAYLERMKEEGKIREENANDSSDGSGEETDESFNPGEEDDDVAE 455

Query: 512 PSDSDA 517
             DS+A
Sbjct: 456 EFDSNA 461


ref|NP_112383.1| Structure specific recognition protein 1 [Rattus norvegicus]
gb|AAH83588.1| Structure specific recognition protein 1 [Rattus norvegicus]
(709 aa)

Score: 265 bits (676), Expect: 4.36609e-69
Length: 483, Idn/Pos/Gap = 164/268/27 (33%/55%/5%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S+  G LRLS+ G+++     G    ++   +L    W R A G  L+   K+G    ++
Sbjct:  17 SMNDGRLRLSRQGIIFKNSKTGKVD-NIQAGELTEGIWRRVALGHGLKLLTKNGHVYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L ++ K  +     +   +  V GWN+G V+  G+ + F  G +  FE+P+
Sbjct:  76 GFRESEFEKLSDFFKTHYRLE--LMEKDLCVKGWNWGTVKFGGQLLSFDIGDQPVFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSARV 190
           S++SQ   +G+NE+ LEFH +D A   +  L+E+RF  P T+ED +    A    + ++ 
Sbjct: 134 SNVSQCT-TGKNEVTLEFHQNDDA---EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
                 G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  D
Sbjct: 190 DVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
           +  +  VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG
Sbjct: 250 QRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP +
Sbjct: 309 SLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVH 368

Query: 371 LRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMV-- 427
           +R++++ FV F R     R FD ++    G+   F+++ER E+  L+ F+ +K++ +   
Sbjct: 369 IRFDEISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKLFDFVNAKKLNIKNR 428

Query: 428 ----GIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK 483
               GI P     A + +    A   R +     EE   RE +  D   +DD   + D  
Sbjct: 429 GLKEGINPGYDDYADSDEDQHDAYLERMK-----EEGKIREENAND--SSDDSGEETDES 481

Query: 484 RDP 486
            +P
Sbjct: 482 FNP 484


dbj|BAB03170.1| structure-specific recognition protein 1 (HMG1 DNA-binding protein) [Arabidopsis thaliana]
gb|AAO00867.1| recombination signal sequence recognition protein, putative [Arabidopsis thaliana]
sp|Q05153|SSRP_ARATH Structure-specific recognition protein 1 homolog (HMG protein)
ref|NP_189515.1| structure-specific recognition protein 1 / high mobility group protein / HMG protein [Arabidopsis thaliana]
(646 aa)

Score: 264 bits (675), Expect: 4.59023e-69
Length: 433, Idn/Pos/Gap = 154/245/33 (35%/56%/7%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFR  78
           PG L+++  G+ W  +  G   + V R D+ S+ W +  + +QL    KDG    F GFR
Sbjct:  20 PGLLKINSGGIQWKKQGGG-KAVEVDRSDIVSVSWTKVTKSNQLGVKTKDGLYYKFVGFR  78

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQ-GVVGWNFGQVRIEGESVLFVSGSETAFELPISD 137
             D  +L  + + ++  T   P  +Q  V G N+G+V + G ++ F+ GS+ AFE+ ++D
Sbjct:  79 DQDVPSLSSFFQSSYGKT---PDEKQLSVSGRNWGEVDLHGNTLTFLVGSKQAFEVSLAD 135

Query: 138 ISQVVRSGRNELALEFHLDDTAGKTD-ECLVEMRFQAPTEE----------------DAI 180
           +SQ    G+N++ LEFH+DDTAG  + + L+E+ F  P                   D I
Sbjct: 136 VSQTQLQGKNDVTLEFHVDDTAGANEKDSLMEISFHIPNSNTQFVGDENRPPSQVFNDTI 195

Query: 181 ALHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTV 240
              A++S  V  A      +V FE +  + PRGRY++EL  ++L + G++ D+KI Y +V
Sbjct: 196 VAMADVSPGVEDA------VVTFESIAILTPRGRYNVELHLSFLRLQGQANDFKIQYSSV 249

Query: 241 RRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRY 300
            R+FLLPK ++ H  +VISLDPPIR+G T YPH+V+Q    D  +E  L++  E +  ++
Sbjct: 250 VRLFLLPKSNQPHTFVVISLDPPIRKGQTMYPHIVMQF-ETDTVVESELSISDELMNTKF 308

Query: 301 GDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENC 360
            DKL     G + +V T VLR L    +  P  F +SQ   A++++L A DG LYPLE  
Sbjct: 309 KDKLERSYKGLIHEVFTTVLRWLSGAKITKPGKFRSSQDGFAVKSSLKAEDGVLYPLEKG 368

Query: 361 FFFVNKPPTYLRYEDVDFVEFKRLEMD----RRFDLQVVMLNGSTLLFTNLERSEFSTLY 416
           FFF+ KPPT + ++++D+VEF+R          FDL + +      LF N++R+E+  LY
Sbjct: 369 FFFLPKPPTLILHDEIDYVEFERHAAGGANMHYFDLLIRLKTDHEHLFRNIQRNEYHNLY 428

Query: 417 QFLESKQVRMVGI 429
            F+ SK ++++ +
Sbjct: 429 TFISSKGLKIMNL 441


gb|AAM46895.1| early drought induced protein [Oryza sativa (indica cultivar-group)]
(641 aa)

Score: 262 bits (669), Expect: 2.78318e-68
Length: 443, Idn/Pos/Gap = 157/251/24 (35%/56%/5%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFR  78
           PG  ++   GL W  R  G  TI + + DL S+ W +  R  QL    KDG    F GFR
Sbjct:  20 PGQFKVYSGGLAWK-RQGGGKTIEIEKSDLTSVTWMKVPRAYQLGVRTKDGLFYKFIGFR  78

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQ-GVVGWNFGQVRIEGESVLFVSGSETAFELPISD 137
             D ++L  ++++    +   P  +Q  V G N+G + I G  + F+ GS+ AFE+ ++D
Sbjct:  79 EQDVSSLTNFMQKNMGLS---PDEKQLSVSGQNWGGIDINGNMLTFMVGSKQAFEVSLAD 135

Query: 138 ISQVVRSGRNELALEFHLDDTAGKTD-ECLVEMRFQAPTEEDA-IALHAELSARVGSASF 195
           +SQ    G+ ++ LEFH+DDT G  + + L+++ F  PT     +      +A+V   + 
Sbjct: 136 VSQTQMQGKTDVLLEFHVDDTTGGNEKDSLMDLSFHVPTSNTQFLGDENRTAAQVLWETI 195

Query: 196 MG--------ESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLP 247
           MG        E++V FE +  + PRGRY +EL  ++L + G++ D+KI Y ++ R+FLLP
Sbjct: 196 MGVADVDSSEEAVVTFEGIAILTPRGRYSVELHLSFLRLQGQANDFKIQYSSIVRLFLLP 255

Query: 248 KPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPK 307
           K +  H  +V++LDPPIR+G T YPH+V+Q    +  +E  L +  E L ++Y D+L   
Sbjct: 256 KSNNPHTFVVVTLDPPIRKGQTLYPHIVIQF-ETEAVVERNLALTKEVLAEKYKDRLEES 314

Query: 308 ISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKP 367
             G + +V TKVLR L    +  P +F + Q  +A++++L A DG LYPLE  FFF+ KP
Sbjct: 315 YKGLIHEVFTKVLRGLSGAKVTRPGSFRSCQDGYAVKSSLKAEDGLLYPLEKGFFFLPKP 374

Query: 368 PTYLRYEDVDFVEFKRLEM------DRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLES 421
           PT + +E+++FVEF+R            FDL V + N    LF N++RSE+  L+ F+  
Sbjct: 375 PTLILHEEIEFVEFERHGAGGASISSHYFDLLVKLKNDQEHLFRNIQRSEYHNLFNFING 434

Query: 422 KQVRMVGIPPALLRGATTGKTSM 444
           K ++++ +     +GAT G T++
Sbjct: 435 KHLKIMNLGDG--QGATGGVTAV 455


ref|NP_914495.1| putative SSRP1 protein [Oryza sativa (japonica cultivar-group)]
dbj|BAB03358.1| putative SSRP1 protein [Oryza sativa (japonica cultivar-group)]
(641 aa)

Score: 261 bits (668), Expect: 3.42873e-68
Length: 443, Idn/Pos/Gap = 157/251/24 (35%/56%/5%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFR  78
           PG  ++   GL W  R  G  TI + + DL S+ W +  R  QL    KDG    F GFR
Sbjct:  20 PGQFKVYSGGLAWK-RQGGGKTIEIEKSDLTSVTWMKVPRAYQLGVRTKDGLFYKFIGFR  78

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQ-GVVGWNFGQVRIEGESVLFVSGSETAFELPISD 137
             D ++L  ++++    +   P  +Q  V G N+G + I G  + F+ GS+ AFE+ ++D
Sbjct:  79 EQDVSSLTNFMQKNMGLS---PDEKQLSVSGQNWGGIDINGNMLTFMVGSKQAFEVSLAD 135

Query: 138 ISQVVRSGRNELALEFHLDDTAGKTD-ECLVEMRFQAPTEEDA-IALHAELSARVGSASF 195
           +SQ    G+ ++ LEFH+DDT G  + + L+++ F  PT     +      +A+V   + 
Sbjct: 136 VSQTQMQGKTDVLLEFHVDDTTGGNEKDSLMDLSFHVPTSNTQFLGDENRTAAQVLWETI 195

Query: 196 MG--------ESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLP 247
           MG        E++V FE +  + PRGRY +EL  ++L + G++ D+KI Y ++ R+FLLP
Sbjct: 196 MGVADVDSSEEAVVTFEGIAILTPRGRYSVELHLSFLRLQGQANDFKIQYSSIVRLFLLP 255

Query: 248 KPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPK 307
           K +  H  +V++LDPPIR+G T YPH+V+Q    +  +E  L +  E L ++Y D+L   
Sbjct: 256 KSNNPHTFVVVTLDPPIRKGQTLYPHIVIQF-ETEAVVERNLALTKEVLAEKYKDRLEES 314

Query: 308 ISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKP 367
             G + +V TKVLR L    +  P +F + Q  +A++++L A DG LYPLE  FFF+ KP
Sbjct: 315 YKGLIHEVFTKVLRGLSGAKVTRPGSFRSCQDGYAVKSSLKAEDGLLYPLEKGFFFLPKP 374

Query: 368 PTYLRYEDVDFVEFKRLEM------DRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLES 421
           PT + +E+++FVEF+R            FDL V + N    LF N++RSE+  L+ F+  
Sbjct: 375 PTLILHEEIEFVEFERHGAGGASISSHYFDLLVKLKNDQEHLFRNIQRSEYHNLFNFING 434

Query: 422 KQVRMVGIPPALLRGATTGKTSM 444
           K ++++ +     +GAT G T++
Sbjct: 435 KHLKIMNLGDG--QGATGGVTAV 455


gb|AAH77083.1| Unknown (protein for IMAGE:7140964) [Danio rerio]
(543 aa)

Score: 259 bits (662), Expect: 1.50148e-67
Length: 529, Idn/Pos/Gap = 170/275/42 (32%/51%/7%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNF  74
           S   G LR SK  +++     G + TI V   +L   +W R   G  ++     G    +
Sbjct:  17 SWNDGRLRFSKQTVVYKNSKTGKVDTIPVP--ELTQAQWRRVCLGHGIKLWTSTGHIYKY  74

Query:  75 EGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELP 134
           +GF+ AD   + EY K+ +     +   +  V GWN+G  +  G  + F       FE+P
Sbjct:  75 DGFKDADLEKISEYFKDNYKVE--LTEKDMCVKGWNWGTAKFNGPLLSFDVNDSPTFEIP 132

Query: 135 ISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPTEE--------DAIALHAEL 186
           +S +SQ   +G+NE+ +EFH +D    T+  L+E+RF  P           +A A +   
Sbjct: 133 LSSVSQCT-TGKNEVTVEFHQND---DTEVSLMEVRFYVPPTTGDEGSDPVEAFAQNVLS 188

Query: 187 SARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLL 246
            A V  A+  G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLL
Sbjct: 189 KADVIQAT--GDAVCIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLL 246

Query: 247 PKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPP 306
           P  D+  +  VISLDPPI+QG T Y H ++ L   ++ I + LNM  +E+++R+  KL  
Sbjct: 247 PHKDQRQMFFVISLDPPIKQGQTRY-HFLILLFSKEETISLTLNMNEDEVERRFEGKLNK 305

Query: 307 KISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNK 366
            +SG L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+K
Sbjct: 306 NMSGSLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHK 365

Query: 367 PPTYLRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVR 425
           PP +LR+E++  V F R     R FD ++    G+   F+++ER E+  L+ F+ +K   
Sbjct: 366 PPVHLRFEEIACVNFARGTTTTRSFDFEIETKQGNQYTFSSIEREEYGKLFDFVNAK--- 422

Query: 426 MVGIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK-- 483
                          K S++    + +  +   +    ++D ED  +A  +  K +GK  
Sbjct: 423 ---------------KLSIKNRGFKEKKGMKGNDDMYSDSD-EDQHDAYLERMKEEGKIR 466

Query: 484 RDPQTFDDEDDEEDEDFAGDVEDESDGAPSDSDAARDDDDDDDDDDDDD 532
            +    DD + E DE F    EDE      DS A+  +   ++ D D+D
Sbjct: 467 EEGNDSDDSEGESDESFNPGEEDEDIAEEYDSKASASESSAEEGDSDED 515


gb|EAL24655.1| GA18454-PA [Drosophila pseudoobscura]
(722 aa)

Score: 257 bits (657), Expect: 6.91266e-67
Length: 417, Idn/Pos/Gap = 151/242/16 (36%/58%/3%)

Query:  17 LGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           L  G L+L+   +++     G +  ISV   DL + + F G  G  LR   K G+   F 
Sbjct:  18 LSSGRLKLTDQNIIFKNNKTGKVEQISVDDIDLINSQKFVGTWG--LRVFTKSGALHRFT  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L +++K+A+  ++ +   E  V GWN+G  R  G  + F   S+T FE+P+
Sbjct:  76 GFRDSEHEKLGKFIKDAY--SQEMVEKEMCVKGWNWGTARFMGSVLSFDKDSKTIFEVPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPTEEDAI-----ALHAELSARV 190
           S +SQ V +G+NE+ LE+H +D A      L+EMRF  P  E A        H  + ++ 
Sbjct: 134 SHVSQCV-TGKNEVTLEYHQNDDAPVG---LLEMRFHIPAVESADDDPVEKFHQNVMSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
              S  GES+  F E+  + PRGRYD+++F T+  +HGK+FDYKI   +V R+F+LP  D
Sbjct: 190 SVISASGESIAIFREIQILTPRGRYDIKIFSTFFQLHGKTFDYKIPMDSVLRLFMLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
              +  V+SLDPPI+QG T Y +LVL L   D+E  + L     EL+ +Y  KL  ++SG
Sbjct: 250 SRQMFFVLSLDPPIKQGQTRYHYLVL-LFAPDEETTIELPFSEAELRDKYEGKLEKELSG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            +++V+ KV++VL+ + +  P NF+   G  A+  +  A  GYLYPLE  F +++KPP +
Sbjct: 309 PVYEVMGKVMKVLIGRKITGPGNFIGHSGTAAVGCSFKAAAGYLYPLERGFIYIHKPPLH 368

Query: 371 LRYEDVDFVEFKRL-EMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
           +R+E++  V F R     R FD +V + NG+  +F+++E+ E++ L+ F+  K++ +
Sbjct: 369 IRFEEISSVNFARSGGSTRSFDFEVTLKNGTVHIFSSIEKEEYAKLFDFITQKKLHV 425


gb|AAO45187.1| SD06504p [Drosophila melanogaster]
(723 aa)

Score: 257 bits (656), Expect: 8.30549e-67
Length: 417, Idn/Pos/Gap = 151/242/16 (36%/58%/3%)

Query:  17 LGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           L  G L++++  +++     G +  IS    DL + + F G  G  LR   K G    F 
Sbjct:  18 LCSGRLKMTEQNIIFKNTKTGKVEQISAEDIDLINSQKFVGTWG--LRVFTKGGVLHRFT  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L +++K A+  ++ +   E  V GWN+G  R  G  + F   S+T FE+P+
Sbjct:  76 GFRDSEHEKLGKFIKAAY--SQEMVEKEMCVKGWNWGTARFMGSVLSFDKESKTIFEVPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPT----EEDAI-ALHAELSARV 190
           S +SQ V +G+NE+ LEFH +D A      L+EMRF  P     EED +   H  + ++ 
Sbjct: 134 SHVSQCV-TGKNEVTLEFHQNDDAPVG---LLEMRFHIPAVESAEEDPVDKFHQNVMSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
              S  GES+  F E+  + PRGRYD+++F T+  +HGK+FDYKI   +V R+F+LP  D
Sbjct: 190 SVISASGESIAIFREIQILTPRGRYDIKIFSTFFQLHGKTFDYKIPMDSVLRLFMLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
              +  V+SLDPPI+QG T Y +LVL L   D+E  + L     EL+ +Y  KL  +ISG
Sbjct: 250 SRQMFFVLSLDPPIKQGQTRYHYLVL-LFAPDEETTIELPFSEAELRDKYEGKLEKEISG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            +++V+ KV++VL+ + +  P NF+   G  A+  +  A  GYLYPLE  F +++KPP +
Sbjct: 309 PVYEVMGKVMKVLIGRKITGPGNFIGHSGTAAVGCSFKAAAGYLYPLERGFIYIHKPPLH 368

Query: 371 LRYEDVDFVEFKRL-EMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
           +R+E++  V F R     R FD +V + NG+  +F+++E+ E++ L+ ++  K++ +
Sbjct: 369 IRFEEISSVNFARSGGSTRSFDFEVTLKNGTVHIFSSIEKEEYAKLFDYITQKKLHV 425


ref|NP_523830.2| CG4817-PA [Drosophila melanogaster]
emb|CAA48471.1| SSRP1 [Drosophila melanogaster]
gb|AAF47064.1| CG4817-PA [Drosophila melanogaster]
sp|Q05344|SSRP_DROME Single-strand recognition protein (SSRP) (Chorion-factor 5)
(723 aa)

Score: 257 bits (656), Expect: 8.30549e-67
Length: 417, Idn/Pos/Gap = 151/242/16 (36%/58%/3%)

Query:  17 LGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           L  G L++++  +++     G +  IS    DL + + F G  G  LR   K G    F 
Sbjct:  18 LCSGRLKMTEQNIIFKNTKTGKVEQISAEDIDLINSQKFVGTWG--LRVFTKGGVLHRFT  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L +++K A+  ++ +   E  V GWN+G  R  G  + F   S+T FE+P+
Sbjct:  76 GFRDSEHEKLGKFIKAAY--SQEMVEKEMCVKGWNWGTARFMGSVLSFDKESKTIFEVPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPT----EEDAI-ALHAELSARV 190
           S +SQ V +G+NE+ LEFH +D A      L+EMRF  P     EED +   H  + ++ 
Sbjct: 134 SHVSQCV-TGKNEVTLEFHQNDDAPVG---LLEMRFHIPAVESAEEDPVDKFHQNVMSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
              S  GES+  F E+  + PRGRYD+++F T+  +HGK+FDYKI   +V R+F+LP  D
Sbjct: 190 SVISASGESIAIFREIQILTPRGRYDIKIFSTFFQLHGKTFDYKIPMDSVLRLFMLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
              +  V+SLDPPI+QG T Y +LVL L   D+E  + L     EL+ +Y  KL  +ISG
Sbjct: 250 SRQMFFVLSLDPPIKQGQTRYHYLVL-LFAPDEETTIELPFSEAELRDKYEGKLEKEISG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            +++V+ KV++VL+ + +  P NF+   G  A+  +  A  GYLYPLE  F +++KPP +
Sbjct: 309 PVYEVMGKVMKVLIGRKITGPGNFIGHSGTAAVGCSFKAAAGYLYPLERGFIYIHKPPLH 368

Query: 371 LRYEDVDFVEFKRL-EMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
           +R+E++  V F R     R FD +V + NG+  +F+++E+ E++ L+ ++  K++ +
Sbjct: 369 IRFEEISSVNFARSGGSTRSFDFEVTLKNGTVHIFSSIEKEEYAKLFDYITQKKLHV 425


gb|AAB19500.2| HMG1-related DNA-binding protein [Mus sp.]
sp|Q08943|SSRP_MOUSE Structure-specific recognition protein 1 (SSRP1) (Recombination signal sequence recognition protein) (T160)
(708 aa)

Score: 256 bits (654), Expect: 1.29246e-66
Length: 483, Idn/Pos/Gap = 162/266/27 (33%/55%/5%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S+  G LRLS +G+++     G    ++   +L    W R A G  L+   K+G    ++
Sbjct:  17 SMNDGRLRLSPSGIIFKNSKTGKVD-NIQAGELTEGIWPRVALGHGLKLLTKNGHVYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L ++ K  +     +   +  V GWN+G V+  G+ + F  G +  FE+P+
Sbjct:  76 GFRESEFEKLSDFFKTHYRLE--LMEKDLCVKGWNWGTVKFGGQLLSFDIGDQPVFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSARV 190
           S++S V ++ R E+ LEFH +D     +  L+E+RF  P T+ED +    A    + ++ 
Sbjct: 134 SNVSSVPQA-RIEVTLEFHQND---DPEVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
                 G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  D
Sbjct: 190 DVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
           +  +  VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG
Sbjct: 250 QRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP +
Sbjct: 309 SLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVH 368

Query: 371 LRYEDVDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMV-- 427
           +R++++ FV F R     R FD ++    G+   F+++ER E+  L+ F+ +K++ +   
Sbjct: 369 IRFDEISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKLFDFVNAKKLNIKNR 428

Query: 428 ----GIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK 483
               GI P     A + +    A   R +     EE   RE +  D   +DD   + D  
Sbjct: 429 GLKEGINPGYDDYADSDEDQHDAYLERMK-----EEGKIREENAND--SSDDSGEETDES 481

Query: 484 RDP 486
            +P
Sbjct: 482 FNP 484


gb|AAA28914.1| single-stranded recognition protein
(723 aa)

Score: 254 bits (648), Expect: 6.9726e-66
Length: 417, Idn/Pos/Gap = 150/241/16 (35%/57%/3%)

Query:  17 LGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           L  G L++++  +++     G +  IS    DL + + F G  G  LR   K G    F 
Sbjct:  18 LCSGRLKMTEQNIIFENTKTGKVEQISAEDIDLINSQKFVGTWG--LRVFTKGGVLHRFT  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR ++   L +++K A+  ++ +   E  V GWN+G  R  G  + F   S+T FE+P+
Sbjct:  76 GFRDSEHEKLGKFIKAAY--SQEMVEKEMCVKGWNWGTARFMGSVLSFDKESKTIFEVPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPT----EEDAI-ALHAELSARV 190
           S +SQ V +G+NE+ LEFH +D A      L+EMRF  P     EED +   H  + ++ 
Sbjct: 134 SHVSQCV-TGKNEVTLEFHQNDDAPVG---LLEMRFHIPAVESAEEDPVDKFHQNVMSKA 189

Query: 191 GSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPD 250
              S  GES+  F E+  + PR RYD+++F T+  +HGK+FDYKI   +V R+F+LP  D
Sbjct: 190 SVISASGESIAIFREIQILTPRRRYDIKIFSTFFQLHGKTFDYKIPMDSVLRLFMLPHKD 249

Query: 251 EIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISG 310
              +  V+SLDPPI+QG T Y +LVL L   D+E  + L     EL+ +Y  KL  +ISG
Sbjct: 250 SRQMFFVLSLDPPIKQGQTRYHYLVL-LFAPDEETTIELPFSEAELRDKYEGKLEKEISG 308

Query: 311 ELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTY 370
            +++V+ KV++VL+ + +  P NF+   G  A+  +  A  GYLYPLE  F +++KPP +
Sbjct: 309 PVYEVMGKVMKVLIGRKITGPGNFIGHSGTAAVGCSFKAAAGYLYPLERGFIYIHKPPLH 368

Query: 371 LRYEDVDFVEFKRL-EMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
           +R+E++  V F R     R FD +V + NG+  +F+++E+ E++ L+ ++  K++ +
Sbjct: 369 IRFEEISSVNFARSGGSTRSFDFEVTLKNGTVHIFSSIEKEEYAKLFDYITQKKLHV 425


ref|XP_508428.1| PREDICTED: similar to structure specific recognition protein 1; recombination signal sequence recognition protein; chromatin-specific transcription elongation factor 80 kDa subunit; high mobility group box; facilitates chromatin remodeling 80 kDa subunit; cis... [Pan troglodytes]
(977 aa)

Score: 253 bits (647), Expect: 9.89892e-66
Length: 386, Idn/Pos/Gap = 142/228/14 (36%/59%/3%)

Query:  15 QSLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNF  74
           +SL  G LRLS+ G+++     G    ++   +L    W R A G  L+   K+G    +
Sbjct:  16 ESLNDGRLRLSRQGIIFKNSKTGKVD-NIQAGELTEGIWRRVALGHGLKLLTKNGHVYKY  74

Query:  75 EGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELP 134
           +GFR ++   L ++ K  +     +   +  V GWN+G V+  G+ + F  G +  FE+P
Sbjct:  75 DGFRESEFEKLSDFFKTHYRLE--LMEKDLCVKGWNWGTVKFGGQLLSFDIGDQPVFEIP 132

Query: 135 ISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSAR 189
           +S++SQ   +G+NE+ LEFH +D A   +  L+E+RF  P T+ED +    A    + ++
Sbjct: 133 LSNVSQCT-TGKNEVTLEFHQNDDA---EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSK 188

Query: 190 VGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKP 249
                  G+++  F EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  
Sbjct: 189 ADVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHK 248

Query: 250 DEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKIS 309
           D+  +  VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +S
Sbjct: 249 DQRQMFFVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMS 307

Query: 310 GELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPT 369
           G L+++V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP 
Sbjct: 308 GSLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPV 367

Query: 370 YLRYEDVDFVEFKR-LEMDRRFDLQV 394
           ++R++++ FV F       R FDL++
Sbjct: 368 HIRFDEISFVNFAPGTTTTRSFDLEI 393


emb|CAA82251.1| HMG protein [Catharanthus roseus]
sp|Q39601|SSRP_CATRO Structure-specific recognition protein 1 homolog (HMG protein)
(639 aa)

Score: 249 bits (637), Expect: 1.4294e-64
Length: 545, Idn/Pos/Gap = 177/285/60 (32%/52%/11%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFR  78
           PG LR+   G+LW  +  G   + V + D+  L W +  R +QL    KDG    F GFR
Sbjct:  20 PGQLRVHSGGILWK-KQGGAKAVEVDKSDMVGLTWMKVPRSNQLGVRIKDGLFYKFTGFR  78

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQ-GVVGWNFGQVRIEGESVLFVSGSETAFELPISD 137
             D A+L  Y++     T   P  +Q  V G N+G+V + G  + F+ GS+ AFE+ ++D
Sbjct:  79 DQDVASLTSYLQSTCGIT---PEEKQLSVSGKNWGEVDLNGNMLTFLVGSKQAFEVSLAD 135

Query: 138 ISQVVRSGRNELALEFH---LDDTAGKTDECLVEMRFQAPTEED----------AIALHA 184
           ++Q    G+N++ LEF    L +   K    L+E+ F  P              A     
Sbjct: 136 VAQTQLQGKNDVMLEFMWMILLEQMRKNS--LMEISFHVPNSNTQFVGDENRPPAQVFRD 193

Query: 185 ELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMF 244
           ++ +     +   +++V FE +  + PRGRY++EL  ++L + G++ D+KI Y +V R+F
Sbjct: 194 KIMSMADVGAGGEDAVVTFEGIAILTPRGRYNVELHLSFLRLQGQANDFKIQYSSVVRLF 253

Query: 245 LLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKL 304
           LLPK ++ H  +V++LDPPIR+G T YPH+VLQ    D  ++ +L++  + L  +Y DKL
Sbjct: 254 LLPKSNQPHTFVVVTLDPPIRKGQTLYPHIVLQF-ETDYVVDSSLSISEDLLSTKYKDKL 312

Query: 305 PPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFV 364
            P   G + +V T +LR L    +  P  F + Q  +A++++L A DG LYPLE  FFF+
Sbjct: 313 EPTYKGLIHEVFTMILRGLSGAKVTRPGKFRSCQDGYAVKSSLKAEDGVLYPLEKSFFFL 372

Query: 365 NKPPTYLRYEDVDFVEFKRLEMD----RRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLE 420
            KPPT + +E++D+VEF+R          FDL + +      LF N++R+E+  L+ F+ 
Sbjct: 373 PKPPTLILHEEIDYVEFERHAAGGSNMHYFDLLIRLKTEQEHLFRNIQRNEYHNLFDFIS 432

Query: 421 SKQVRMVGIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKY 480
           SK +++                 M   A +A  AI A          EDD++A D   + 
Sbjct: 433 SKGLKI-----------------MNLGADKAADAITA-------VLQEDDDDAVDPHLE- 467

Query: 481 DGKRDPQTFDDEDDEEDEDFAGDVEDESDGAPSDSDAARDDDDDDDDDDDDDDDEALEAE 540
             +   +   DE DEEDEDF  D++DE  G+P+      DD  + + D  D  +E +  +
Sbjct: 468 --RIKNEAGGDESDEEDEDFVADIDDE--GSPT------DDSGEGESDGSDSGNEEIPTK 517

Query: 541 LEPEE 545
            +P++
Sbjct: 518 KKPKK 522


gb|AAU44310.1| putative HMG-box with DNAbinding protein [Oryza sativa (japonica cultivar-group)]
gb|AAW57821.1| putative HMG-box with DNAbinding protein [Oryza sativa (japonica cultivar-group)]
(640 aa)

Score: 248 bits (632), Expect: 5.16649e-64
Length: 446, Idn/Pos/Gap = 153/249/30 (34%/55%/6%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFR  78
           PG  +L   GL W  R  G  TI V + D+ S+ W    R  QL  + K+G    F GFR
Sbjct:  20 PGQFKLYSGGLAWK-RQGGGKTIEVEKSDITSVTWMAIPRSYQLGVSTKEGLFYRFFGFR  78

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQ-GVVGWNFGQVRIEGESVLFVSGSETAFELPISD 137
             D ++L  ++++    T   P  +Q  V G N+G + I G  + F  GS+ AFE+ ++D
Sbjct:  79 EQDISSLTNFMEKNMRIT---PEEKQLSVGGHNWGGIEINGNMLSFNVGSKEAFEVSLAD 135

Query: 138 ISQVVRSGRNELALEFHLDDTAGKTD-ECLVEMRFQAPTEED-----------AIALHAE 185
           ++Q    G+ ++ LEFH+DDT G  + + L+++ F  PT               +   A 
Sbjct: 136 VAQTQMQGKTDVVLEFHVDDTTGGNEKDSLMDLSFHVPTSNTQFPGDENRPSAQVLWQAI 195

Query: 186 LS-ARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMF 244
           L+ A VGS+    E++V F+ +  + PRGRY +EL  ++L + G++ D+KI Y ++ R+F
Sbjct: 196 LNKADVGSSE---EAVVTFDGIAILTPRGRYSVELHLSFLRLQGQANDFKIQYSSILRLF 252

Query: 245 LLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKL 304
           +LPK +  H  +VI+LDPPIR+G T YPH+V+Q    +  ++  L +  E L ++Y D+L
Sbjct: 253 VLPKSNNPHTFVVITLDPPIRKGQTLYPHIVIQF-ETEAVVQRDLTLSDEVLAEKYKDRL 311

Query: 305 PPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFV 364
                G + +V +KVLR L    +  P  F + Q  +A++++L A DG LYPLE  FFF+
Sbjct: 312 ENSYQGLIHEVFSKVLRGLSGAKVTRPSTFRSCQDGYAVKSSLKAEDGLLYPLEKGFFFL 371

Query: 365 NKPPTYLRYEDVDFVEFKRLEM------DRRFDLQVVMLNGSTLLFTNLERSEFSTLYQF 418
            KPPT + +E++++VEF+R            FDL V + N    LF N++R+E+  L+ F
Sbjct: 372 PKPPTLILHEEIEYVEFERHGAGGASISSHYFDLLVKLKNDQEHLFRNIQRNEYHNLFNF 431

Query: 419 LESKQVRMVGIPPALLRGATTGKTSM 444
           +  K ++++ +  A  +G   G T++
Sbjct: 432 ISGKHLKILNLGEA--QGRAGGVTAV 455


emb|CAG04782.1| unnamed protein product [Tetraodon nigroviridis]
(669 aa)

Score: 248 bits (632), Expect: 5.20977e-64
Length: 462, Idn/Pos/Gap = 145/238/59 (31%/51%/12%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFE  75
           S   G LR SK  +++     G    S+   +L   +W R   G  ++     G    ++
Sbjct:  17 SWNDGRLRFSKQNVVYKSSKTGKVD-SIPAGELNLAQWRRVCLGHGIKLGTSTGHIYKYD  75

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GFR  D   + E+ K  +     +   +  V GWN+G  +  G  + F     TAFE+P+
Sbjct:  76 GFRDTDFEKISEFFKANYKVE--LTEKDMSVKGWNWGTAKFSGPLLQFDINENTAFEIPL 133

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRF----------QAPTEEDAIALHAE 185
           S++SQ   +G+NE+ LEFH +D    T+  L+E+RF          Q P E+   A    
Sbjct: 134 SNVSQCA-TGKNEVTLEFHQND---DTEISLMEVRFYVPPNQTDERQDPVEDSPQAFAQN 189

Query: 186 LSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFL 245
           + ++       G+++  F+EL  + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FL
Sbjct: 190 VLSKADVIQATGDAVCIFKELQCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFL 249

Query: 246 LPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLP 305
           LP  D+  +  VISLDPPI+QG T Y  L+L   ++++ I +ALNM  E++++R+  KL 
Sbjct: 250 LPHKDQRQMFFVISLDPPIKQGQTRYHFLILLFSKEEN-INLALNMSEEDVERRFEGKLS 308

Query: 306 PKISGELWQVVTKVLRVLVDKPLHAPKNFV------------------------------ 335
             +SG L+++V++V++ LV++ +  P NF                               
Sbjct: 309 KHMSGSLYEMVSRVMKALVNRKITVPGNFQGYVSNPANKTLVWLGGVCVCVCVCVGGYLQ 368

Query: 336 ----------TSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR-L 384
                     +  GA  +  +  A+ G LYPLE  F +V+KPP +LR+E++  V F R  
Sbjct: 369 AVFKCCSFFRSHSGAQCITCSFKASSGLLYPLERGFIYVHKPPVHLRFEEISCVNFARGT 428

Query: 385 EMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
              R FD ++    G+   F+++ER E+  L+ F+ +K++ +
Sbjct: 429 TTTRSFDFEIETKQGNQYTFSSIEREEYGKLFDFVNAKKLNI 470


emb|CAA22834.1| SPBC609.05 [Schizosaccharomyces pombe]
ref|NP_596315.1| similar to yeast POB3 protein that binds to DNA polymerase I; putative structure specific recognition protein [Schizosaccharomyces pombe]
pir||T40576 probable structure recognition/chromatin-associated HMG protein - fission yeast (Schizosaccharomyces pombe)
(512 aa)

Score: 247 bits (631), Expect: 7.09404e-64
Length: 442, Idn/Pos/Gap = 154/245/30 (34%/55%/6%)

Query:   8 DPVYVTDQSLGPGSLRLSKAGLLW-SPRNEGLYTISVAREDLASLRWFRGARGDQLRCTR  66
           D +Y+ + S  PG LR++ +GL W SP     +T+ ++  ++    W R ARG +L+   
Sbjct:   9 DNIYL-NLSEKPGKLRIAPSGLGWKSPSLAEPFTLPIS--EIRRFCWSRFARGYELKIIL  65

Query:  67 KDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSG 126
           K    V+ +GF   D   L   +K+ +    G+ + E  + GWN+G+    G  ++F   
Sbjct:  66 KSKDPVSLDGFSQEDLDDLINVIKQNF--DMGIEQKEFSIKGWNWGEANFLGSELVFDVN 123

Query: 127 SETAFELPISDISQVVRSGRNELALEFHLDDT----AGKTDECLVEMRFQAP-------- 174
           S  AFE+PIS ++    SG+NE+ALEF   D     + + DE LVEMR   P        
Sbjct: 124 SRPAFEIPISAVTNTNLSGKNEVALEFSTTDDKQIPSAQVDE-LVEMRLYVPGTTAKEDA 182

Query: 175 -----TEEDAIALHAE-LSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHG 228
                 E++A  L  E L  R       G+++V F E+  + PRGRYD++++ T + + G
Sbjct: 183 ADGEEVEQNAANLFYESLKERADIGQAAGDAIVSFSEILLLTPRGRYDIDMYETCMRLRG 242

Query: 229 KSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVA 288
           K++DYK+ Y ++  +FLLPKPDE H+  VI L+PP+RQG T YP LV Q  RD+D +EV 
Sbjct: 243 KTYDYKVEYSSINSLFLLPKPDEQHVVFVIGLEPPLRQGQTRYPFLVTQFVRDED-MEVD 301

Query: 289 LNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALG 348
           LN++   L+++Y DK+        ++VV+++ R L  + +  P  F++ +G  A++ +  
Sbjct: 302 LNIEETVLKEKYADKVKASYDQPAFEVVSQIFRGLTGRKVTTPAEFLSHEGHAAVKCSYK 361

Query: 349 ANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKRLEMD----RRFDLQVVMLNGSTLLF 404
           AN+G LY L+  F F+ KP   +   D+  V   R+ M     R FDL   + +G++  F
Sbjct: 362 ANEGQLYCLDKSFLFIPKPTLLMNTSDITRVTLSRVGMSVSAARTFDLTFTLRSGTSYQF 421

Query: 405 TNLERSEFSTLYQFLESKQVRM 426
           +N+ R E S L  FLESKQ+++
Sbjct: 422 SNINRVEQSALVAFLESKQIKI 443


gb|EAA00360.2| ENSANGP00000009009 [Anopheles gambiae str. PEST]
ref|XP_320213.2| ENSANGP00000009009 [Anopheles gambiae str. PEST]
(634 aa)

Score: 246 bits (629), Expect: 1.13193e-63
Length: 516, Idn/Pos/Gap = 161/278/40 (31%/53%/7%)

Query:  16 SLGPGSLRLSKAGLLWSPRNEG-LYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNF  74
           ++ PG L+++   +++     G +  I+ +  +L + + F G+ G  LR   K+GS   F
Sbjct:  17 AMCPGKLKMTDTAMVFKSDKTGKVEQINSSDIELLNYQRFVGSFG--LRVFLKNGSLHRF  74

Query:  75 EGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELP 134
            GF   D A + E+VK+ +     +   E  + GWN+G V+ +G  + F   ++T+FE+P
Sbjct:  75 LGF-TGDEAKIAEFVKKNYKLD--MLEKELSMRGWNWGSVQFKGAVLSFDVENKTSFEIP 131

Query: 135 ISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPTEEDA------IALHAELSA 188
           ++ +SQ    G+NE+ +EFH +D A  +   L+EMRF  PT E A       A    +  
Sbjct: 132 LNHVSQC-NVGKNEVTVEFHRNDDAPVS---LMEMRFHIPTSESAGDVDPVEAFQENVMK 187

Query: 189 RVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPK 248
           +    S  G+++  F E+  + PRGRYD+++F ++  +HGK++D+KI   +V R+FLLP 
Sbjct: 188 QASVISVSGDAIAIFREIHCLTPRGRYDIKVFQSFFQLHGKTYDFKIPTSSVLRLFLLPH 247

Query: 249 PDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKI 308
            D   +  VISLDPPI+QG T Y H ++ L + D+E  + L    EEL+++Y DKL  ++
Sbjct: 248 KDNRQMFFVISLDPPIKQGQTRY-HFLVTLFQMDEETNIELPFTEEELKEKYEDKLTKEL 306

Query: 309 SGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPP 368
           SG +++V+ K+++V++++ L  P  F+   G  A+  +  A  GYLYPLE  F +V+KPP
Sbjct: 307 SGPVYEVLGKIMKVIINRKLTGPGTFIGHSGTPAIGCSFKAAAGYLYPLERGFIYVHKPP 366

Query: 369 TYLRYEDVDFVEFKRL-EMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMV 427
            ++R+E++  V F R     R FD ++ +  G+   F+++E+ E+S L+ F+ SK++ + 
Sbjct: 367 VHIRFEEISTVNFARSGGSTRSFDFEIELKTGTVHTFSSIEKEEYSKLFDFIVSKKLNVK 426

Query: 428 GIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGKRDPQ 487
                     T GK S +            ++ A  + +GE D        +   + +  
Sbjct: 427 N---------TGGKASYK------------DDFADSDNEGEPDAYLARVKAEAKERDEDD 465

Query: 488 TFDDEDDEEDEDFAGDVEDESDGAPSDSDAARDDDD 523
              D ++  DEDF  + + ESD A      A+   D
Sbjct: 466 DGSDSEESTDEDFNPN-QQESDVAEEKKSTAKKSSD 500


emb|CAB96421.1| SSRP1 protein [Zea mays]
(639 aa)

Score: 246 bits (628), Expect: 1.50322e-63
Length: 431, Idn/Pos/Gap = 150/239/28 (34%/55%/6%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFR  78
           PG  ++   GL W  R  G  TI + + D+ ++ W +  R  QL    K G    F GFR
Sbjct:  20 PGQFKVHSGGLAWK-RQGGGKTIEIDKADVTAVTWMKVPRAYQLGVRIKAGLFYRFIGFR  78

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQ-GVVGWNFGQVRIEGESVLFVSGSETAFELPISD 137
             D + L  ++++    T   P  +Q  V G N+G + I+G  + F+ GS+ AFE+ + D
Sbjct:  79 EQDVSNLTNFIQKNMGVT---PDEKQLSVSGQNWGGIDIDGNMLTFMVGSKQAFEVSLPD 135

Query: 138 ISQVVRSGRNELALEFHLDDTAGKTD-ECLVEMRFQAPT-------EEDAIALHAELS-- 187
           ++Q    G+ ++ LE H+DDT G  + + L+++ F  PT       +E     H      
Sbjct: 136 VAQTQMQGKTDVLLELHVDDTTGANEKDSLMDLSFHVPTSNTQFVGDESRPPAHILWETI 195

Query: 188 ---ARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMF 244
              A VGS+    E +V FE +  + PRGRY +EL  ++L + G++ D+KI Y ++ R+F
Sbjct: 196 LKFADVGSSE---EPVVTFEGIAILTPRGRYSVELHLSFLRLQGQANDFKIQYSSIVRLF 252

Query: 245 LLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKL 304
           LLPK +  H  +VI+LDPPIR+G T YPH+V+Q    +  +E  L +  E L ++Y D+L
Sbjct: 253 LLPKSNNPHTFVVITLDPPIRKGQTLYPHIVIQF-ETEAVVERDLALSKELLVEKYKDRL 311

Query: 305 PPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFV 364
                G + +V TKVLR L    +  P +F + Q  +A++++L A DG LYPLE  FFF+
Sbjct: 312 EESYKGLIHEVFTKVLRGLSGAKVTRPGSFRSCQDGYAVKSSLKAEDGLLYPLEKGFFFL 371

Query: 365 NKPPTYLRYEDVDFVEFKRLEM------DRRFDLQVVMLNGSTLLFTNLERSEFSTLYQF 418
            KPPT + +E+++FVEF+R            FDL V + N    LF N++R+E+  L+ F
Sbjct: 372 PKPPTLILHEEIEFVEFERHGAGGASISSHYFDLLVKLKNDQEHLFRNIQRNEYHNLFNF 431

Query: 419 LESKQVRMVGI 429
           +  K ++++ +
Sbjct: 432 INGKNIKIMNL 442


dbj|BAA02719.1| high mobility group protein [Arabidopsis thaliana]
(644 aa)

Score: 245 bits (626), Expect: 2.47994e-63
Length: 433, Idn/Pos/Gap = 148/236/33 (34%/54%/7%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFR  78
           PG L+++  G  W  +  G   + V R D+ S+ W +  + +QL    KDG    F GFR
Sbjct:  20 PGLLKINSGGXQWKKQGGG-KAVEVDRSDIVSVSWTKVTKSNQLGVKTKDGLYYKFVGFR  78

Query:  79 PADRATLEEYVKEAWAWTKGVPRGEQ-GVVGWNFGQVRIEGESVLFVSGSETAFELPISD 137
             D  +L  + + ++  T   P  +Q  V G N+G+V + G ++ F+ GS+ AFE+ ++D
Sbjct:  79 DQDVPSLSSFFQSSYGKT---PDEKQLSVSGRNWGEVDLHGNTLTFLVGSKQAFEVSLAD 135

Query: 138 ISQVVRSGRNELALEFHLDDTAGKTDEC-LVEMRFQAPTEE----------------DAI 180
           +SQ    G+N++ LEF L        +  L+E+ F  P                   D I
Sbjct: 136 VSQTQLQGKNDVTLEFMLMILLVLMRKTPLMEISFHIPNSNTQFVGDENRPPSQVFNDTI 195

Query: 181 ALHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTV 240
              A++S  V  A      +V FE +  + PRGRY++EL  ++L +  ++ D+KI Y +V
Sbjct: 196 VAMADVSPGVEDA------VVTFESIAILTPRGRYNVELHLSFLRLQEQANDFKIQYSSV 249

Query: 241 RRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRY 300
            R+FLLPK ++ H  +VISLDPPIR+G T YPH+V+Q    D  +E  L++  E +  ++
Sbjct: 250 VRLFLLPKSNQPHTFVVISLDPPIRKGQTMYPHIVMQF-ETDTVVESELSISDELMNTKF 308

Query: 301 GDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENC 360
            DKL     G + +V T VLR L    +  P  F +SQ   A++++L A DG LYPLE  
Sbjct: 309 KDKLERSYKGLIHEVFTTVLRWLSGAKITKPGKFRSSQDGFAVKSSLKAEDGVLYPLEKG 368

Query: 361 FFFVNKPPTYLRYEDVDFVEFKRLEMD----RRFDLQVVMLNGSTLLFTNLERSEFSTLY 416
           FFF+ KPPT + ++++D+VEF+R          FDL + +      LF N++R+E+  LY
Sbjct: 369 FFFLPKPPTLILHDEIDYVEFERHAAGGANMHYFDLLIRLKTDHEHLFRNIQRNEYHNLY 428

Query: 417 QFLESKQVRMVGI 429
            F+ SK ++++ +
Sbjct: 429 TFISSKGLKIMNL 441
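
Note on the per-hit summary line: the percentages in parentheses appear to be the Idn/Pos/Gap counts divided by the alignment Length and truncated (not rounded) to a whole percent. For the hit above, 236/433 is about 54.5%, and the report shows 54%. The sketch below parses one such line and recomputes the percentages under that assumption. The names `SUMMARY_RE` and `parse_summary` are hypothetical helpers for illustration and are not part of this server or of BLAST.

```python
import re

# Hypothetical pattern for this report's summary line, e.g.
# "Length: 433, Idn/Pos/Gap = 148/236/33 (34%/54%/7%)"
SUMMARY_RE = re.compile(
    r"Length:\s*(\d+),\s*Idn/Pos/Gap\s*=\s*(\d+)/(\d+)/(\d+)\s*"
    r"\((\d+)%/(\d+)%/(\d+)%\)"
)


def parse_summary(line):
    """Parse a summary line and recompute its percentages.

    Assumption: percentages are floor(100 * count / length), which is
    consistent with the values shown in this report.
    """
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not a Length/Idn/Pos/Gap summary line")
    length, idn, pos, gap, *reported = (int(g) for g in match.groups())
    # Integer division truncates toward zero for non-negative values.
    computed = tuple(100 * n // length for n in (idn, pos, gap))
    return {
        "length": length,
        "counts": (idn, pos, gap),
        "reported": tuple(reported),
        "computed": computed,
    }


if __name__ == "__main__":
    info = parse_summary("Length: 433, Idn/Pos/Gap = 148/236/33 (34%/54%/7%)")
    print(info["computed"] == info["reported"])
```

A mismatch between `computed` and `reported` for some line would suggest either a parsing problem or that the truncation assumption does not hold for that hit.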


pir||S35511 high mobility group protein - Arabidopsis thaliana
(643 aa)

Score: 236 bits (603), Expect: 1.17136e-60
Length: 411, Idn/Pos/Gap = 142/226/32 (34%/54%/7%)

Query:  41 ISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVP 100
           + V R D+ S+ W +  + +QL    KDG    F GFR  D  +L  + + ++  T   P
Sbjct:  40 VEVDRSDIVSVSWTKVTKSNQLGVKTKDGLYYKFVGFRDQDVPSLSSFFQSSYGKT---P  96

Query: 101 RGEQ-GVVGWNFGQVRIEGESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTA 159
             +Q  V G N+G+V + G ++ F+ GS+ AFE+ ++D+SQ    G+N++ LEF L    
Sbjct:  97 DEKQLSVSGRNWGEVDLHGNTLTFLVGSKQAFEVSLADVSQTQLQGKNDVTLEFMLMILL 156

Query: 160 GKTDEC-LVEMRFQAPTEE----------------DAIALHAELSARVGSASFMGESLVF 202
               +  L+E+ F  P                   D I   A++S  V  A      +V 
Sbjct: 157 VLMRKTPLMEISFHIPNSNTQFVGDENRPPSQVFNDTIVAMADVSPGVEDA------VVT 210

Query: 203 FEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDP 262
           FE +  + PRGRY++EL  ++L +  ++ D+KI Y +V R+FLLPK ++ H  +VISLDP
Sbjct: 211 FESIAILTPRGRYNVELHLSFLRLQEQANDFKIQYSSVVRLFLLPKSNQPHTFVVISLDP 270

Query: 263 PIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRV 322
           PIR+G T YPH+V+Q    D  +E  L++  E +  ++ DKL     G + +V T VLR 
Sbjct: 271 PIRKGQTMYPHIVMQF-ETDTVVESELSISDELMNTKFKDKLERSYKGLIHEVFTTVLRW 329

Query: 323 LVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFK 382
           L    +  P  F +SQ   A++++L A DG LYPLE  FFF+ KPPT + ++++D+VEF+
Sbjct: 330 LSGAKITKPGKFRSSQDGFAVKSSLKAEDGVLYPLEKGFFFLPKPPTLILHDEIDYVEFE 389

Query: 383 RLEMD----RRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGI 429
           R          FDL + +      LF N++R+E+  LY F+ SK ++++ +
Sbjct: 390 RHAAGGANMHYFDLLIRLKTDHEHLFRNIQRNEYHNLYTFISSKGLKIMNL 440


ref|XP_584174.1| PREDICTED: similar to structure specific recognition protein 1, partial [Bos taurus]
(670 aa)

Score: 235 bits (599), Expect: 3.61312e-60
Length: 405, Idn/Pos/Gap = 142/228/36 (35%/56%/8%)

Query: 106 VVGWNFGQVRIEGESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGKTDEC 165
           V GWN+G V+  G+ + F  G +  FE+P+S++SQ   +G+NE+ LEFH +D A   +  
Sbjct:  24 VKGWNWGTVKFGGQLLSFDIGDQPVFEIPLSNVSQCT-TGKNEVTLEFHQNDDA---EVS  79

Query: 166 LVEMRFQAP-TEEDAI----ALHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELF 220
           L+E+RF  P T+ED +    A    + ++       G+++  F EL  + PRGRYD+ ++
Sbjct:  80 LMEVRFYVPPTQEDGVDPVEAFAQNVLSKADVIQATGDAICIFRELQCLTPRGRYDIRIY 139

Query: 221 PTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRR 280
           PT+L +  K+FDYKI Y TV R+FLLP  D+  +  VISL  PI+QG T Y  L+L   +
Sbjct: 140 PTFLHLPPKTFDYKIPYTTVLRLFLLPHKDQRQMFFVISLFSPIKQGQTRYHFLILLFSK 199

Query: 281 DDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGA 340
           D+D I + LNM  EE++KR+  +L   +SG L+ +V++V++ LV++ +  P NF    GA
Sbjct: 200 DED-ISLTLNMNEEEVEKRFEGRLTKNMSGSLYPMVSRVMKALVNRKITVPGNFQGHSGA 258

Query: 341 HALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR-LEMDRRFDLQVVMLNG 399
             +  +  A+ G LYPLE  F +V+KPP ++R++++ FV F R     R FD ++    G
Sbjct: 259 QCITCSYKASSGLLYPLERGFIYVHKPPVHIRFDEISFVNFARGTTTTRSFDFEIETKQG 318

Query: 400 STLLFTNLERSEFSTLYQFLESKQVRMVGIPPALLRGATTGKTSMRAAAVRAEMAIAAEE 459
           +   F+++ER E+  L+ F+ +K                  K +++   ++  M  + +E
Sbjct: 319 TQYTFSSIEREEYGKLFDFVNAK------------------KLNIKNRGLKEGMNPSYDE 360

Query: 460 AARREADGEDDEEADDQDTKYDGK-RDPQTFDDEDD---EEDEDF 500
            A  +   ED  +A  +  K +GK R+    D  DD   E DE F
Sbjct: 361 YADSD---EDQHDAYLERMKEEGKIREENANDSSDDSGEETDESF 402


gb|EAA68250.1| hypothetical protein FG02518.1 [Gibberella zeae PH-1]
ref|XP_382694.1| hypothetical protein FG02518.1 [Gibberella zeae PH-1]
(569 aa)

Score: 234 bits (598), Expect: 4.30506e-60
Length: 560, Idn/Pos/Gap = 169/285/71 (30%/50%/12%)

Query:   1 MSAAQVIDPVYVTDQSLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGD  60
           M+A +  D +Y+ D S   G  R ++ G  W P   G  T ++   ++AS +W R A+G 
Sbjct:   1 MTAIESFDNIYL-DLSKESGKCRFAETGFGWKPVGGG-DTFTLDHNNIASAQWSRAAKGY  58

Query:  61 QLRCTRKDGS-TVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGE 119
           +++  ++  S  +  +GF+  D   L +  K  W ++  +   E  + GWN+G+      
Sbjct:  59 EIKIVQRSKSGIIQLDGFQQEDYDRLAKVFKN-W-YSTVLESKEHALRGWNWGKAEFSKS 116

Query: 120 SVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDT---------------AGKTDE 164
            + F   +  AFELP S+I     +GRNE+A+E  L ++               A    +
Sbjct: 117 ELTFSVQNRPAFELPYSEIGNTNLAGRNEVAVEMALPESGANAQLGGARSKGSKAAAGRD 176

Query: 165 CLVEMRFQAP---------------------TEEDAIALHAEL---SARVGSASFMGESL 200
            LVEMRF  P                      E++A  L  E     A +G  +  G+++
Sbjct: 177 QLVEMRFYIPGVTTRKEAEGEDAGSDAGNDEQEKNAATLFYETLIDKAEIGETA--GDTI 234

Query: 201 VFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISL 260
             F ++  + PRGR+D++++     + GK++DYKI Y+ +++  +LPKPDE+H  LV+ L
Sbjct: 235 ATFLDVLHLTPRGRFDIDMYEASFRLRGKTYDYKIQYEAIKKFMVLPKPDEVHYMLVMGL 294

Query: 261 DPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVL 320
           DPP+RQG T YP +V+Q ++ D+E+ + LN+  EEL+ +Y DKL P     L QVV K+ 
Sbjct: 295 DPPLRQGQTRYPFVVMQFKK-DEEVTIDLNLNEEELKSKYQDKLEPHYEEPLHQVVAKIF 353

Query: 321 RVLVDKPLHAP-KNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFV 379
           R L ++ + +P K+F+T +  + ++ ++ A++G+LY LE  F FV KP TY+ YE    V
Sbjct: 354 RGLGNRKISSPAKDFITHRNQYGIKCSIKASEGFLYCLEKAFMFVPKPATYIAYEQTQSV 413

Query: 380 EFKRLEMD----RRFDLQVVMLNGS-TLLFTNLERSEFSTLYQFLESKQVRMVGIPPALL 434
            F R+         FD+ V++ NG+ +  F+N+ R +   L  F + K +R        +
Sbjct: 414 TFSRVSGAVSALSTFDITVLLKNGAGSSQFSNISREDLKALESFFKLKGLR--------V 465

Query: 435 RGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGKRDPQTFDDEDD 494
           +       ++ AAA+  +M  + +E A +   G  DE+ +  D  +      +T  + D 
Sbjct: 466 KNEIDEDANLLAAAMNQQMDDSEDEVAAKADRGSADEDEESVDEDF------RTDSESDV 519

Query: 495 EEDEDFAGDVEDESDGAPSD 514
            E+ D A     ESDG+ SD
Sbjct: 520 AEEYDSA----HESDGSGSD 535


emb|CAG87121.1| unnamed protein product [Debaryomyces hansenii CBS767]
ref|XP_458960.1| unnamed protein product [Debaryomyces hansenii]
(540 aa)

Score: 234 bits (596), Expect: 7.46688e-60
Length: 453, Idn/Pos/Gap = 144/237/46 (31%/52%/10%)

Query:  14 DQSLGPGSLRLSKAGLLW--------SPRNEGLYTISVAREDLASLRWFRGARGDQLRCT  65
           +QS   G +R++ +GL W        S  N   + +    E++ + +W RG+RG +LR  
Sbjct:  12 NQSKNYGRMRIADSGLGWKASVTNNASASNNAPFLLP--SEEILASQWSRGSRGYELRVQ  69

Query:  66 RKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVS 125
            K+   V  +GF   D A L++ ++  +     +   E  + GWN+G+  +    ++F  
Sbjct:  70 TKNKGVVMLDGFDVEDFANLKQELQRNF--QVNLEHKEHSLRGWNWGKTDLARNELVFQV 127

Query: 126 GSETAFELPISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP-TEED------ 178
            ++  FE+P S+IS    +G+NE+A+EF+LD    K  + +VEMRF  P T E+      
Sbjct: 128 NNKPDFEIPYSEISNSNLTGKNEVAVEFNLDGANSKAGDEMVEMRFYIPGTLENETTPAV 187

Query: 179 ------------------AIALHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELF 220
                             A   + +L  +       GE++V F ++ F+ PRGRYD++++
Sbjct: 188 KNEENGEVKEEETEEISAATVFYEQLKDKADIGQVAGEAIVSFSDVLFLTPRGRYDIDMY 247

Query: 221 PTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRR 280
           PT L + GK++DYKI Y  + R+F LPKPDE H  LVI +DPP+RQG T YP LV+Q  +
Sbjct: 248 PTSLRLRGKTYDYKIQYNQIERIFSLPKPDEAHHLLVIQIDPPLRQGQTKYPFLVMQFAK 307

Query: 281 DDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGA 340
            ++EIE+ LN+  EE   +Y D+L      +   V++   + L ++ L  P +F +    
Sbjct: 308 -EEEIELDLNVSEEEYNDKYKDRLKKSYDSQTHLVMSHCFKGLTERRLVVPGSFQSRFLQ 366

Query: 341 HALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR-----LEMDRRFDLQVV 395
             +  +L A++GYLYPL+ CF FV KP  Y+ + ++  +   R     +   R FD+ + 
Sbjct: 367 PGISCSLKASEGYLYPLDRCFLFVTKPTVYIPFSEISSITMSRTGAGGVSTSRTFDMDIT 426

Query: 396 MLNGSTLL--FTNLERSEFSTLYQFLESKQVRM 426
            L GS     F ++ER E  T+  +   K +R+
Sbjct: 427 -LRGSNQSHNFGSIEREEQETIENYCLQKGLRI 458


gb|AAS52821.1| AER138Cp [Ashbya gossypii ATCC 10895]
ref|NP_984997.1| AER138Cp [Eremothecium gossypii]
(542 aa)

Score: 234 bits (596), Expect: 7.72029e-60
Length: 560, Idn/Pos/Gap = 167/274/76 (29%/48%/13%)

Query:  14 DQSLGPGSLRLSKAGLLWSPRNEGLYT-------ISVAREDLASLRWFRGARGDQLRCTR  66
           +QS   G  RL + GL W     G          I +  ++LAS++W RG RG +L+   
Sbjct:  11 NQSKVSGRFRLGEGGLGWKASATGGSAAMQNNEPILLTADELASVQWSRGCRGYELKINT  70

Query:  67 KDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSG 126
           K+   V  +GF   D   L+  ++  +     +   +  + GWN+G   +    ++F   
Sbjct:  71 KNKGVVQLDGFSQEDFTLLKNDLQRRF--NVQLEHKDHSLRGWNWGTTDLTRNELIFSLN 128

Query: 127 SETAFELPISDISQVVRSGRNELALEFHLD-DTAGKTDECLVEMRFQAP---TEED---- 178
            +  FE+P S IS    + +NE+ALEF L  D      + LVEMR   P   T+ED    
Sbjct: 129 GKPTFEIPYSHISNTNLTSKNEVALEFDLQKDGYNPAGDELVEMRLYVPGVVTQEDRHSS 188

Query: 179 -------------------AIALHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLEL 219
                              A A + EL A+       G++++ F+++ F  PRGRYD+++
Sbjct: 189 PAEDADVDMEKDNKEEKSIAEAFYEELRAKAEIGEVSGDAIISFQDVFFTTPRGRYDIDI 248

Query: 220 FPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLR 279
           +   + + GK+++YK+ ++ ++R+F LPK D+IH  +V+S++PP+RQG TTYP+LVLQ +
Sbjct: 249 YKNSIRLRGKTYEYKLQHRQIQRIFSLPKADDIHHLMVLSIEPPLRQGQTTYPYLVLQFQ 308

Query: 280 RDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQG 339
           + D+E EV LN++ +E ++ Y DKL  +   +   V++ VL+ L D  +  P  + +   
Sbjct: 309 K-DEETEVQLNVEDDEFERLYKDKLKKQYDAKTHVVLSHVLKGLTDTRVVVPGEYKSKHE 367

Query: 340 AHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR----LEMDRRFDLQVV 395
             A+  +  AN+G+LYPL+N F F+ KP  Y+ ++DV  V   R        R FDL+VV
Sbjct: 368 QCAVSCSFKANEGHLYPLDNAFMFLTKPTLYIPFQDVSSVNISRAGQATTSSRTFDLEVV 427

Query: 396 ML-NGSTLLFTNLERSEFSTLYQFLESKQVRMVGIPPALLRGATTGKTSMRAAAVRAEMA 454
           +  N  +  F N+ + E   L  FL+SK VR+              K   +    R + A
Sbjct: 428 LRSNRGSTTFANISKEEQQILESFLKSKNVRV--------------KNEEKETQQRLQTA 473

Query: 455 IAAEEAARREADGEDDEEADDQDTKYDGKRDPQTFDDEDDEEDEDFAGDVEDESDGAPSD 514
           +           G D E+ D          +  +  ++D+  DEDF  + ED+      D
Sbjct: 474 L-----------GSDSEDED---------VNMGSAAEDDESVDEDFQAESEDDDVAEEFD 513

Query: 515 SDAARDDDDDDDDDDDDDDD 534
           SDA   + + +  D  D +D
Sbjct: 514 SDAGVSESETEAADGADTED 533


gb|EAK85684.1| hypothetical protein UM04416.1 [Ustilago maydis 521]
ref|XP_402031.1| hypothetical protein UM04416.1 [Ustilago maydis 521]
(558 aa)

Score: 231 bits (590), Expect: 3.89602e-59
Length: 461, Idn/Pos/Gap = 147/236/57 (31%/51%/12%)

Query:  19 PGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQL-----------RCTRK  67
           PG LR+S+ GL W P      TI++  + +AS +W R AR  QL              + 
Sbjct:  18 PGKLRMSQGGLGWKPSVGEGSTITIPADQMASFQWIRVARNYQLAIYLNKDRDAPSSAQT  77

Query:  68 DGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGS 127
           +    NF+GF   D   L  ++++   + K +   E    GWN+GQ +I    V F+   
Sbjct:  78 NPRRTNFDGFVRDDFDRLSSHIRQ--YFNKPLEAKEVSTRGWNWGQAKISNHDVQFLVRD 135

Query: 128 ETAFELPISDISQVVRSGRNELALEF------HLDDTAGKTD-----------ECLVEMR 170
           + AFELP+S ++      + E+++EF            G +D           + LVEMR
Sbjct: 136 KLAFELPLSHLAN-SNIAKTEVSMEFLNPEQQQPGANTGTSDVNGTKSRRSKGDQLVEMR 194

Query: 171 FQAP---------------------TEEDAIALHAELSARVGSASFMGESLVFFEELPFI 209
              P                      E  A A H  L ++       G+S+V F+E+  +
Sbjct: 195 LYVPGQAIKDDGSDAASAQDDDVNNEETAAEAFHEALKSKADIGQVAGDSIVVFKEVLVL 254

Query: 210 VPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNT 269
            PRGRYD+++F T++ + GK++DYKILY ++ ++FLLPK DEIH+ LVI LDPPIRQG T
Sbjct: 255 TPRGRYDIDVFSTFIRLRGKTYDYKILYSSMNKLFLLPKADEIHVMLVIGLDPPIRQGQT 314

Query: 270 TYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLH 329
            YP+LVLQ  R ++E++  LN+  + +Q++Y  KL  +     +++VT + +VL  + + 
Sbjct: 315 RYPYLVLQFPR-EEEMDAELNLDEQTIQEKYDGKLKKRYEEPTFRIVTNIFKVLSGQKVA 373

Query: 330 APKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR----LE 385
            P +F +S G  +++  + A DG LYPLE    +V+K P Y+ Y ++      R    + 
Sbjct: 374 TPTDFESSSGQTSIKCNVKAADGNLYPLEKSLLWVSKQPVYVPYSEIHQAILSRVGGAVA 433

Query: 386 MDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
             + FDL+V   +G+   F ++ R E   L  +L  ++VR+
Sbjct: 434 SSKTFDLRVATKSGTEHTFQSISREELDRLKAWLADRKVRI 474


emb|CAE60458.1| Hypothetical protein CBG04066 [Caenorhabditis briggsae]
(689 aa)

Score: 231 bits (589), Expect: 4.83989e-59
Length: 421, Idn/Pos/Gap = 135/238/21 (32%/56%/4%)

Query:  17 LGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGSTVNFEG  76
           L PG+L++     ++     G  TI ++  D+  +RW +      +R   KDG +  F G
Sbjct:  17 LTPGTLKVEDTSAIFKSDKAG-KTIKLSPGDIKDIRWQKLGNRPGIRFGLKDGGSHRFGG  75

Query:  77 FRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPIS 136
           F+ AD   +    + +W    G+ +    + GWN+G V+++G++V F   +   FE+P +
Sbjct:  76 FKEADFGKILVVTESSWGL--GIEKTNLVIKGWNYGNVQVKGKNVEFSWENNPIFEIPCT 133

Query: 137 DISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPTE---EDAIALHAELS----AR 189
           ++SQ V + +NE  LEFH  +    +   L+EMRF  P +   ED I    E      A 
Sbjct: 134 NVSQCV-ANKNEAVLEFHQHENNPIS---LMEMRFHMPVDPENEDDIDRVEEFKQAVLAY 189

Query: 190 VGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKP 249
            G  +   + +    ++    PRGRYD++++PT +++HGK++DYKI  KT+ R+FL+P  
Sbjct: 190 AGLEAETEQPITLLSDILCTTPRGRYDIKVYPTSIALHGKTYDYKIPVKTITRLFLVPHK 249

Query: 250 DEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDD-EIEVALNMKPEELQKRYGDKLPPKI 308
           D  H+  V++L+PPIRQG T Y +LV +  +DDD E+E+AL    ++L+++YG +L  ++
Sbjct: 250 DGRHVYFVLALNPPIRQGQTRYTYLVFEFVKDDDQEMEIALT---DDLKEKYGGQLKSEM 306

Query: 309 SGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPP 368
            G L++ V+ + +V+ +  +  P  F+ S G  A++ +   N G LYPLE  F F++KP 
Sbjct: 307 DGPLYENVSILFKVVCNLKVTVPGRFIGSSGTPAIQCSHKQNPGLLYPLEKGFLFIHKPV 366

Query: 369 TYLRYEDVDFVEFKRLE---MDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVR 425
            Y+R E++    F R +   + R FD ++ M +G +++F  +E+ E   L+ +L  K+++
Sbjct: 367 MYIRLEEISSCHFARSDAGTVTRTFDFEIDMKSGQSVMFNAMEKEENHKLFDYLNKKEIK 426

Query: 426 M 426
           +
Sbjct: 427 I 427


emb|CAG62236.1| unnamed protein product [Candida glabrata CBS138]
ref|XP_449262.1| unnamed protein product [Candida glabrata]
(543 aa)

Score: 229 bits (584), Expect: 1.90157e-58
Length: 538, Idn/Pos/Gap = 158/271/73 (29%/50%/13%)

Query:   8 DPVYVTDQSLGPGSLRLSKAGLLW---------SPRNEGLYTISVAREDLASLRWFRGAR  58
           D +++     G G  R++ +GL W         S +N+  + +     +L++++W RG R
Sbjct:   6 DRIFMNQSKFG-GRFRIADSGLGWKVSTSGGSASAQNKAPFLLPAT--ELSTVQWSRGCR  62

Query:  59 GDQLRCTRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEG 118
           G +L+   K+   +  EGF   D   ++      ++    V   E  + GWN+GQ  +  
Sbjct:  63 GFELKINTKNQGVIQLEGFSEDDFNIIKGDFHRRFSIQ--VEHKEHSLRGWNWGQTDLAR 120

Query: 119 ESVLFVSGSETAFELPISDISQVVRSGRNELALEFHL-DDTAGKTDECLVEMRFQAP--- 174
             ++F    +  FE+P + I+    + +NE+A+EF++ DDT     + +VEMRF  P   
Sbjct: 121 NEMVFALNGKPVFEIPYARINNTNLTAKNEVAVEFNIQDDTYQPAGDEMVEMRFYLPGSV 180

Query: 175 -TEED-----------------------AIALHAELSARVGSASFMGESLVFFEELPFIV 210
             +ED                       A A + EL  +       G+++V F+++ F  
Sbjct: 181 VVDEDQPAPKKEGEEEGEEAAETETKSLAEAFYEELKNKADIGEIAGDAIVSFQDVFFTT 240

Query: 211 PRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTT 270
           PRGRYD++++   + + GK+++YK+ +  ++R+  LPK D+I+  +V+++DPP+RQG TT
Sbjct: 241 PRGRYDIDIYENSIRLRGKTYEYKLQHNQIQRIVSLPKADDINHLVVLAMDPPLRQGQTT 300

Query: 271 YPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHA 330
           YP LVLQ ++ D+E EV LN+  +E +++Y DKL  +   +   V++ VL+ L  + +  
Sbjct: 301 YPFLVLQFQK-DEETEVQLNLSDQEYEEKYKDKLKKQYDSKTHIVISHVLKGLTGRRVVV 359

Query: 331 PKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKRL----EM 386
           P  + +     A+  +  AN+GYLYPL+N FFF+ KP  Y+ + DV  V   R       
Sbjct: 360 PGEYKSKYEQCAVSCSYKANEGYLYPLDNAFFFLTKPTLYIPFNDVSSVVISRAGQTSTS 419

Query: 387 DRRFDLQVVML-NGSTLLFTNLERSEFSTLYQFLESKQVRMVGIPPALLRGATTGKTSMR 445
            R FDL+V++  N  + +F N+ + E   L  FL+SK +R+              K   +
Sbjct: 420 SRTFDLEVILRSNRGSTIFGNISKEEQQLLENFLKSKNLRV--------------KNEEK 465

Query: 446 AAAVRAEMAIAA---EEAARREADGEDDEEADDQDTKYDGKRDPQTFDDEDDEEDEDF 500
            A VR + A+ +   +E     + GEDDE  D+      G        D+DDE  E+F
Sbjct: 466 DAQVRLQSALGSDSDDEDVNMGSAGEDDESVDEDFHVSSG--------DDDDEVAEEF 515


emb|CAA86251.1| unnamed protein product [Saccharomyces cerevisiae]
ref|NP_013642.1| Pob3p [Saccharomyces cerevisiae]
pir||S48328 hypothetical protein YML069w - yeast (Saccharomyces cerevisiae)
sp|Q04636|YMG9_YEAST Hypothetical 63.0 kDa protein in DAK1-ORC1 intergenic region
(552 aa)

Score: 228 bits (580), Expect: 5.57909e-58
Length: 571, Idn/Pos/Gap = 168/280/89 (29%/49%/15%)

Query:   8 DPVYVTDQSLGPGSLRLSKAGLLWSPRNEGLYTISVARE-------DLASLRWFRGARGD  60
           D +Y+ +QS   G  R++ +GL W     G    + AR+       +L++++W RG RG 
Sbjct:   6 DRIYL-NQSKFSGRFRIADSGLGWKISTSGGSAANQARKPFLLPATELSTVQWSRGCRGY  64

Query:  61 QLRCTRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGES 120
            L+   K+   +  +GF   D   ++      +     V + E  + GWN+G+  +    
Sbjct:  65 DLKINTKNQGVIQLDGFSQDDYNLIKNDFHRRF--NIQVEQREHSLRGWNWGKTDLARNE 122

Query: 121 VLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGK-TDECLVEMRFQ------- 172
           ++F    +  FE+P + I+    + +NE+ +EF++ D   +   + LVEMRF        
Sbjct: 123 MVFALNGKPTFEIPYARINNTNLTSKNEVGIEFNIQDEEYQPAGDELVEMRFYIPGVIQT 182

Query: 173 ----------------APTEED----------------AIALHAELSARVGSASFMGESL 200
                            P +ED                A A + EL  +       G+++
Sbjct: 183 NVDENMTKKEESSNEVVPKKEDGAEGEDVQMAVEEKSMAEAFYEELKEKADIGEVAGDAI 242

Query: 201 VFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISL 260
           V F+++ F  PRGRYD++++   + + GK+++YK+ ++ ++R+  LPK D+IH  LV+++
Sbjct: 243 VSFQDVFFTTPRGRYDIDIYKNSIRLRGKTYEYKLQHRQIQRIVSLPKADDIHHLLVLAI 302

Query: 261 DPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVL 320
           +PP+RQG TTYP LVLQ ++ D+E EV LN++ E+ ++ Y DKL  +   +   V++ VL
Sbjct: 303 EPPLRQGQTTYPFLVLQFQK-DEETEVQLNLEDEDYEENYKDKLKKQYDAKTHIVLSHVL 361

Query: 321 RVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVE 380
           + L D+ +  P  + +     A+  +  AN+GYLYPL+N FFF+ KP  Y+ + DV  V 
Sbjct: 362 KGLTDRRVIVPGEYKSKYDQCAVSCSFKANEGYLYPLDNAFFFLTKPTLYIPFSDVSMVN 421

Query: 381 FKRL----EMDRRFDLQVVML-NGSTLLFTNLERSEFSTLYQFLESKQVRMVGIPPALLR 435
             R        R FDL+VV+  N  +  F N+ + E   L QFL+SK +R+         
Sbjct: 422 ISRAGQTSTSSRTFDLEVVLRSNRGSTTFANISKEEQQLLEQFLKSKNLRV--------- 472

Query: 436 GATTGKTSMRAAAVRAEMAIAA---EEAARREADGEDDEEADDQDTKYDGKRDPQTFDDE 492
                K   R    R + A+ +   EE     + GEDDE  D+         D Q   D 
Sbjct: 473 -----KNEDREVQERLQTALGSDSDEEDINMGSAGEDDESVDE---------DFQVSSDN 518

Query: 493 D-DEEDEDFAGDVEDESDGAPSDSDAARDDD 522
           D DE  E+F      +SD A SD++   D++
Sbjct: 519 DADEVAEEF------DSDAALSDAEGGSDEE 543


ref|XP_322219.1| hypothetical protein [Neurospora crassa]
gb|EAA26950.1| hypothetical protein [Neurospora crassa]
(565 aa)

Score: 227 bits (579), Expect: 6.75939e-58
Length: 554, Idn/Pos/Gap = 165/277/60 (29%/50%/10%)

Query:   1 MSAAQVIDPVYVTDQSLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGD  60
           M+A +  D +Y+ D S   G  R ++ GL W P   G    ++   ++   +W R ARG 
Sbjct:   1 MAAIESFDNIYL-DLSKESGKSRFAENGLGWKPAGGG-EAFTLDSSNIGGAQWSRAARGY  58

Query:  61 QLRCTRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGES 120
           +++   +    V  +GF   D   L +  K  W ++  +   E  + GWN+G+       
Sbjct:  59 EVKILLRSSGVVQLDGFHQEDYERLSKIFKN-W-YSVNLENKEHSLRGWNWGKAEFSKAE 116

Query: 121 VLFVSGSETAFELPISDISQVVRSGRNELALEFHLDD---------TAGKTDEC------ 165
           + F   +  AFE+P S+IS    +GRNE+A+EF  +D         T GK  +       
Sbjct: 117 LTFNVQNRPAFEIPYSEISNTNLAGRNEIAVEFAGNDGGKSNGHSGTGGKGKKASAGKDQ 176

Query: 166 LVEMRFQAP-------------------TEEDAIALHAEL---SARVGSASFMGESLVFF 203
           LVE+RF  P                    E++A+ L  +     A +G  +  G+++  F
Sbjct: 177 LVEVRFYIPGTTTRKEAEGGEAGSDADEEEKNAVTLFYDTLIEKAEIGETA--GDTIATF 234

Query: 204 EELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPP 263
            ++  + PRGR+D++++     + GK++DYKI Y  +++  +LPKPD++H  L I LDPP
Sbjct: 235 LDVLHLTPRGRFDIDMYDASFRLRGKTYDYKIQYDAIKKFMVLPKPDDLHFLLCIGLDPP 294

Query: 264 IRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVL 323
           +RQG T YP +V+Q +  D+E+ + LN+  EEL  +Y DKL       L QVV  + + L
Sbjct: 295 LRQGQTRYPFVVMQFKA-DEEVTLDLNITEEELNGKYKDKLQSHYEQPLHQVVAYIFKGL 353

Query: 324 VDKPLHAP-KNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFK 382
            +K +  P K+F T +  + ++ ++ A++G+LY LE  F FV KP TY+ YE    + F 
Sbjct: 354 ANKKVTTPAKDFTTHRQQYGIKCSIKASEGFLYCLEKAFMFVPKPATYISYEQTQSITFS 413

Query: 383 R----LEMDRRFDLQVVMLNGS-TLLFTNLERSEFSTLYQFLESKQVRMVGIPPALLRGA 437
           R    +     FD+ V M NG+ +  F+N+ R +   L +F + K +R        ++  
Sbjct: 414 RVGGAVSALSTFDITVHMKNGAGSSQFSNINREDLKALEEFFKLKGLR--------VKNE 465

Query: 438 TTGKTSMRAAAV-RAEMAIAAEEAARREAD-GEDDEEADDQDTKYDGKRDPQTFDDEDDE 495
               T++ AAA+   +MA + EEA   +AD G  DE+ +  D  +  + +    ++ D  
Sbjct: 466 IDDDTNLIAAALGDDDMASSDEEAVGPKADRGSADEDEESVDEDFQAESESDVAEEYDSN 525

Query: 496 EDEDFAGDVEDESD 509
            + D +G  E + D
Sbjct: 526 HESDGSGSEESDVD 539


gb|EAK90481.1| structure-specific recognition protein 1 (SSRP1) (recombination signal sequence recognition protein) [Cryptosporidium parvum]
(523 aa)

Score: 224 bits (570), Expect: 6.76135e-57
Length: 432, Idn/Pos/Gap = 128/235/24 (29%/54%/5%)

Query:  20 GSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGD---QLRC-TRKDGSTVNFE  75
           G  + SK    W  R     T     E++  + W + +  D   QLR   R+    ++F 
Sbjct:  22 GIFKASKELFGWKNRRTNA-TYHYKPEEVMGVEWIQTSCEDSSCQLRVFIREKKDCIHFT  80

Query:  76 GFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPI 135
           GF+  D + ++ + +  +     +   E    G N+G + I  +++   +  +    +P 
Sbjct:  81 GFKTEDYSVIKSHFETYYGIN--LETKELNTKGINWGDLTIHNDTICIGNEGKVMMYVPS 138

Query: 136 SDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPTEEDAIALHAELSAR------ 189
            +I+Q+    ++EL LEF+    AG+  + L+E+R   P +E+++  ++  SA       
Sbjct: 139 ININQIAMPSKSELVLEFNEGVNAGEDCDELMEIRLFVPNQENSLDGNSLSSAEKLRSDL 198

Query: 190 -----VGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMF 244
                +GS+  M + +  + ++  +VPRGRY++E+    L +HGKSFDY IL++++ R+F
Sbjct: 199 LKLTGIGSSGSM-DKVCRWNDIHLLVPRGRYEIEVLVNCLKLHGKSFDYTILFQSISRLF 257

Query: 245 LLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDE-IEVALNMKPEELQKRYGDK 303
           LLP+P    + LV++L+ P+RQGNT YP +V+Q     DE IE+ LN+  +E+Q+  G  
Sbjct: 258 LLPRPGTSLVNLVVALETPMRQGNTKYPFVVMQFDTQQDENIEMPLNLSEKEIQRFTG-- 315

Query: 304 LPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFF 363
           L P ++G+ W +VT++L+ L    +  P +F ++   H +R +  A DG LYPL   F F
Sbjct: 316 LSPIMTGKFWDIVTRILKSLTGHSIIVPGDFRSASMYHCIRCSYKAQDGLLYPLNRSFIF 375

Query: 364 VNKPPTYLRYEDVDFVEFKRL--EMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLES 421
           + KP   +R++D+  +EF R+     R F+L + +  G    FT+++++E++ L +FL+ 
Sbjct: 376 ITKPVILIRFDDILNIEFSRMGGNQTRFFELTITIRGGGDYSFTSIDKAEYNPLIKFLQE 435

Query: 422 KQVRMVGIPPAL 433
           K +R+  +  +L
Sbjct: 436 KNIRIKNLQESL 447


gb|EAL36346.1| structure specific recognition protein [Cryptosporidium hominis]
(514 aa)

Score: 223 bits (568), Expect: 1.25367e-56
Length: 406, Idn/Pos/Gap = 123/228/23 (30%/56%/5%)

Query:  46 EDLASLRWFRGARGD---QLRC-TRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPR 101
           E++  + W + +  D   QLR   R+    ++F GF+  D + ++ + +  +     +  
Sbjct:  38 EEVMGVEWIQTSCEDSSCQLRVFIREKFVCIHFTGFKTEDYSVIKSHFETYYGIN--LET  95

Query: 102 GEQGVVGWNFGQVRIEGESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGK 161
            E    G N+G + I  +++   +  +    +P  +I+Q+    ++EL LEF+    AG+
Sbjct:  96 KELNTKGINWGDLTIHNDTICIGNEGKVMMYVPSVNINQIAMPSKSELVLEFNEGVNAGE 155

Query: 162 TDECLVEMRFQAPTEEDAIALHAELSAR-----------VGSASFMGESLVFFEELPFIV 210
             + L+E+R   P +E+++  ++  SA            +GS+  M + +  + ++  +V
Sbjct: 156 DCDELMEIRLFVPNQENSLDGNSLSSAEKLRSDLLKLTGIGSSGSM-DKVCRWNDIHLLV 214

Query: 211 PRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTT 270
           PRGRY++E+    L +HGKSFDY IL++++ R+FLLP+P    + LVI+L+ P+RQGNT 
Sbjct: 215 PRGRYEIEVLVNCLKLHGKSFDYTILFQSISRLFLLPRPGTSLVNLVIALETPMRQGNTK 274

Query: 271 YPHLVLQLRRDDDE-IEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLH 329
           YP +V+Q     DE IE+ LN+  +E+Q+  G  L P ++G+ W +VT++L+ L    + 
Sbjct: 275 YPFVVMQFDTQQDENIEMPLNLSEKEIQRFTG--LSPIMTGKFWDIVTRILKSLTGHSII 332

Query: 330 APKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKRL--EMD 387
            P +F ++   H +R +  A DG LYPL   F F+ KP   +R++D+  +EF R+     
Sbjct: 333 VPGDFRSASMYHCIRCSYKAQDGLLYPLNRSFIFITKPVILIRFDDILNIEFSRMGGNQT 392

Query: 388 RRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGIPPAL 433
           R F+L + +  G    FT+++++E++ L +FL+ K +R+  +  +L
Sbjct: 393 RFFELTITIRGGGDYSFTSIDKAEYNPLIKFLQEKNIRIKNLQESL 438


emb|CAE64433.1| Hypothetical protein CBG09136 [Caenorhabditis briggsae]
(696 aa)

Score: 220 bits (560), Expect: 1.20279e-55
Length: 520, Idn/Pos/Gap = 150/264/29 (28%/50%/5%)

Query:  10 VYVTDQSL-GPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKD  68
           VY+ D  L   G+L+L+   L +     G  +++VA +D+  L+W +      LR    D
Sbjct:   9 VYLEDMGLLALGTLKLTDKSLSFKSEKGG-RSVTVAGDDIDGLKWQKLGNKPGLRVGVND  67

Query:  69 GSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSE 128
           G+   F G + AD   L+++    W  ++ + +    + GW++GQ  ++G ++ F    +
Sbjct:  68 GNVHRFGGLQDADLERLQKFTDAHW--SRPIEQSNLFIKGWSYGQAEVKGRNIEFSWEDK 125

Query: 129 TAFELPISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP----TEEDAIALHA 184
             FE+P +++S V  + +NE  LEFH +D A      L+EMRF  P    TEED   +  
Sbjct: 126 PIFEIPCTNVSNVT-ANKNEAVLEFHQNDNA---QIALMEMRFHMPVDAETEEDVDKVEE 181

Query: 185 ELSARVGSASFMGES---LVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVR 241
              A +  A    E+   +V   ++    PRGRYD++++PT +++HGK++DYKI  K++ 
Sbjct: 182 FKKAVLAYAGLEAETEQPIVLLTDILCTTPRGRYDIKVYPTSIALHGKTYDYKIPIKSIN 241

Query: 242 RMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYG 301
           R+FL+P  D   +  V+SL+PPIRQG T Y +L++   +D+++ E+ L++  E+L++  G
Sbjct: 242 RLFLVPHKDGRSVFFVLSLNPPIRQGQTRYSYLIMDFPKDEEQ-ELELSLTDEQLEESNG 300

Query: 302 DKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCF 361
             L   + G +++ V+ V + + +  +  P  F+   G  A++     N G LYPLE  F
Sbjct: 301 -ALKRTMDGPIYKTVSVVFKAICNLKITEPGRFIGHSGTPAIQCTHRQNPGLLYPLEKGF 359

Query: 362 FFVNKPPTYLRYEDVDFVEFKRLE---MDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQF 418
            F++KP  YLRYEDV      R +   + R  D +V M +G  ++F  +E+ E + L+ +
Sbjct: 360 LFIHKPAMYLRYEDVSSCHLARSDGGTVTRTVDFEVDMKSGGPIIFNTMEKEENNKLFDY 419

Query: 419 LESKQVRMVGIPPALLRGATTGKTSMR--AAAVRAE-------MAIAAEEAARREADGED 469
           L  K +++        R A +    +    AAV+AE           + +          
Sbjct: 420 LSKKNIKIRNPTRVDARAAESSDDEIDPYKAAVKAEGRKRDESDDDESTDEDYDLDKDLK 479

Query: 470 DEEADDQDTKYDGKRDPQTFDDEDDEEDEDFAGDVEDESD 509
           D +  ++D+      +P    D   E+D    G+ E +S+
Sbjct: 480 DRKTKEKDSSEGSASEPDDEYDSGSEQDSSGTGESEPDSE 519


gb|AAA19061.1| Hmg protein 4 [Caenorhabditis elegans]
sp|P41848|SSRP_CAEEL Probable structure-specific recognition protein 1 (SSRP1) (Recombination signal sequence recognition protein)
ref|NP_498633.1| high Mobility Group protein (78.6 kD) (hmg-4) [Caenorhabditis elegans]
(697 aa)

Score: 219 bits (557), Expect: 2.19327e-55
Length: 428, Idn/Pos/Gap = 128/236/20 (29%/55%/4%)

Query:  10 VYVTDQS-LGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKD  68
           VYV D   L  G+L+L++  L +   ++G  +++V   D+  L+W +      LR    D
Sbjct:   9 VYVEDVGHLAFGTLKLTEKSLNFKG-DKGGKSVNVTGSDIDKLKWQKLGNKPGLRVGLND  67

Query:  69 GSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSE 128
           G    F GF+  D   ++ +    W  ++ + +    + GWN+GQ  ++G++V F    +
Sbjct:  68 GGAHRFGGFKDTDLEKIQSFTSSNW--SQSIDQSNLFIKGWNYGQAEVKGKTVEFSWEDK 125

Query: 129 TAFELPISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP----TEEDAIALHA 184
             FE+P +++S V+ + +NE  LEFH +D +      L+EMRF  P     EEDA  +  
Sbjct: 126 PIFEIPCTNVSNVI-ANKNEAVLEFHQNDDSKVQ---LMEMRFHMPIDLENEEDADKVEE 181

Query: 185 ELSARVGSASFMGES---LVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVR 241
              A +  A    E+   +    ++    PRGRYD++++PT +++HGK++DYKI  K++ 
Sbjct: 182 FKKAVLAYAGLEAETEQPICLLTDILCTTPRGRYDIKVYPTSIALHGKTYDYKIPIKSIN 241

Query: 242 RMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYG 301
           R+FL+P  D  H+  V+SL+PPIRQG T Y +L+ +  +D+++ ++ L +  E+L+   G
Sbjct: 242 RLFLVPHKDGRHVYFVLSLNPPIRQGQTRYSYLIFEFGKDEEQ-DLELALTDEQLESSNG 300

Query: 302 DKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCF 361
           + L   ++G +++ ++ + + + +  +  P  F+ S G  A++     N G LYP+E  F
Sbjct: 301 N-LRRDMTGPIYETISILFKSICNLKITVPGRFLGSSGTPAIQCTHRQNPGLLYPMEKGF 359

Query: 362 FFVNKPPTYLRYEDVDFVEFKRLE---MDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQF 418
            F++KP  Y+R+E++    F R +   + R FD ++ +  G  L F  +E+ E + L+ +
Sbjct: 360 LFIHKPAMYIRFEEISSCHFARSDSGTVTRTFDFEIDLKYGGPLTFNAMEKEENNKLFDY 419

Query: 419 LESKQVRM 426
           L  K +++
Sbjct: 420 LNKKNIKI 427


gb|EAA57630.1| hypothetical protein AN6687.2 [Aspergillus nidulans FGSC A4]
ref|XP_410824.1| hypothetical protein AN6687.2 [Aspergillus nidulans FGSC A4]
(589 aa)

Score: 219 bits (557), Expect: 2.24887e-55
Length: 576, Idn/Pos/Gap = 174/278/83 (30%/48%/14%)

Query:   4 AQVIDPVYVTDQSLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLR  63
           ++  D +Y+ D S  PG  +L++ GL W P   G  T ++   ++ + +W R A+G +L+
Sbjct:   7 SESFDNIYL-DLSNQPGKCKLAETGLGWRPSGGG-DTFTLDSSNIGAAQWSRAAKGYELK  64

Query:  64 CTRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLF 123
              +    +  +GF   D   L +  K  W +   V   E  + GWN+G+       + F
Sbjct:  65 ILSRSSGVIQLDGFDQEDFERLSKAFK-IW-YGINVESREHALRGWNWGKAEFTKAELAF 122

Query: 124 VSGSETAFELPISDISQVVRSGRNELALEFHLD------DTAGKT-----------DECL 166
              +  AFE+P S+IS    +G+NE+A+E  L         AG T           DE L
Sbjct: 123 NVQNRPAFEVPYSEISNTNLAGKNEVAVELSLSVDPNGSKPAGSTKNRGRKAAAGPDE-L 181

Query: 167 VEMRFQAP-----TE----------------------EDAIALHAEL---SARVGSASFM 196
           VEMRF  P     TE                      ++A  L  EL    A +G  +  
Sbjct: 182 VEMRFYIPGTAVKTENGIKGENADEKNGGEGEENGEEQNAANLFYELLMEKAEIGDVA-- 239

Query: 197 GESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLAL 256
           G++   F ++  + PRGR+D++++ +   + GK++DYKI Y ++++ FLLPK D+ H  +
Sbjct: 240 GDTFATFLDVLHLTPRGRFDIDMYESSFRLRGKTYDYKIQYSSIKKFFLLPKNDDTHTLI 299

Query: 257 VISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVV 316
           V+ L+PP+RQG T YP LV+QL+  D+EI + LNM  E L+ RY DKL P+    + QV+
Sbjct: 300 VLGLEPPLRQGQTRYPFLVMQLKL-DEEISLELNMTEELLETRYKDKLEPRYEEPIHQVI 358

Query: 317 TKVLRVLVDKPLHAP-KNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYED 375
           TK+ R L  K +  P K+FV+  G   ++ ++ AN+G LY L+    FV KP TY++ E+
Sbjct: 359 TKIFRGLSGKKVIMPSKDFVSHHGHSGVKCSIKANEGLLYFLDKSLIFVPKPATYIQMEN 418

Query: 376 VDFVEFKR----LEMDRRFDLQVVMLNG-STLLFTNLERSEFSTLYQFLESKQVRMVGIP 430
           V  V   R    +   R FD+ V +  G     F+N+ R E   L +F ++K +R     
Sbjct: 419 VAVVTMSRVGGAISASRTFDITVSLKAGMGEHQFSNINREEQQPLEEFFKAKNIR----- 473

Query: 431 PALLRGATTGKTSMRAAAV--RAEMAIAAEEAARREADGEDDEEADDQDTKYDGKRDPQT 488
              ++   +  T+   AA     +M  + E+   R   G  DE+ +  D  +        
Sbjct: 474 ---IKNEMSDDTNALIAAALDNDDMMSSDEDGGGRPDRGSADEDEESVDEDFQA------ 524

Query: 489 FDDEDDEEDEDFAGDVEDESDGAPSDSDAARDDDDD 524
                 + D D A + +   + + S SDA  DD  D
Sbjct: 525 ------DSDSDVAEEYDSAHESSGSGSDAEMDDASD 554


emb|CAH02145.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
ref|XP_451752.1| unnamed protein product [Kluyveromyces lactis]
(555 aa)

Score: 216 bits (551), Expect: 1.16365e-54
Length: 573, Idn/Pos/Gap = 157/274/91 (27%/47%/15%)

Query:   8 DPVYVTDQSLGPGSLRLSKAGLLW-------SPRNEGLYTISVAREDLASLRWFRGARGD  60
           D +Y     LG G  RL++ GL W       S   +    + +A ++++S++W RG RG 
Sbjct:   6 DRIYYNQSKLG-GRFRLAEGGLGWKASATGGSASTQNNEPLLLAADEVSSVQWSRGCRGY  64

Query:  61 QLRCTRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGES 120
           +L+ + K+   +  +GF+  D   L+   +  +     +   E  + GWN+G++ +    
Sbjct:  65 ELKISTKNKGLIQMDGFQQEDFNLLKNDFQRRF--NMQLEHREHSLRGWNWGKLDLARNE 122

Query: 121 VLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTA-GKTDECLVEMRFQAP----- 174
           ++F    +  FE+P + I+    + +NE+ALEF   + A     + LVEMR   P     
Sbjct: 123 MVFSLNGKPTFEIPYTHINNTNLTAKNEIALEFDTQNEAYNPAGDELVEMRLYVPGTVEE 182

Query: 175 ----------------------------------TEEDAIA--LHAELSARVGSASFMGE 198
                                              EE  +A   + EL ++       G+
Sbjct: 183 NEDQDQIMVKDEAEAEDGVKSEVKTEEGSEEPDVQEEKTLAEYFYEELRSKADIGEISGD 242

Query: 199 SLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVI 258
           +++ F++L F  PRGRYD++++   + + GK+++YK+ ++ + R+F LPK D+IH  +V+
Sbjct: 243 AIISFQDLFFTTPRGRYDIDIYKNSIRLRGKTYEYKLQHRQINRIFSLPKADDIHYLMVL 302

Query: 259 SLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTK 318
           S+DPPIRQG T+YP LVLQ ++ D+E EV LN++ +E +K Y DKL  +   +   V++ 
Sbjct: 303 SIDPPIRQGQTSYPFLVLQFQK-DEETEVQLNVEDDEFEKLYKDKLKKQYDAKTHIVLSH 361

Query: 319 VLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDF 378
           VL+ L  + +  P  + +     A+  +   N+G+LYPL+N F F+ KP  Y+ ++D+  
Sbjct: 362 VLKGLTGRRVIVPGEYKSKYDQCAVSCSYKVNEGHLYPLDNAFLFLTKPTLYIPFQDIAA 421

Query: 379 VEFKRLEMD----RRFDLQVVM-LNGSTLLFTNLERSEFSTLYQFLESKQVRMVGIPPAL 433
           V   R        R FDL+VVM  N  T  F N+ + E   L  FL SK +R+       
Sbjct: 422 VNISRAGQTSTSARTFDLEVVMRANRGTTTFANISKEEQQLLETFLRSKNLRV------- 474

Query: 434 LRGATTGKTSMRAAAVRAEMAIAAEEAARRE-----ADGEDDEEADDQDTKYDGKRDPQT 488
                  K   + A  R + A  ++           + GED+E  D+             
Sbjct: 475 -------KNEDKEAEQRLQTAFGSDSDDDDVDINMGSAGEDEESVDE------------- 514

Query: 489 FDDEDDEEDEDFAGDVEDESDGAPSDSDAARDD 521
            D    +ED+D A + + E+  + S+ + ++ +
Sbjct: 515 -DFHASDEDDDVAEEFDSEASASDSEGETSKSE 546


gb|AAC24268.1| Hmg protein 3 [Caenorhabditis elegans]
ref|NP_491688.1| high Mobility Group protein (hmg-3) [Caenorhabditis elegans]
pir||T34025 hypothetical protein C32F10.5 - Caenorhabditis elegans
(689 aa)

Score: 212 bits (540), Expect: 2.36569e-53
Length: 547, Idn/Pos/Gap = 151/278/46 (27%/50%/8%)

Query:  10 VYVTD-QSLGPGSLRLSKAGLLWSPRNEGLYTISVAREDLASLRWFRGARGDQLRCTRKD  68
           VYV D   L  G+L L++  + +   ++G  ++ +   D+  L+W +      LR    D
Sbjct:   9 VYVEDIGHLTCGTLTLTENSINFIG-DKGGKSVYITGTDVDKLKWQKLGNKPGLRVGLSD  67

Query:  69 GSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSE 128
           G    F GF   D   ++ +    W  +K + +    + GWN+GQ  ++G+++ F   +E
Sbjct:  68 GGAHRFGGFLDDDLQKIQSFTSSNW--SKSINQSNLFINGWNYGQADVKGKNIEFSWENE 125

Query: 129 TAFELPISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAPT----EEDAIALHA 184
             FE+P +++S V+ + +NE  LEFH ++   ++   L+EMRF  P     EED   +  
Sbjct: 126 PIFEIPCTNVSNVI-ANKNEAILEFHQNE---QSKVQLMEMRFHMPVDLENEEDTDKVEE 181

Query: 185 ELSARVGSASFMGES---LVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVR 241
              A +  A    E+   +    ++    PRGRYD++++PT +++HGK++DYKI  KT+ 
Sbjct: 182 FKKAVLAYAGLEAETEQPICLLTDILCTTPRGRYDIKVYPTSIALHGKTYDYKIPVKTIN 241

Query: 242 RMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYG 301
           R+FL+P  D   +  V+SL+PPIRQG T Y +L+ +  +D++E ++ L++  E+L    G
Sbjct: 242 RLFLVPHKDGRQVYFVLSLNPPIRQGQTHYSYLIFEFGKDEEE-DLELSLTDEQLDYFNG 300

Query: 302 DKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCF 361
           + L  +++G +++ ++ + + + +  +  P  F+ S G  A++     N G LYP+E  F
Sbjct: 301 N-LQREMTGPIYETISILFKSICNLKVTVPGRFLGSSGTPAIQCTHRQNLGLLYPMEKGF 359

Query: 362 FFVNKPPTYLRYEDVDFVEFKRLE---MDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQF 418
            F+ KP  Y+R+E++    F R +   + R FD ++ +  GS+L F+ +++ E + L+ +
Sbjct: 360 LFIQKPVMYIRFEEISSCHFARSDSGTVTRTFDFEIDLKTGSSLTFSAMDKEENNKLFDY 419

Query: 419 LESKQVRMVGIPPALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDT 478
           L  K+++                       +R    I  + A    +D EDD +      
Sbjct: 420 LNKKEIK-----------------------IRNSHRIDNKSAGYGSSD-EDDIDPYKSTV 455

Query: 479 KYDGKRDPQTFDDEDDEEDEDFAGDVEDESDGAPSDSDAARDDDDDDDDDDDDDDDEALE 538
           K +G+      DDE  +ED D   D++ + +    DS      + DD+ D   + D +  
Sbjct: 456 KAEGREQDDDSDDESTDEDYDLDKDMKKQKND--KDSSEGSGSEPDDEYDSGSEKDASGT 513

Query: 539 AELEPEE 545
            E +P+E
Sbjct: 514 GESDPDE 520


emb|CAG81462.1| unnamed protein product [Yarrowia lipolytica CLIB99]
ref|XP_503258.1| hypothetical protein [Yarrowia lipolytica]
(544 aa)

Score: 205 bits (521), Expect: 3.27742e-51
Length: 584, Idn/Pos/Gap = 162/274/102 (27%/46%/17%)

Query:   8 DPVYVTDQSLGPGSLRLSKAGLLW----------SPRNEGLYTISVAREDLASLRWFRGA  57
           D +Y+ +QS   G LR+ + GL W          S   +      + +E+L +  W RG+
Sbjct:   7 DGIYL-NQSKAHGRLRMVETGLGWKAVQKTSMGGSKETKKDEPFLLGKEELLAAFWSRGS  65

Query:  58 RGDQLRCTRKDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIE 117
           RG +++   K+    NF+GF   +   L+  +K  +  +  V + E  V GWN+G+   E
Sbjct:  66 RGFEMKIQTKNRGAANFDGFEQDNLEELKNVMKRNYGIS--VEQREHSVKGWNWGKTDFE 123

Query: 118 GESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRFQAP--- 174
              ++F   ++ AFE+P ++++     G+NE+ALEF      G+  + LVEMRF  P   
Sbjct: 124 RSELVFSVANKPAFEIPYAEVANSNLVGKNEVALEFQ-QPADGRAGDELVEMRFYVPGVT 182

Query: 175 -------------------------------------TEEDAIALHAELSARVGSASFMG 197
                                                 +  A   +  L  +    +  G
Sbjct: 183 SVEGDENPKKKQKTEKEGEEGKEGDDDADADDESEEEVQSTAQIFYDTLKEKADIGAVAG 242

Query: 198 ESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALV 257
            ++V   E+  ++PRGRYD++++  ++ + GK++DY + YK V+R+ +LPKPD++H  LV
Sbjct: 243 TAVVSLSEIYLVIPRGRYDIDMYANFMRLRGKTYDYMVQYKHVQRLIVLPKPDDLHNILV 302

Query: 258 ISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDK-LPPKISGELWQVV 316
           + LDPP+RQG T YP LV+Q  R+  EI+V LN+   E  ++Y DK L         QVV
Sbjct: 303 VQLDPPLRQGQTRYPFLVMQFLREA-EIKVELNVDDAEFAEKYADKGLKQSYDESAHQVV 361

Query: 317 TKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDV 376
             + R L  + L  P +F T  G   +  +L A++G+LYPLE  F F++KP  ++ + ++
Sbjct: 362 GSIFRGLTGRKLTVPGSFKTVHGHAGVSCSLKASEGHLYPLERNFLFLSKP-VFIPFAEI 420

Query: 377 DFVEFKRL----EMDRRFDLQVVMLNGS-TLLFTNLERSEFSTLYQFLESKQVRMVGIPP 431
             +   R+       R FD+ + + N      F+N+ + E   L  F++SK +R+     
Sbjct: 421 QDITLSRVGSSVTTSRTFDMTLKLRNAQGEYQFSNISKEEQEGLEAFIKSKGIRL----- 475

Query: 432 ALLRGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGKRDPQTFDD 491
                               +  +A E+A       E D+++DD      G     + D+
Sbjct: 476 --------------------KNDLAEEKALLAATLAEVDDDSDD------GGEFRGSADE 509

Query: 492 EDDEEDEDFAGDVEDESDGAPSDSDAARDDDDDDDDDDDDDDDE 535
           +D+  DEDF          A SDS+ A + D + +    ++DDE
Sbjct: 510 DDESPDEDFH---------AESDSEVAEEFDSNAESSSGEEDDE 544


sp|Q04931|SSRP_RAT Structure-specific recognition protein 1 (SSRP1) (Recombination signal sequence recognition protein) (T160)
gb|AAA40927.1| HMG box (bp.  1168..1428)
(561 aa)

Score: 202 bits (515), Expect: 1.87445e-50
Length: 339, Idn/Pos/Gap = 121/194/21 (35%/57%/6%)

Query: 152 EFHLDDTAGKTDECLVEMRFQAP-TEEDAI----ALHAELSARVGSASFMGESLVFFEEL 206
           EFH +D A   +  L+E+RF  P T+ED +    A    + ++       G+++  F EL
Sbjct:   1 EFHQNDDA---EVSLMEVRFYVPPTQEDGVDPVEAFAQNVLSKADVIQATGDAICIFREL  57

Query: 207 PFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQ 266
             + PRGRYD+ ++PT+L +HGK+FDYKI Y TV R+FLLP  D+  +  VISLDPPI+Q
Sbjct:  58 QCLTPRGRYDIRIYPTFLHLHGKTFDYKIPYTTVLRLFLLPHKDQRQMFFVISLDPPIKQ 117

Query: 267 GNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDK 326
           G T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG L+++V++V++ LV++
Sbjct: 118 GQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSGSLYEMVSRVMKALVNR 176

Query: 327 PLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR-LE 385
            +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP ++R++++ FV F R   
Sbjct: 177 KITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVHIRFDEISFVNFARATT 236

Query: 386 MDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMV------GIPPALLRGATT 439
             R FD ++    G+   F+++ER E+  L+ F+ +K++ +       GI P     A +
Sbjct: 237 TTRSFDSEIETKQGTQYTFSSIEREEYGKLFDFVNAKKLNIKNRGLKEGINPGYDDYADS 296

Query: 440 GKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDT 478
            +    A   R +     EE   RE +  D  +   ++T
Sbjct: 297 DEDQHDAYLERMK-----EEGKIREENANDSSDDSGEET 330


ref|NP_702282.1| structure specific recognition protein, putative [Plasmodium falciparum 3D7]
gb|AAN37006.1| structure specific recognition protein, putative [Plasmodium falciparum 3D7]
(506 aa)

Score: 199 bits (507), Expect: 1.69635e-49
Length: 328, Idn/Pos/Gap = 108/185/14 (32%/56%/4%)

Query: 108 GWNFGQVRIEGESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGKTDE-CL 166
           GWN+G+ ++E  ++ F   ++ AF LP ++I+Q+    + ++A+EF  D+   K +E  L
Sbjct: 117 GWNWGEFKLENSNLCFDIDNKYAFNLPTNNINQLNVQIKTDIAMEFKNDENNNKGNEDFL 176

Query: 167 VEMRFQAPTEEDAIA----LHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPT 222
            E+RF  P E D       L  +L  +V       ES+     +P +VPRGRYD+E++ +
Sbjct: 177 AEIRFYYPHENDENQNFQNLKNDLLEKVNIGDTKSESIASLSNIPLLVPRGRYDIEMYSS 236

Query: 223 YLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDD 282
              +HGKS+D+ I Y  + +M L+PK +     L+ SL   ++QG T YP +++QL  DD
Sbjct: 237 TFKLHGKSYDFNIQYTNINKMILVPKSNSNQYVLIFSLSNKMKQGQTEYPFILIQLNNDD 296

Query: 283 D-EIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAH 341
           D E++++ +   +E+  +Y  KL   ISG+   VVTK+   LV+K +  P ++ TS+  H
Sbjct: 297 DMELDISAS---DEVMTKY--KLEKTISGKAHDVVTKLFTALVNKNVIVPGDYRTSKNQH 351

Query: 342 ALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR---LEMDRRFDLQVVMLN 398
            +  +  A  G LYPL   F F+ KP   + ++D+  + F+R   +   R F L +    
Sbjct: 352 GITCSYRAASGQLYPLNKYFLFIVKPVILISFDDIVTLSFQRTGNINQHRFFSLIIKHKR 411

Query: 399 GSTLLFTNLERSEFSTLYQFLESKQVRM 426
           G +  +TN+++SE++ L  FL+SK + +
Sbjct: 412 GMSYEYTNIDKSEYNPLLTFLKSKNINI 439


gb|EAL17792.1| hypothetical protein CNBL3050 [Cryptococcus neoformans var. neoformans B-3501A]
gb|AAW45161.1| chromatin binding protein, putative [Cryptococcus neoformans var. neoformans JEC21]
ref|XP_572468.1| chromatin binding protein, putative [Cryptococcus neoformans var. neoformans JEC21]
(588 aa)

Score: 192 bits (489), Expect: 1.65537e-47
Length: 448, Idn/Pos/Gap = 134/231/46 (29%/51%/10%)

Query:  20 GSLRLSKAGLLWSP-RNEGLYTISVAREDLASLRWFRGARGDQLRCTRKDGST--VNFEG  76
           G LR +  G  W   ++E     +    D+    WFR AR  QLR   ++     ++F+G
Sbjct:  18 GKLRFNPVGFGWKAYQSEDNNPTTYNGSDIRHATWFRVARHFQLRLGMRNSEKPRISFDG  77

Query:  77 FRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELPIS 136
           F+  D   ++  ++E +  T  +   +  + GWN+G+ +++G  ++F    +TAF++P+S
Sbjct:  78 FKRDDLDKIKRTLQEYFNIT--LETRDTSLKGWNWGEAQVKGSDLVFQVQGKTAFDVPLS 135

Query: 137 DISQVVRSGRNELALEFHL--------DDTAGKTDECLVEMRFQAP-------------- 174
            ++    +G+ E+ALEF+          D   +  + +VEMRF  P              
Sbjct: 136 QVANSNIAGKYEVALEFNPPSNYKFDPKDLNKRPPDEMVEMRFYIPGKSMKKAGSDAGSG 195

Query: 175 ---TEED--------AIALHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTY 223
              TE D        A A H+ +  +    + +G+S+V FE+   + PRGR+ +E++   
Sbjct: 196 GEETELDEEGNEVSAADAFHSLIKEKADIGAVVGDSIVVFEDCLILTPRGRFSIEVYADS 255

Query: 224 LSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDD 283
           + + GKS D+++ + ++ R+FLLPK D++H+ LV+ LDPPIRQG T YP LV Q  + D+
Sbjct: 256 IRLVGKSTDHRVPFTSIHRIFLLPKLDDLHVQLVLGLDPPIRQGATRYPFLVAQWPK-DE 314

Query: 284 EIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHAL 343
            +   LN+  EEL + Y D L        +QVV++VL+ L  K +  P +   +QG + +
Sbjct: 315 VVNAELNLTDEELAQ-YPD-LEKTYEATTFQVVSRVLKALTGKKVTPPGSLRNAQGLNGI 372

Query: 344 RTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR----LEMDRRFDLQVV-MLN 398
           R  + A  G LY LE    F++K P  + +   D + F R    +   R FD++VV    
Sbjct: 373 RANVKAVQGELYFLEKGLIFISKQPILIDFSKTDSISFSRVGGGVASARTFDMRVVSKTG 432

Query: 399 GSTLLFTNLERSEFSTLYQFLESKQVRM 426
           G+  +F+ + + E   +  FL+SK +R+
Sbjct: 433 GADHVFSAINKQEVGPISSFLQSKNIRL 460


gb|EAA18148.1| putative structure specific recognition protein [Plasmodium yoelii yoelii]
(493 aa)

Score: 189 bits (481), Expect: 1.4131e-46
Length: 420, Idn/Pos/Gap = 117/214/21 (27%/50%/5%)

Query:  20 GSLRLSKAGLLW-SPRNEGLYTISVAREDLASLRWFRGARGD---QLRCTR-KDGSTVNF  74
           GS R+S   L W + +   +Y       D++   W + +  +    L+    KD   V F
Sbjct:  30 GSFRMSNEFLGWKNKKTNSVYQYKC--NDISEGEWIKLSYNNNRLHLKFNESKDNLIVFF  87

Query:  75 EGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSGSETAFELP 134
           +GF   + A + ++ ++ +    G  +      GWN+G+ ++E  ++ F    + AF + 
Sbjct:  88 DGFPDRNIAEITQHFQKYFNIKLGTRK--LATKGWNWGEFKLENSNLFFDIDKKYAFNIN 145

Query: 135 ISDISQVVRSGRNELALEFHLDDTAGKTDE-CLVEMRFQAPTEEDAIA----LHAELSAR 189
            ++I+Q+    + ++A+E   D+    T+E  L E+RF  P E D       L   L  +
Sbjct: 146 TNNINQLNVQIKTDIAIELKNDENKQNTNEDVLSEIRFYYPHENDENQNFQDLKNNLLEK 205

Query: 190 VGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVRRMFLLPKP 249
           V       E +     +P +VPRGRY++E++     +HGKS+D+ + Y  + +M L+PK 
Sbjct: 206 VNIGDSKSECIASLSNIPLLVPRGRYEIEMYSKTFKLHGKSYDFTVQYSNINKMLLVPKT 265

Query: 250 DEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKIS 309
           +     L+ SL+  I+QG T YP +++QL  DDD +++ +N   E++Q     KL   ++
Sbjct: 266 NSNQYILIFSLNNKIKQGQTEYPFILIQLSNDDD-MDLDINASEEDIQNY---KLEKTLT 321

Query: 310 GELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPT 369
           G+ + VVT++   L  K    P ++ T++  H +  +  A  G LYPL   F FV KP  
Sbjct: 322 GKAYDVVTRLFTALAKKNAIIPGDYRTAKNEHGITCSYRAASGQLYPLNKYFLFVVKPVI 381

Query: 370 YLRYEDVDFVEFKR---LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
            + ++D+  + F+R   +   R F L +    G +  +TN+++SE++ L +FL+SK + +
Sbjct: 382 LISFDDIVTLSFQRTGNINQHRFFSLIIKHKRGISYEYTNIDKSEYAPLLEFLKSKNLNI 441


emb|CAI04722.1| structure specific recognition protein, putative [Plasmodium berghei]
(493 aa)

Score: 187 bits (476), Expect: 5.28091e-46
Length: 368, Idn/Pos/Gap = 106/195/14 (28%/52%/3%)

Query:  67 KDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSG 126
           KD   V F+GF   + + + ++ ++ +    G  +      GWN+G+ ++E  +++F   
Sbjct:  80 KDNLIVFFDGFPDRNLSEITQHFQKYFNIKLGTRK--LATKGWNWGEFKLENSNLIFDID 137

Query: 127 SETAFELPISDISQVVRSGRNELALEFHLDDTAGKTDE-CLVEMRFQAPTEEDAIA---- 181
            + AF +  ++I+Q+    + ++A+E   D+    T+E  L E+RF  P E D       
Sbjct: 138 KKYAFNINTNNINQLNVQIKTDIAIELKNDENKQNTNEDVLSEIRFYYPHENDENQNFQD 197

Query: 182 LHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVR 241
           L   L  +V       E +     +P +VPRGRY++E++     +HGKS+D+ + Y  + 
Sbjct: 198 LKNNLLEKVNIGDSKSECIASLSNIPLLVPRGRYEIEMYSKTFKLHGKSYDFTVQYSNIN 257

Query: 242 RMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYG 301
           +M L+PK +     L+ SL+  I+QG T YP +++QL  DDD +++ +N   E++Q    
Sbjct: 258 KMLLVPKTNSNQYILIFSLNNKIKQGQTEYPFILIQLSNDDD-MDLDINASEEDIQNY-- 314

Query: 302 DKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCF 361
            KL   ++G+ + VVT++   L  K    P ++ T++  H +  +  A  G LYPL   F
Sbjct: 315 -KLEKTLTGKAYDVVTRLFTALAKKNAIIPGDYRTAKNEHGITCSYRAASGQLYPLNKYF 373

Query: 362 FFVNKPPTYLRYEDVDFVEFKR---LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQF 418
            FV KP   + ++D+  + F+R   +   R F L +    G +  +TN+++SE++ L +F
Sbjct: 374 LFVVKPVILISFDDIVTLSFQRTGNINQHRFFSLIIKHKRGISYEYTNIDKSEYAPLLEF 433

Query: 419 LESKQVRM 426
           L+SK + +
Sbjct: 434 LKSKNLNI 441


emb|CAI02497.1| hypothetical protein PB300791.00.0 [Plasmodium berghei]
(415 aa)

Score: 169 bits (428), Expect: 2.20283e-40
Length: 322, Idn/Pos/Gap = 93/169/11 (28%/52%/3%)

Query:  67 KDGSTVNFEGFRPADRATLEEYVKEAWAWTKGVPRGEQGVVGWNFGQVRIEGESVLFVSG 126
           KD   V F+GF   + + + ++ ++ +    G  +      GWN+G+ ++E  +++F   
Sbjct:  80 KDNLIVFFDGFPDRNLSEITQHFQKYFNIKLGTRK--LATKGWNWGEFKLENSNLIFDID 137

Query: 127 SETAFELPISDISQVVRSGRNELALEFHLDDTAGKTDE-CLVEMRFQAPTEEDAIA---- 181
            + AF +  ++I+Q+    + ++A+E   D+    T+E  L E+RF  P E D       
Sbjct: 138 KKYAFNINTNNINQLNVQIKTDIAIELKNDENKQNTNEDVLSEIRFYYPHENDENQNFQD 197

Query: 182 LHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYKTVR 241
           L   L  +V       E +     +P +VPRGRY++E++     +HGKS+D+ + Y  + 
Sbjct: 198 LKNNLLEKVNIGDSKSECIASLSNIPLLVPRGRYEIEMYSKTFKLHGKSYDFTVQYSNIN 257

Query: 242 RMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYG 301
           +M L+PK +     L+ SL+  I+QG T YP +++QL  DDD +++ +N   E++Q    
Sbjct: 258 KMLLVPKTNSNQYILIFSLNNKIKQGQTEYPFILIQLSNDDD-MDLDINASEEDIQNY-- 314

Query: 302 DKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCF 361
            KL   ++G+ + VVT++   L  K    P ++ T++  H +  +  A  G LYPL   F
Sbjct: 315 -KLEKTLTGKAYDVVTRLFTALAKKNAIIPGDYRTAKNEHGITCSYRAASGQLYPLNKYF 373

Query: 362 FFVNKPPTYLRYEDVDFVEFKR 383
            FV KP   + ++D+  + F+R
Sbjct: 374 LFVVKPVILISFDDIVTLSFQR 395


gb|AAB41544.1| structure-specific recognition protein 1 [Bos taurus]
(460 aa)

Score: 130 bits (326), Expect: 1.30678e-28
Length: 250, Idn/Pos/Gap = 83/136/27 (33%/54%/10%)

Query: 256 LVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQV 315
            VISLDPPI+QG T Y  L+L   +D+D I + LNM  EE++KR+  +L   +SG L+++
Sbjct:   6 FVISLDPPIKQGQTRYHFLILLFSKDED-ISLTLNMNEEEVEKRFEGRLTKNMSGSLYEM  64

Query: 316 VTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYED 375
           V++V++ LV++ +  P NF    GA  +  +  A+ G LYPLE  F +V+KPP ++R+++
Sbjct:  65 VSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYPLERGFIYVHKPPVHIRFDE 124

Query: 376 VDFVEFKR-LEMDRRFDLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRMVGIPPALL 434
           + FV F R     R FD ++    G+   F+++ER E+  L+ F+ +K            
Sbjct: 125 ISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKLFDFVNAK------------ 172

Query: 435 RGATTGKTSMRAAAVRAEMAIAAEEAARREADGEDDEEADDQDTKYDGK-RDPQTFDDED 493
                 K +++   ++  M  + +E A  +   ED  +A  +  K +GK R+    D  D
Sbjct: 173 ------KLNIKNRGLKEGMNPSYDEYADSD---EDQHDAYLERMKEEGKIREENANDSSD 223

Query: 494 D---EEDEDF 500
           D   E DE F
Sbjct: 224 DSGEETDESF 233


emb|CAH76375.1| structure specific recognition protein, putative [Plasmodium chabaudi]
(273 aa)

Score: 126 bits (317), Expect: 1.79485e-27
Length: 241, Idn/Pos/Gap = 74/132/9 (30%/54%/3%)

Query: 108 GWNFGQVRIEGESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGKTDE-CL 166
           GWN+G+ ++E  +++F    + AF +  ++I+Q+    + ++A+E   D+    T+E  L
Sbjct:  32 GWNWGEFKLENSNLIFDIDKKYAFNINTNNINQLNVQIKTDIAIELKNDENKQNTNEDVL  91

Query: 167 VEMRFQAPTEEDAIA----LHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPT 222
            E+RF  P E D       L   L  +V       E +     +P +VPRGRY++EL+  
Sbjct:  92 SEIRFYYPHENDENQNFQDLKNNLLDKVNIGDSKSECIASLSNIPLLVPRGRYEIELYSK 151

Query: 223 YLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDD 282
              +HGKS+D+ + Y  + +M L+PK +     L+ SL+  I+QG T YP +++Q   +D
Sbjct: 152 TFKLHGKSYDFTVQYSNINKMLLVPKTNSNQYILIFSLNNKIKQGQTEYPFILIQ-LNND 210

Query: 283 DEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHA 342
           D++++ +N  PEE  K Y  KL   ++G+ + VVT++   L  K    P ++ T++  H 
Sbjct: 211 DDMDLDIN-APEEDIKNY--KLEKTLTGKAYDVVTRLFTALAKKNAIIPGDYRTAKNEHG 267

Query: 343 L 343
           +
Sbjct: 268 I 268


gb|EAL46446.1| structure specific recognition protein, putative [Entamoeba histolytica HM-1:IMSS]
(376 aa)

Score: 120 bits (302), Expect: 9.21271e-26
Length: 337, Idn/Pos/Gap = 90/177/26 (26%/52%/7%)

Query: 103 EQGVVGWNFGQVRIEGESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGKT 162
           E  V G+N+G++ I+  SV         F++   D ++   S + E+++EF  DD+  K 
Sbjct:   5 EYCVSGFNWGRIDIDKNSVQLTHDGYLIFKMNPKDFTKSSISNKTEVSIEF--DDS--KD  60

Query: 163 DECLVEMRFQAPTEE------DAIALHAELSARVGSASFMGESLVFFEELPFIVPRGRYD 216
            + L E++F AP  E      +A  L+ ++ A V   +  G+ +  FE + F+ P+G YD
Sbjct:  61 GDALSEIKFFAPQTEQQNDKDNATELYDKI-AEVTPTNAAGKEVCLFENIGFLSPKGHYD 119

Query: 217 LELFPTYLSMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVL 276
           ++++   + +  K++D+KI YK + R + L K DE     +++L  P+++G + Y  LV+
Sbjct: 120 VKIYEDSVRVQNKTYDFKINYKDIARYYKLRK-DEDTSFFILNLSNPLKKGKSVYECLVM 178

Query: 277 QLRRDDDEIEVALNMKPEELQKRYGDK--LPPKISGELWQVVTKVLRVLVDKPLHAPKNF 334
           +L   ++E+   L+     L K + DK  L   ++     +  ++ R L + P+ +  + 
Sbjct: 179 EL-SSNEEVTAELH-----LTKEFEDKTGLEESMTDNELDLFVELFRSLCNVPIISSGHK 232

Query: 335 VTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKRLEM---DRRFD 391
             +  +H L+  L  N+G+L+P+ + F FV K    + ++D+  V+  R+     ++ FD
Sbjct: 233 FKTNDSHFLKCNLSTNEGFLFPMSDSFIFVFKRIRIIMFKDIKSVDILRMNASNDNKTFD 292

Query: 392 LQVVMLNG--STLLFTNLERSEFSTLYQFLESKQVRM 426
             V+ L G   +L FT + R+E+  L  +L+  Q+++
Sbjct: 293 F-VINLKGRRGSLQFTGMNRNEYENLVGYLKESQLKL 328


emb|CAD25634.1| STRUCTURE-SPECIFIC RECOGNITION PROTEIN [Encephalitozoon cuniculi GB-M1]
ref|NP_586030.1| STRUCTURE-SPECIFIC RECOGNITION PROTEIN [Encephalitozoon cuniculi GB-M1]
(425 aa)

Score: 119 bits (299), Expect: 2.1044e-25
Length: 321, Idn/Pos/Gap = 93/157/43 (28%/48%/13%)

Query: 112 GQVRIEGESVLFVSGSETAFELPISDISQVVRSGRNELALEFHLDDTAGKTDECLVEMRF 171
           G++ I G+  L    ++T FE+P+ DI  VV   RNEL++              L +M  
Sbjct: 105 GELGINGQKALEFRNTKTIFEIPVDDIESVVDI-RNELSVS-------------LRDMEI 150

Query: 172 QAPTEEDAIALHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSF 231
           +  ++   I        + G +S + + ++  E L    PRG+++   F  YL + G S+
Sbjct: 151 RFVSDRKTIE-----EIKGGCSSSVDDEILKMEGLSLAYPRGKFNFIFFRDYLRLVGSSY 205

Query: 232 DYKILYKTVRRMFLLPKP--DEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVAL 289
           D+KI YK+++ +++L K    +    +VI  DPPIRQG T Y H+V+      D++E  L
Sbjct: 206 DHKIYYKSIKMLYVLEKGYIRDGERYVVIGADPPIRQGQTRYDHVVVAF----DDVEREL 261

Query: 290 NMKPEELQKRYGDKLPPKISGELWQVVTKVLRVL-VDKPLHAPKNFVTSQGAHALRTALG 348
           ++  E L+  Y        SG L ++  +V+  L V K + +  +F +  G   LR A+ 
Sbjct: 262 SVSDERLKGEY--------SGLLSEIFAEVMEALCVIKAVRS--SFESRDGMRCLRCAMK 311

Query: 349 ANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKRLEMD----RRFDLQVVMLNGSTLLF 404
           A +G LYPL++C  F+ K    L   ++  VEF R+ +     + FD+ +      T  F
Sbjct: 312 AYEGQLYPLDDCMLFLPK-AVRLDLGEISLVEFSRINLSSMQAKTFDMTLFCEGPYT--F 368

Query: 405 TNLERSEFSTLYQFLESKQVR 425
             L + EF  L Q+   K ++
Sbjct: 369 NGLSKDEFGALEQYFHGKGIK 389


emb|CAC08510.1| SSRP1-like protein [Zygosaccharomyces rouxii]
(542 aa)

Score: 117 bits (292), Expect: 1.24441e-24
Length: 253, Idn/Pos/Gap = 75/132/9 (29%/52%/3%)

Query: 179 AIALHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYLSMHGKSFDYKILYK 238
           A A H EL  +       G+S+V F+++ F  PRGRYD++++   + + GK+++YK+ ++
Sbjct: 275 AEAFHEELKEKADIGEVSGDSIVSFQDVFFATPRGRYDIDIYKNSIRLRGKTYEYKLQHR 334

Query: 239 TVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDEIEVALNMKPEELQK 298
            + +   LPK   +  + +I      +   TT P +  Q     DE +  ++++ E+ + 
Sbjct: 335 QIPKDRSLPKAMILPSSGLIHRTTITQGQTTTLPCITFQ----KDERQRCIDLEDEDFEA 390

Query: 299 RYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALRTALGANDGYLYPLE 358
            Y D+L  +   +   VV+ VL+ L  + +  P  + +     A+  +  AN+GYLYPL+
Sbjct: 391 NYKDRLKREYDAKTHIVVSHVLKGLTGRRVMVPGEYKSKYDQCAVSCSYKANEGYLYPLD 450

Query: 359 NCFFFVNKPPTYLRYEDVDFVEFKRL----EMDRRFDLQVVML-NGSTLLFTNLERSEFS 413
           N FFF+ KP  Y+ + DV  V   R        R FDL+V +  N  +  F N+ + E  
Sbjct: 451 NAFFFLTKPTLYIPFMDVSSVNISRAGQASTSSRTFDLEVTLRGNRGSTTFANISKEEQQ 510

Query: 414 TLYQFLESKQVRM 426
            L QFL+S+ +R+
Sbjct: 511 LLEQFLKSRNLRV 523


emb|CAH84175.1| hypothetical protein PC300891.00.0 [Plasmodium chabaudi]
(202 aa)

Score: 116 bits (291), Expect: 1.52031e-24
Length: 203, Idn/Pos/Gap = 68/108/8 (33%/53%/3%)

Query: 169 MRFQAPTEEDAIA----LHAELSARVGSASFMGESLVFFEELPFIVPRGRYDLELFPTYL 224
           +RF  P E D       L   L  +V       E +     +P +VPRGRY++EL+    
Sbjct:   1 IRFYYPHENDENQNFQDLKNNLLDKVNIGDSKSECIASLSNIPLLVPRGRYEIELYSKTF  60

Query: 225 SMHGKSFDYKILYKTVRRMFLLPKPDEIHLALVISLDPPIRQGNTTYPHLVLQLRRDDDE 284
            +HGKS+D+ + Y  + +M L+PK +     L+ SL+  I+QG T YP +++Q   +DD+
Sbjct:  61 KLHGKSYDFTVQYSNINKMLLVPKTNSNQYILIFSLNNKIKQGQTEYPFILIQ-LNNDDD 119

Query: 285 IEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGAHALR 344
           +++ +N  PEE  K Y  KL   ++G+ + VVT++   L  K    P ++ T++  H + 
Sbjct: 120 MDLDIN-APEEDIKNY--KLEKTLTGKAYDVVTRLFTALAKKNAIIPGDYRTAKNEHGIT 176

Query: 345 TALGANDGYLYPLENCFFFVNKP 367
            +  A  G LYPL   F FV KP
Sbjct: 177 CSYRAASGQLYPLNKYFLFVVKP 199


gb|EAA57453.1| hypothetical protein MG10128.4 [Magnaporthe grisea 70-15]
ref|XP_365908.1| hypothetical protein MG10128.4 [Magnaporthe grisea 70-15]
(261 aa)

Score: 95 bits (235), Expect: 4.6199e-18
Length: 156, Idn/Pos/Gap = 57/89/6 (36%/57%/3%)

Query: 276 LQLRRDDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAP-KNF 334
           +Q +RD+ E+ + LN+  EEL  +Y DKL       L QVVT + R L +K + AP K+F
Sbjct:   1 MQFKRDE-EVTLDLNLAEEELNGKYKDKLQGHYEQPLHQVVTYIFRGLANKKITAPAKSF  59

Query: 335 VTSQGAHALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR----LEMDRRF 390
            T +G   ++ A+ A++G+LY LE  F FV KP TY+ YE    V F R    +   + F
Sbjct:  60 QTHRGQLGIKCAIKASEGFLYCLEKAFMFVPKPATYISYEQTQSVTFSRVGGAVSATQTF 119

Query: 391 DLQVVMLNGSTLLFTNLERSEFSTLYQFLESKQVRM 426
           D+ + M  G +  F+N+ R +   L  F ++K++R+
Sbjct: 120 DITIHMKGGGSSQFSNINREDLKALETFFQAKELRV 155


gb|EAA18768.1| structure-specific recognition protein 1 [Plasmodium yoelii yoelii]
(198 aa)

Score: 78 bits (192), Expect: 5.69971e-13
Length: 149, Idn/Pos/Gap = 43/82/6 (28%/55%/4%)

Query: 281 DDDEIEVALNMKPEELQKRYGDKLPPKISGELWQVVTKVLRVLVDKPLHAPKNFVTSQGA 340
           +DD++++ +N   E++Q     KL   ++G+ + VVT++   L  K    P ++ T++  
Sbjct:   1 NDDDMDLDINASEEDIQNY---KLEKTLTGKAYDVVTRLFTALAKKNAIIPGDYRTAKNE  57

Query: 341 HALRTALGANDGYLYPLENCFFFVNKPPTYLRYEDVDFVEFKR---LEMDRRFDLQVVML 397
           H +  +  A  G LYPL   F FV KP   + ++D+  + F+R   +   R F L +   
Sbjct:  58 HGITCSYRAASGQLYPLNKYFLFVVKPVILISFDDIVTLSFQRTGNINQHRFFSLIIKHK 117

Query: 398 NGSTLLFTNLERSEFSTLYQFLESKQVRM 426
            G +  +TN+++SE++ L +FL+SK + +
Sbjct: 118 RGISYEYTNIDKSEYAPLLEFLKSKNLNI 146


ref|YP_209549.1| RNA polymerase beta' subunit-2 [Huperzia lucidula]
gb|AAT80745.1| RNA polymerase beta' subunit-2 [Huperzia lucidula]
(1788 aa)

Score: 52 bits (124), Expect: 3.95749e-05
Length: 130, Idn/Pos/Gap = 37/57/4 (28%/43%/3%)

Query:  450 RAEMAIAAEEAARREADGED--DEEADDQDTKYDGKRDPQTFDDE-DDEEDEDFAGDVED  506
            R   +   EE    E  GE+  DEE  D++T  +   D +T D+E  DEE  D     E+
Sbjct: 1604 RELSSFPTEETIGGETIGEETSDEETSDEETSDEETSDEETSDEETSDEETSDEETSDEE 1663

Query:  507 ESDGAPSDSDAARDDDDDDDDDDDDDDDEALEAELEPEEPLEQVTEEDQSQGDPESTSAA  566
             SD   SD + + ++  D +  D++  DE    E    E + + T E   +   E+   A
Sbjct: 1664 TSDKETSDEETSDEETSDKETSDEETSDEETIGE-TTREAIREATRETAGEATGETVGGA 1722

Query:  567 AAAETGHATD  576
                 G AT+
Sbjct: 1723 TEETVGEATE 1732


ref|NP_700792.1| hypothetical protein PF10_0319 [Plasmodium falciparum 3D7]
gb|AAN35516.1| hypothetical protein [Plasmodium falciparum 3D7]
(553 aa)

Score: 50 bits (120), Expect: 0.000104175
Length: 97, Idn/Pos/Gap = 31/55/4 (31%/56%/4%)

Query: 461 ARREADGEDDEEADDQDTKYDGKRDPQTFDDE--DDEEDEDFAGDVEDESDGAPSDSDAA 518
           A +  D ++D+E +D +   D K D +  +DE  DDE+++D   D E   D   +D    
Sbjct: 210 AEQINDEKNDDEKNDDEKNDDEKNDAEQINDEKNDDEKNDDEKNDDEKNDDEKINDEKND 269

Query: 519 RDDDDDDDDDDDDDDDEALEAELEPEEPL--EQVTEE 553
            + +DD+ +DD+ +DDE ++AE   +E +  EQ+ +E
Sbjct: 270 DEKNDDEKNDDEKNDDEKIDAEQINDEQINAEQINDE 306


gb|AAC17540.2| Hypothetical protein F55F10.1 [Caenorhabditis elegans]
ref|NP_500551.2| protein conserved (4F151) [Caenorhabditis elegans]
(4368 aa)

Score: 45 bits (106), Expect: 0.00487823
Length: 96, Idn/Pos/Gap = 27/47/5 (28%/48%/5%)

Query:  466 DGEDDEEADDQDTKYDGKRDPQTFDDEDDEEDEDFAGDVEDESDGAPSDSDAARDDDDDD  525
            D +D  E  ++  + +G  D Q  D E+ E   D    ++ E D A    D  +++  D 
Sbjct: 3629 DAKDVTEEMEETGQIEGLEDEQPVDSEEHEAKNDNEKPIDMEDDFAEDLQDIDKNEKGDQ 3688

Query:  526 DDDDDDDDDEALEAELEPEEPLEQVTEEDQSQGDPE  561
            +D +D+ D+E      + E+ +  V EED+ Q DP+
Sbjct: 3689 NDGEDESDEEP-----DVEDQMGDVEEEDEKQLDPK 3719
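
Note on the summary lines: each hit above reports identities, positives and gaps as raw counts followed by percentages of the alignment length. For the hits in this part of the listing, the printed percentages agree with count / Length truncated (not rounded) to a whole number; for example 157/573 = 27.4% and 91/573 = 15.9% appear as 27% and 15%. The following is a minimal Python sketch of that arithmetic, assuming the two-line "Score: ... / Length: ..." layout used on this page. The function name parse_hit_summary and the regular expressions are illustrative assumptions, not part of BLAST or of this server.

```python
import re

# Patterns for the two summary lines printed above each alignment on this page.
# They target this page's layout only (an assumption), not raw BLAST output.
SCORE_RE = re.compile(r"Score:\s*(\d+(?:\.\d+)?) bits \((\d+)\), Expect:\s*(\S+)")
STATS_RE = re.compile(r"Length:\s*(\d+), Idn/Pos/Gap = (\d+)/(\d+)/(\d+)")


def parse_hit_summary(score_line, stats_line):
    """Parse one hit's summary lines and recompute the truncated percentages."""
    s = SCORE_RE.search(score_line)
    t = STATS_RE.search(stats_line)
    if s is None or t is None:
        raise ValueError("summary lines do not match the expected layout")
    length, idn, pos, gap = (int(x) for x in t.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "length": length,
        # Integer division truncates, consistent with the figures shown on the page.
        "pct": tuple(100 * n // length for n in (idn, pos, gap)),
    }


hit = parse_hit_summary(
    "Score: 216 bits (551), Expect: 1.16365e-54",
    "Length: 573, Idn/Pos/Gap = 157/274/91 (27%/47%/15%)",
)
print(hit["pct"])  # (27, 47, 15)
```

The percentages are relative to the alignment length (including gap columns), not to the length of the query or subject sequence, so a short high-identity fragment and a long low-identity alignment are not directly comparable on these figures alone.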