R.P.Kuhnlein
et
al.
1
gaattccccgataaaaaggggggttttatacaaacaatagcgaagaacacaaaatcttataaaccgaaattaaaaaaggcaacaacaacgctcataactgcgctgccaagcgagatggcg
121
ggc
gtggaagcgaga
ggagtgagtgagtgagcgaagag
gcgcaacaaatagcagactctt
acgctagag
241
ccctcccataaattatgataaattattaatcgagagagcgagcgagatggggcattacaacaacaatagcgcaaaggtgagccgcaaaagacagaaggaagtctcgctgcttctcttt
g
361
tatgttctcatt
caAgdatggcccgagageaaagagagagaggQgtaaagaagagcagctcataat
cgccgaaca,agtltcataaacgtc
ctcaggtaaaaaragcgartgcgac
g
481
agcatcacgaggcaaaagtagaagpaaaaaag
ccagg
gagc
ggccggcaaagtaacagtaacagcaatttaataaaattatgatgagagcgagcgagaggcscatatgaa
601
aaataatatgaaatgcaacaaa
catgcatgt
agagt
aacacaagataaattggctgagtgatagtgattftcagaatataataacacacacacacacacatacacaaac
721
c
attctcac
aaagcagccgaacaaagccgaacfcccacaacttct
cctattattcta
cattt
ttttgtcactcttatttGGACGCGCGTTGCTGMCGTTCG
841
CGCGACACTCGA
TC
TCT
AGTGCC
TCG
TMCACG
aGCGTATCGTAACGCGCACT'CA;TTCTGTTTTMGGCGAGTTTMCCATCGATTTACACAAGATAMCCCCCAAGTG
961
CCGCGAMAATTACCACTTACGATGACAATCAACGGTCAG,TTG
1
GCCAACAGCTAGATACGATAAATTAGCTAAACACGCGCAAAGGCCAGCGGTACTACGCGMCCCACTG
1081
CCGACGGCAAAACACAAAAGTGCTACAAGTMCTAAAMAAGTTAITTTCCTCACAAAATAAACCAMAAAAACATAGTGCAATGATCAAGCTGAGTTMTGAAATCMACCA
1201
AAAACACACTCACAAAAAGTGCTAACTAAAG6TTCTCTTGATGAAAAATCACCTGTCCAACGTTCTGTGTGCGATGCGTAGTGACTTCAAGGATMTCACCAGGAGACCATCAATAAA
M
K
N
H
L
S
N
V
L
C
A
M
R
S
D
F
K
D
N
H
Q
E
T
I
N
K
M
1321
TGATACAAMGTACAGTGAATGCTGTCA
AACAGCTG
AAGGATCGCGCTCGCAGCGCCGACAAAGgtgagctgaaaatctatttgccaagoaaa..........
I
Q
F
G
T
V
K
Y
G
I
V
K
Q
L
K
D
R
A
R
S
A
D
K
A
..........
..
aactcgtatcctaaataatttgagaatgttgttttctgtgttttcagACAITCGGGTAsGCGOATCAG
EGAAEGGG
GCSPLTGCTCGCACTA-CG
6241
AACCACTACGGCCAGTCCCAGCCGAAGTCCCGAGCCGGAGGAGGAGCAGCCCGAGGAGCAGAGCACTTCAGAGCAGAGCATACCAGAGCAAAGCACACCAGACCACCAACTCGAGAACGA
T T
T
A
S P
S
R
S
P
E P
E
E
E
Q
P E
E
0
S
T
S
E
Q
S
I
P
E
0S
T
P
D
H
0
L
E
N
D
6361
TATCAAATCCGAGGCGMATCAGAGATAGAGCCCGTTGAGGATMCAACAACAGAGTGGCGATGACAAAGCCCAGTTCCGAGGAGCGGGAACCGAATGCCAGTGGCTCCATGCCGAGTTC
I
K
S
E
A
K
S
E
I
E P
V
E
D
N
N
N
R V
A
M
T
K
P
S
S
E
E
R
E P
N
A
5
G
S
M
P
S
S
6481
CCCAGTGGCGGAGGCCAGTGCCGAGGAGGCGGCCACCGAGAGGACGCCGGAAAAGGAGAAGGAGAAGGACGTGGAGGTCGATGTGGAGATGCCCGATGAGGCACCCAGCAGTGCGGTGCC
P
V
A
E
A
S
A
E
E
A
A
T
E
R
T
P
E
K
E
K
E
K
D
V
E
V
D
V
E
M
P
D
E
A
P
S
S
A
V
P
6601
CTCGACTGAGGTMCTCTGCCGGGCGGAGCAGGAGCACCGGTCACCCTGGAGGCCATCCAAAATATGCAAAT6GCCATT6CCCAGMGCGGCCAAGACCATTGCGAATGGTTCCAATGG
S
T
E
V
T
L
P
G G
A
G
A
P
V
T
L
E
A
I
Q
N
M
0
M
A
I
A
Q
F
A
A
K
T
I
A
N
G
S
N
G
6721
AGCCGACAATGAGGCTGCCATGAAGCAGTTGGCCTTCCTTCAGCAAACCCTCTTCAATCTGCAGCAACAGCAGCTCTTCCAGATCCAGCTGATCCAACAGCTCCAGTCGCAGCTGGCGCT
A
D
N
E
A
A
M
K
0
L
A
F
L
Q
Q
T
L
F
N
L
0
0
0 0
L
F
0I0LI
0
0
L
Q
S
Q
L
A
L
6841
CAATCAGGCGAAACAGGAAGAGGATACCGA,GGAGGATGCGGATCAGGAGCAAGATCAGGAACAGG
A
CAGATACCTAGAGAG
GAGGACGCATCGCCGATATGGAACTGCGCCAGAA
N
Q
A
K
Q
E
E
D
T
E
E
D
A
D
Q
E
Q
D
Q
E
Q
E
T
D
T
Y
E E
E
E
R
I
A
D
M
E
L
R
Q
K
6961
GGCGGAGGCCAGAATGGCGGAGGCTAAAGCGCGTCAGCATCTTAAAMTGCTGGTGTTCCGCTGCGAGAGTCCTCCGGTTCTCCAGCTGAATCTCTGAAGCGAAGACGTGAGCATGATCA
A
E
A
R
M
A
E
A
K
A
R
H
L
I
N
A
G
V
P
L
R
E
S
S
G
S P
A
E
S
L
K
R
R
R
E
H
D
H
7081
CGAATCCCAGCCAAATCGTAGAACGAGMGGATMACACACACAAAGCAGATACGGCGCAGGATGCGCTGGCCAAGTTAAAGGAAATGGAGAACACACCACTGCCCTTCGGTTCCGATCT
E
S
Q
P
N
R
R
T
S
L
D
N
T
H
K
A
D
T
A
Q
D
A
L
A
K
L
K
E
M
E N
T
P
L
P
F
G
S
D
L
7201
GGCTTCCAGCATTATCACCAACCATGATGATCTGCCCGAGCCGAATTCCCTGGACCTGCTCCAGAAACGT6CCCAGGA6GTGCTGGACTCCGCGTCGCAG6GGATCCTGGCCAACAGCAT
A
S
S
I
I
T
N
H
D D
L
P
E P N
S
L
D
L
L
0
K
R
A
0
E
V
L
D
S
A
S
0
G
I
L
A
N
S
M
7321
GGCTGACGACM
GCCTTCGGTGAGAAATCGGGTGAGGGAAAGGGTCGCAATGAGCCGTTCTTCAAGCACCGCTGCAGGTACTGCGGGAA6GTCM
GGCTCGGACTCGGCGCTCCAGAT
A
D D
F
A
F
G
E
K
S
G
E
G
K
G
R
N
E
P
F F
K
H
R C
R
Y
C
G
K
V
F
G
S
D
S
A
L
0
I
7441
CCACATAAGATCGCATACTGGCGAGCGGCCCMAAGTGCAATGTGTGCGGCAGTCGGTTCACCACCAAGGGCAACCTTAAGGTTCACMCAGCGGCATGCCCAMAG6TTCCCCCATGT
H
I
R
S
H
T
G
E
R
P
F
K
C
N
V C
G
S
R
F
T
T
K
G
N
L
K
V
H
F
Q
R
H
A
QC
K
F
P
H
V
7561
GCCCATGAATGCCACGCCCATTCCGGAGCACATGGACAAGMCATCCGCCGCTGCTGGATCMATGTCGCCCACGGATA6CTCTCCCMATCATTCCCC6CCGCCGCCCCCATTGGGCTC
P
M
N
A
T
P
I
P
E
H
M
D
K
F
H
P
P
L
L
D
0
M
S
P
T
D
S
S
P N H
S P
A
P
P
P
L
G
S
7681
TGCTCCGGCATCCMCCGCCCGCCTTCCCTGGCCTTCAGAATCTCTATCGCCCGCCTAT6GGATCCTTAMAAATCTTGGAGCCGCTGCGCCGCACCAATACTTCCCTCAGGAGTTGCC
A
P
A
S
F P
P
A
F
P
G
L
Q
N
L
Y
R
P P
M
E
I
L
K
S
L
G
A
A
A
P
H
Q
Y
F
P
Q
E
L
P
7801
CACGGATCTGAGAAACCCTCGCCTCAATTGGATGAGGATGAGCCGCAGGTTAA6AGAACUCCGTCGAAGAGMGGACAGCGGGAGGAGCATGAACAGGAGATGGCAGAGTGCTCAGA
T D
L
R
K
P S P
Q
L
D
E
D
E
P
Q
V
K
N E
P
V
E
E
K
D
Q
R
E
E
H
E
0E
M
A
E
C
S
E
7921
GCCCGAGCCGGAACCGCTGCCCCTAGAGTGCGCATCAAGGAGGAGCGTGTGGAGGAGCAGGAACAGGTTAAACAGGAGGACCATCGCATAGAGCCACGTAGGACACCCTCTCCTTCATC
P
E
P
E
P
L
P
L
E
V
R
I
K
E
E
R
V
E
E
Q
E
Q
V
K
0
E
D
H
R
I
E
P
R
R
T
P
S
P
S
S
8041
AGAGCACCGCTCCCCGCACCACCACCGTCACAGCCACTGGGCTATCCACCAGTGGTGCAGCCCATCCAACCGGCCGCACTTATGCATCCGCAATCTTCGCCGGGCTCGCAATCCCACCT
E
H
R
S P
H
H
H
R
H
S
H
M
G
Y
P
P
V V
0
P
I
Q
P
A
A
L
M
H P
Q
S
S
P
G
S
Q
S
H
L
8161
GGATCACCTGCCCACGCCGGGGCAATTGCCACCCCGCGAAGAMCTTCGCTGAGCGMTCCCCCTTMACMACCACCGCCAA6ATGCTATCACCCGAACACCACTCTCCAGTAAGATC
D
H
L
P
T
P
G
L
P
P
R
E
D
F F
A
E
R
F P
L
N
F
T
T
A
K
M
L
S
P
E
H
H
S
P
V
R
S
8281
GCCCGCTGGCGGAGCACTTCCACCGGGTGTTCCACCACCACCGCACCACCACCCGCACCACATGGCCAGATCGCCGTTCMAACCCCATCAAGCACGAGATGGCCGCACTACTGCCCCG
P
A
G
G
A
L
P
P
G
V
P
P
P
P
H
H
H
P
H
H
M
A
R
S
P
F
F
N
P
I
K
H
E
M
A A
L
L
P
R
8401
CCCGCAlAC4AG
CGATAACTCGTGGGAGAACTTCATCGAGGMCGACACCTGTGAG
ACCATGMGTAAGGACTAGAGA6ACAAGMGATACGATCCCMATCAGTGTGTGGT
P
H S
N
D
N
S
W
E
N
F
I
E
V
S
N
T
C
E
T
M K
L
K
E
L
M
K
N
K
K
I
S
D
P
N
Q
C V
V
8521
CTGTGATCGGGTGTTATCCTGCAAGAGTGCCCTCCAGATGCACTACCGAACCCACACCGGTGAGCGCCCATTCAAGTGCAGGATCTGCGGCAGGGCATTCACCACCAAGGGCAACCTAAA
C
D
R
V
L
S
C K
S
A
L
Q
M
H
Y
R
T
H
T
G
E
R
P
F
K
C R
I
C
G
R
A
F
T
T K
G
N
L
K
8641
GACCCACATGGCTGTGCACAAGATTCGTCCGCCGATGAGAMCTTCCACCAGTGCCCCGMGCCACAAGAAGTACTCGAATGCCCTGGTCCTGCAGCAGCACATCCGATTGCATACGGG
T
H
M
A
V
H
K
I
R
P
P
M
R
N
F
H
Q
C
P
V C
H
K
K
Y
S
N
A
L
V
L
0 0
H
I
R
L
H
T
G
8761
TGAGCCCACTGATCTGACGCCGGAGCAAATCCAGGCGGCCGAGATCAGGGACCCGCCACCTTCGATGATGCCCGGTCACMATGAATCCCTTCGCAGCGGCTGCCTTCCAMCGGTGC
E
P
T
D
L
T
P
E
0
I
Q
A
A
E
I
R
D
P
P
P
S
M
M
P
G
H
F
M
N
P
F
A
A A
A
F
H
F
G
A
8881
TCTTCCCGGCGGTCCAGGTGGTCCTCCGGGTCCGAATCATGGTGCCCACAATGGCGCCTTGGGATCGGAGTCGTCGCAGGGCGATATGGATGATMTATGGACTGCGGCGAGGACTACGA
L
P
G
G
P
G
G
P P
G
P
N
H
G
A
H
N
G
A
L
G
S
E
S
S
Q
G
D
M
D
D
N
M
D
C
G
E
D
Y
D
9001
CGATGATGTGTCGTCGGAGCACCTCTCGAATAGTMTCTCGAGCAGGAGGGCGACAGATCGCGCTCTGGTGATGACTTCAAGTCCCTGTTGTTCGAGCAAAGCTGAGAATTGATGCCAC
D
D
V
S
S
E
H
L
S
N
S
N
L
E
CQC
E
G
D
R
S
R
S
G
D
D
F
K
S
L
L
F
E
Q
K
L
R
I
D
A
T
9121
CGGTGTGGTTAACACGAACCCCGTAAGACCGCGTTCCTCCGC
GCAGTCATGGCCATTCGGTGGGCTCCACCTCTGCGCCCACCTCGCCCAGCGTA
ATGCATCATCCCAGGTTATCAA
G
V
V
N
T
N
P
V
R
P
R
S
S
A
S
S
H
G
H
S
V
G
S
T
S
A
P
T
S
P
S
V
H
A
SS0V
I
K
9241
GCGCAGCTCTTCGCCCGCTCGTTCAGAGGCTTCTCAGGGAGCCCTGGACTTGACGCCCU6TGCTGCCCCCACATCGA6TTCCAGTTCGCGTTCTCCCCTGCCA
6~~CCAGTCAG
R
S
S
S
P
A
R
S
E
A
S
0
G
A
L
D
L
T
P
R
A
A
P
T
S
S
S
S
S
R
S
P
L P
K
E
K
P
V
S
9361
TCCGCCCAGCTTGCCTAGGAGTCCCAGTGGTTCTAGCCACGCCTCCGCCAACATACTGACCTCACCCCTGCCGCCCACCGTGGGCATTGACTGCTTGCCTAAGAC6TGCAACACCA
M
P
P
S
L
P
R
S P
S
G
S
S H
A
S
A
N
I
L
T
S
P
L
P
P
T
V
G
I
D
C
L
P
K
G
L
H
H L
9481
6G=CAGCAG6CAGCATCACCT11TbATbGWCACAAAGCGCAGTGGCAGCGGCAGCAGCTGCGCAGCACCATCATCACCAGCAAATGGCTGCACTCGATCAGCACCAGAGACTGCGTCG
00H0
H
L
M
0
0
0
A
A
V
A
A
A
A
A
A
Q
H
H H
H
0
0
M
A
A
L
D
0
H
0
E
0
L
R R
9601
C&AGCO.oTH6AACGCAGCAAAAGGCCGCAGCAGCTGCTGCAGACGGCCGCAGCAGCCGCGGCCCAGCGACAAACACCTCCGCAAGCCCGTGATCAGCGGCAGGAAGGGGACC6GG
E
A
A
E
A
0
0
K
A
A
A
A
A
A
A
A
A
A
A
A
A
A
0
R
0
T
P
P
Q
A
R
D
0
R
Q
E
G
G
P
G
9721
AGCGGGACCGCCGCCCAATCCGTTGATGGGCGCCCGCCCGCCCTTCGGCATGTTCCCCAACCTGCCGCTCTTCCCCCCCGCCACCACCCAGAACATGTGCAATGCGATGAACCAGATCGC
A
G
P
P
P
N
P
L
H
G
A
R
P P
F
G
M
F
P
N
L
P
L
F
P
P
A
T
T
N
M
C
N
A
M
N
Q
I
A
9841
CCAGTCCGTAATGCCGGCGGCTCCATCACCACGCCCTCA
GCGGT
GTTCGCGGCAGTACCACCTGCG6CATCT6CTACAAGACATTCCCCTGCCACTCGGCGCTGGAGATCCACTA
QSV
M
P
A
A
P
F
N
P
L
A
L
S
G
V R
G
S
T
T
C
G
I
C
Y
K
T
F P
C
H
S
A
L
E
I
H
Y
9961
CGGSAGCCACACC^AAAGG6CGGCCATTCAAGTGCAGCATCTGTGATCGCGGCTTTACACCAAGgtgagctatagttacttctattctgaatttattggggggttttctaacggtgccta
R
S H
T
K
E
R
P
F
K
C
S
I
C
D
R
G
F
T
T
K
10081,
cacttaaaacaaaatttaaaccaaaaaactoatoaaaaatttcctttttttttcatttattttccaaGG6AACCTGAAGCAACACATGCTAACTCATAAAATCCGCGATATGGAGCAAGa
96O
a;aacvv;aacyaLaywa; s; ;;;;S; aS;
;auv1
l l l u uu
27
51
68
106
148
1"
28
we
348
38
428
468
508
7Us
548
828
708
748
788
m
908
948
1028
1086
1108
1148
llU
118
1228
1286
1308
133
6
N
L
K
Q
H
M
L
T
H
K
I
R
D
N
E
Q
E
134
10201
MCCTTCAGAAATCGTGCCGTAAAgtatgtaagtcttccaatatcacccatcccgtcctgtccttttcattcctattcataaatcccsttagtttgcttttaccaactcttcttatttctt
T
F
R
N R
A
V
K
1S
10321
atggcactttttctttacgatgatttatacatcttttaacaagttatattatcagtagtttatagattttggagacactatasatacttccctatagataattgttcctatgcccctaat
10441
gaccatcttattaaatacattaatcatttcacttttactaaacaatccacatctttttgctctttcccat
gcagAT6A6TGAGTGGAACMAAGTCCGAATAAGTAAACACTCTACAC
10561
TACCACGATTACGATGCAGATGGCTTMTCCGCTATCAAGATCACTTGACCCCCGGAAMMGMGGCGATCGCAGTCCAGACCCAGTCMGATGCAATCGATTCTCGCAACCAAATGA
10681
TCTCMGTMTAGAGCGAMTGAGGGGGAGAAAGAGGACGTGACAGGTCCGGTGAAGiATCGGTCGTGTAACAAATCMATAMATMAATTGTTGCCCACMTATTACTTG
10801
ATTGTTGTTCCAAGCGAMGGAAAAGTMM6CCMACTGCAMAATGGGCTGATCGATGATCGATTATGTTCCCGGGCTCGGGCCAACmAAMeMlATGATCACCGGGGMAT
10921
TAACGGGGG6AATGCCGAGCACACGTACACCCATACTAAGGTGGGATCATGAMACGTATCCAMAGATGCATCAAAAGCGAM6TGTTCCAGCTAMCMATCGAAAGATCTGCTGATCTG
11041
GAACCAAAGCTGCTTGGTATGGAGAGACTGATGGCGMCATGTTCCACAMACTGAGAACGGAATCTAAACTAAATCCAAATCCGTATCCGGACTCGTAMGTATCCATATTCGMAT
11161
ACGAGTCCGAATCCGAGGCAGCTGATGAA6CGCAG6TGAAGGCGTAGTAAMATCAAMATTCGAAMAAAGCM
ATCTTMAGGTATATGCTMAAA
IAAAAMTTGTACCT
11281
MGCGAGACATGTGTACATACGTATATATsAMTATATATGATATATTATAACCAAATCCCAAGAACGCATACGCATACACGTTMTMATCTAMGGAATGTGCAATMTATGA
11401
CATGCTAAAM
AGATGCATCGCCGAGCGCA
GGCTTMGMCCTACTACTCGTMATTAGAMATMTCCACMCTGTACATACTTCGTATATAAGCACCCACACTCGCACAC
11521
ACTTATATATCATAACACACACMGACMCGCTTTGC
MCGAGMGGGTTACGAGCTAA
AAACGAAATAAMAATCTAAAAATTTAGGATTGTAT
11641
ATTAAATGTAWAAACGAAATACAMTTACGATTGCAGTGGCCGGGGGAMTCGAAGCCCCCCMATCGMGCCCCAGCAAMATGCMCCATATCACAGATGAAGAACCACMAAAGATAT
11761
CTAACATTCATAGTTMAMAGTTGTTAGCCGAGGAATCCCCGACCCACAACCCAAMACCCC_AA_ACCCGCTCACACTCTMMCACTTCATTGGMA
11881
GGAAGGATTAACCCCTTAGCGATAAGTAAGTCTATGAGCGATCGTACATGTATTGATTACCACMATTAMTTATACACGGATGCCAATGTATCCCACTCTGTTCGTAAGCATTMGCATA
12001
GTCTCATMMATTACGCMGCCCAAGC
CGAA6AAAMCAGAAACTAAAAA
TGCAMTAAMT
TCAGGAAGT
AACATAAATATTGTAMMGTTATGA
12121
AAAATTaatoaacc
tctcclagttttctttcggatctacggat
->
poly(g)
tall
Flg.
3.
Nucleotide
and
deduced
amino
acid
sequences
of
the
sal
gene.
The
sequence
of
12
164
bp
of
the
genornic
sal
region
(available
under
accession
No.
X75541),
excluding
DNA
sequences
of
the
first
intron,
is
presented
and
numbered
on
the
left
side.
The
DNA
sequence
of
the
coding
strand
of
a
composite
sal
cDNA
is
indicated
by
upper
case
type.
Introns
and
genomic
sequences
not
represented
in
the
cDNA
are
in
lower
case
type.
The
predicted
amino
acid
sequence
is
shown
below
the
nucleotide
sequence
and
numbered
on
the
right
side.
The
protein
sequence
shown
is
a
conceptual
translation
of
the
longest
open
reading
frame
within
the
cDNA
sequences.
It
begins
at
the
fifth
ATG
of
the
cDNA
and
ends
at
a
TGA
triplet
indicated
by
asterisks.
Two
putative
polyadenylation
signals
at
the
3'end
of
the
transcription
unit
are
double
underlined
and
the
poly(A)
tail
of
a
sal
cDNA
clone
is
indicated
by
an
arrowhead.
The
P-insertion
site
of
the sal
mutant
salA405
is
at
nucleotide
position
480.
172