[0001] This invention relates to methods for the preparation and use of variant antibodies and finds application particularly in the fields of immunology and cancer diagnosis and therapy.
[0002] Naturally occurring antibodies (immunoglobulins) comprise two heavy chains linked together by disulfide bonds and two light chains, one light chain being linked to each of the heavy chains by disulfide bonds. Each heavy chain has at one end a variable domain (V
[0003] The constant domains are not involved directly in binding the antibody to an antigen, but are involved in various effector functions, such as participation of the antibody in antibody dependent cellular cytotoxicity. The variable domains of each pair of light and heavy chains are involved directly in binding the antibody to the antigen. The domains of natural light and heavy chains have the same general structure, and each domain comprises four framework (FR) regions, whose sequences are somewhat conserved, connected by three hyper-variable or complementarity determining regions (CDRs) (see Kabat, E. A. et al.,
[0004] Widespread use has been made of monoclonal antibodies, particularly those derived from rodents including mice, however they are frequently antigenic in human clinical use. For example, a major limitation in the clinical use of rodent monoclonal antibodies is an anti-globulin response during therapy (Miller, R. A. et al.,
[0005] The art has attempted to overcome this problem by constructing “chimeric” antibodies in which an animal antigen-binding variable domain is coupled to a human constant domain (Cabilly et al., U.S. Pat. No. 4,816,567; Morrison, S. L. et al.,
[0006] The isotype of the human constant domain may be selected to tailor the chimeric antibody for participation in antibody-dependent cellular cytotoxicity (ADCC) and complement-dependent cytotoxicity (see e.g. Brüggemann, M. et al.,
[0007] In the typical embodiment, such chimeric antibodies contain about one third rodent (or other non-human species) sequence and thus are capable of eliciting a significant anti-globulin response in humans. For example, in the case of the murine anti-CD3 antibody, OKT3, much of the resulting anti-globulin response is directed against the variable region rather than the constant region (Jaffers, G. J. et al.,
[0008] In a further effort to resolve the antigen binding functions of antibodies and to minimize the use of heterologous sequences in human antibodies, Winter and colleagues (Jones, P. T. et al.,
[0009] The therapeutic promise of this approach is supported by the clinical efficacy of a humanized antibody specific for the CAMPATH-1 antigen with two non-Hodgkin lymphoma patients, one of whom had previously developed an anti-globulin response to the parental rat antibody (Riechmann, L. et al.,
[0010] In some cases, substituting CDRs from rodent antibodies for the human CDRs in human frameworks is sufficient to transfer high antigen binding affinity (Jones, P. T. et al.,
[0011] For a given antibody a small number of FR residues are anticipated to be important for antigen binding. Firstly for example, certain antibodies have been shown to contain a few FR residues which directly contact antigen in crystal structures of antibody-antigen complexes (e.g., reviewed in Davies, D. R. et al.,
[0012] It is also known that, in a few instances, an antibody variable domain (either V
[0013] The three-dimensional structure of immunoglobulin chains has been studied, and crystal structures for intact immunoglobulins, for a variety of immunoglobulin fragments, and for antibody-antigen complexes have been published (see e.g., Saul et al.,
[0014] Humanizing an antibody with retention of high affinity for antigen and other desired biological activities is at present difficult to achieve using currently available procedures. Methods are needed for rationalizing the selection of sites for substitution in preparing such antibodies and thereby increasing the efficiency of antibody humanization.
[0015] The proto-oncogene HER2 (human epidermal growth factor receptor 2) encodes a protein tyrosine kinase (p185
[0016] The murine monoclonal antibody known as muMAb4D5 (Fendly, B. M. et al.,
[0017] It is therefore an object of this invention to provide methods for the preparation of antibodies which are less antigenic in humans than non-human antibodies but have desired antigen binding and other characteristics and activities.
[0018] It is a further object of this invention to provide methods for the efficient humanization of antibodies, i.e. selecting non-human amino acid residues for importation into a human antibody background sequence in such a fashion as to retain or improve the affinity of the non-human donor antibody for a given antigen.
[0019] It is another object of this invention to provide humanized antibodies capable of binding p185
[0020] Other objects, features, and characteristics of the present invention will become apparent upon consideration of the following description and the appended claims.
[0021] The objects of this invention are accomplished by a method for making a humanized antibody comprising amino acid sequence of an import, non-human antibody and a human antibody, comprising the steps of:
[0022] a. obtaining the amino acid sequences of at least a portion of an import antibody variable domain and of a consensus variable domain;
[0023] b. identifying Complementarity Determining Region (CDR) amino acid sequences in the import and the human variable domain sequences;
[0024] c. substituting an import CDR amino acid sequence for the corresponding human CDR amino acid sequence;
[0025] d. aligning the amino acid sequences of a Framework Region (FR) of the import antibody and the corresponding FR of the consensus antibody;
[0026] e. identifying import antibody FR residues in the aligned FR sequences that are non-homologous to the corresponding consensus antibody residues;
[0027] f. determining if the non-homologous import amino acid residue is reasonably expected to have at least one of the following effects:
[0028] 1. non-covalently binds antigen directly,
[0029] 2. interacts with a CDR; or
[0030] 3. participates in the V
[0031] g. for any non-homologous import antibody amino acid residue which is reasonably expected to have at least one of these effects, substituting that residue for the corresponding amino acid residue in the consensus antibody FR sequence.
[0032] Optionally, the method of this invention comprises the additional steps of determining if any non-homologous residues identified in step (e) are exposed on the surface of the domain or buried within it, and if the residue is exposed but has none of the effects identified in step (f), retaining the consensus residue.
[0033] Additionally, in certain embodiments the method of this invention comprises the feature wherein the corresponding consensus antibody residues identified in step (e) above are selected from the group consisting of 4L, 35L, 36L, 38L, 43L, 44L, 46L, 58L, 62L, 63L, 64L, 65L, 66L, 67L, 68L, 69L, 70L, 71L, 73L, 85L, 87L, 98L, 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 58H, 60H, 67H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 78H, 91H, 92H, 93H, and 103H (utilizing the numbering system set forth in Kabat, E. A. et al.,
[0034] In certain embodiments, the method of this invention comprises the additional steps of searching either or both of the import, non-human and the consensus variable domain sequences for glycosylation sites, determining if the glycosylation is reasonably expected to be important for the desired antigen binding and biological activity of the antibody (i.e., determining if the glycosylation site binds to antigen or changes a side chain of an amino acid residue that binds to antigen, or if the glycosylation enhances or weakens antigen binding, or is important for maintaining antibody affinity). If the import sequence bears the glycosylation site, it is preferred to substitute that site for the corresponding residues in the consensus human if the glycosylation site is reasonably expected to be important. If only the consensus sequence, and not the import, bears the glycosylation site, it is preferred to eliminate that glycosylation site or substitute therefor the corresponding amino acid residues from the import sequence.
[0035] Another embodiment of this invention comprises aligning import antibody and the consensus antibody FR sequences, identifying import antibody FR residues which are non-homologous with the aligned consensus FR sequence, and for each such non-homologous import antibody FR residue, determining if the corresponding consensus antibody residue represents a residue which is highly conserved across all species at that site, and if it is so conserved, preparing a humanized antibody which comprises the consensus antibody amino acid residue at that site.
[0036] Certain alternate embodiments of the methods of this invention comprise obtaining the amino acid sequence of at least a portion of an import, non-human antibody variable domain having a CDR and a FR, obtaining the amino acid sequence of at least a portion of a consensus antibody variable domain having a CDR and a FR, substituting the non-human CDR for the human CDR in the consensus antibody variable domain, and then substituting an amino acid residue for the consensus amino acid residue at at least one of the following sites:
[0037] a. (in the FR of the variable domain of the light chain) 4L, 35L, 36L, 38L, 43L, 44L, 58L, 46L, 62L, 63L, 64L, 65L, 66L, 67L, 68L, 69L, 70L, 71L, 73L, 85L, 87L, 98L, or
[0038] b. (in the FR of the variable domain of the heavy chain) 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 58H, 60H, 67H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 78H, 91H, 92H, 93H, and 103H.
[0039] In preferred embodiments, the non-CDR residue substituted at the consensus FR site is the residue found at the corresponding location of the non-human antibody.
[0040] Optionally, this just-recited embodiment comprises the additional steps of following the method steps appearing at the beginning of this summary and determining whether a particular amino acid residue can reasonably be expected to have undesirable effects.
[0041] This invention also relates to a humanized antibody comprising the CDR sequence of an import, non-human antibody and the FR sequence of a human antibody, wherein an amino acid residue within the human FR sequence located at any one of the sites 4L, 35L, 36L, 38L, 43L, 44L, 46L, 58L, 62L, 63L, 64L, 65L, 66L, 67L, 68L, 69L, 70L, 71L, 73L, 85L, 87L, 98L, 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 58H, 60H, 67H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 78H, 91H, 92H, 93H, and 103H has been substituted by another residue. In preferred embodiments, the residue substituted at the human FR site is the residue found at the corresponding location of the non-human antibody from which the nonhuman CDR was obtained. In other embodiments, no human FR residue other than those set forth in this group has been substituted.
[0042] This invention also encompasses specific humanized antibody variable domains, and isolated polypeptides having homology with the following sequences.
1. SEQ. ID NO. 1, which is the light chain variable domain of a humanized version of muMAb4D5: DIQMTQSPSSLSASVGDRVTITCRASQDVNTAVAWYQQKPGKAPKLLIYSASFLESGVP SRFSGSRSGTDFTLTISSLQPEDFATYYCQQHYTTPPTFGQGTKVEIKRT 2. SEQ. ID NO. 2, which is the heavy chain variable domain of a humanized version of muMAb4D5): EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARIYPTNGYTR YADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAMDVWGQGTLV TVSS
[0043] In another aspect, this invention provides a consensus antibody variable domain amino acid sequence for use in the preparation of humanized antibodies, methods for obtaining, using, and storing a computer representation of such a consensus sequence, and computers comprising the sequence data of such a sequence. In one embodiment, the following consensus antibody variable domain amino acid sequences are provided:
SEQ. ID NO. 3 (light chain): DIQMTQSPSSLSASVGDRVTITCRASQDVSSYLAWYQQKPGKAPKLLIYAASSLESGVP SRFSGSGSGTDFTLTISSLQPEDFATYYCQQYNSLPYTFGQGTKVEIKRT, and SEQ. ID NO. 4 (heavy chain): EVQLVESGGGLVQPGGSLRLSCAASGFTFSDYAMSWVRQAPGKGLEWVAVISENGGYT RYADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAMDVWGQGTL
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051] In general, the following words or phrases have the indicated definitions when used in the description, examples, and claims:
[0052] The murine monoclonal antibody known as muMAb4D5 (Fendly, B. M. et al.,
[0053] A humanized antibody for the purposes herein is an immunoglobulin amino acid sequence variant or fragment thereof which is capable of binding to a predetermined antigen and which comprises a FR region having substantially the amino acid sequence of a human immunoglobulin and a CDR having substantially the amino acid sequence of a non-human immunoglobulin.
[0054] Generally, a humanized antibody has one or more amino acid residues introduced into it from a source which is non-human. These non-human amino acid residues are referred to herein as “import” residues, which are typically taken from an “import” antibody domain, particularly a variable domain. An import residue, sequence, or antibody has a desired affinity and/or specificity, or other desirable antibody biological activity as discussed herein.
[0055] In general the humanized antibody will comprise substantially all of at least one, and typically two, variable domains (Fab, Fab′, F(ab′)
[0056] The humanized antibody will be selected from any class of immunoglobulins, including IgM, IgG, IgD, IgA and IgE, and any isotype, including IgG1, IgG2, IgG3 and IgG4. Usually the constant domain is a complement fixing constant domain where it is desired that the humanized antibody exhibit cytotoxic activity, and the class is typically IgG
[0057] The FR and CDR regions of the humanized antibody need not correspond precisely to the parental sequences, e.g., the import CDR or the consensus FR may be mutagenized by substitution, insertion or deletion of at least one residue so that the CDR or FR residue at that site does not correspond to either the consensus or the import antibody. Such mutations, however, will not be extensive. Usually, at least 75% of the humanized antibody residues will correspond to those of the parental FR and CDR sequences, more often 90%, and most preferably greater than 95%.
[0058] In general, humanized antibodies prepared by the method of this invention are produced by a process of analysis of the parental sequences and various conceptual humanized products using three dimensional models of the parental and humanized sequences. Three dimensional immunoglobulin models are commonly available and are familiar to those skilled in the art. Computer programs are available which illustrate and display probable three dimensional conformational structures of selected candidate immunoglobulin sequences. Inspection of these displays permits analysis of the likely role of the residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis of residues that influence the ability of the candidate immunoglobulin to bind its antigen.
[0059] Residues that influence antigen binding are defined to be residues that are substantially responsible for the antigen affinity or antigen specificity of a candidate immunoglobulin, in a positive or a negative sense. The invention is directed to the selection and combination of FR residues from the consensus and import sequence so that the desired immunoglobulin characteristic is achieved. Such desired characteristics include increases in affinity and greater specificity for the target antigen, although it is conceivable that in some circumstances the opposite effects might be desired. In general, the CDR residues are directly and most substantially involved in influencing antigen binding (although not all CDR residues are so involved and therefore need not be substituted into the consensus sequence). However, FR residues also have a significant effect and can exert their influence in at least three ways: They may noncovalently directly bind to antigen, they may interact with CDR residues and they may affect the interface between the heavy and light chains.
[0060] A residue that noncovalently directly binds to antigen is one that, by three dimensional analysis, is reasonably expected to noncovalently directly bind to antigen. Typically, it is necessary to impute the position of antigen from the spatial location of neighboring CDRs and the dimensions and structure of the target antigen. In general, only those humanized antibody residues that are capable of forming salt bridges, hydrogen bonds, or hydrophobic interactions are likely to be involved in non-covalent antigen binding, however residues which have atoms which are separated from antigen spatially by 3.2 Angstroms or less may also non-covalently interact with antigen. Such residues typically are the relatively larger amino acids having the side chains with the greatest bulk, such as tyrosine, arginine, and lysine. Antigen-binding FR residues also typically will have side chains that are oriented into an envelope surrounding the solvent oriented face of a CDR which extends about 7 Angstroms into the solvent from the CDR domain and about 7 Angstroms on either side of the CDR domain, again as visualized by three dimensional modeling.
[0061] A residue that interacts with a CDR generally is a residue that either affects the conformation of the CDR polypeptide backbone or forms a noncovalent bond with a CDR residue side chain. Conformation-affecting residues ordinarily are those that change the spatial position of any CDR backbone atom (N, Cα, C, O, Cβ) by more than about 0.2 Angstroms. Backbone atoms of CDR sequences are displaced for example by residues that interrupt or modify organized structures such as beta sheets, helices or loops. Residues that can exert a profound affect on the conformation of neighboring sequences include proline and glycine, both of which are capable of introducing bends into the backbone. Other residues that can displace backbone atoms are those that are capable of participating in salt bridges and hydrogen bonds.
[0062] A residue that interacts with a CDR side chain is one that is reasonably expected to form a noncovalent bond with a CDR side chain, generally either a salt bridge or hydrogen bond. Such residues are identified by three dimensional positioning of their side chains. A salt or ion bridge could be expected to form between two side chains positioned within about 2.5-3.2 Angstroms of one another that bear opposite charges, for example a lysinyl and a glutamyl pairing. A hydrogen bond could be expected to form between the side chains of residue pairs such as seryl or threonyl with aspartyl or glutamyl (or other hydrogen accepting residues). Such pairings are well known in the protein chemistry art and will be apparent to the artisan upon three dimensional modeling of the candidate immunoglobulin.
[0063] Immunoglobulin residues that affect the interface between heavy and light chain variable regions (“the V
[0064] Since it is not entirely possible to predict in advance what the exact impact of a given substitution will be it may be necessary to make the substitution and assay the candidate antibody for the desired characteristic. These steps, however, are per se routine and well within the ordinary skill of the art.
[0065] CDR and FR residues are determined according to a standard sequence definition (Kabat et al.,
[0066] Throughout this description, reference is made to the numbering scheme from Kabat, E. A., et al,
[0067] For purposes of this invention, to assign residue numbers to a candidate antibody amino acid sequence which is not included in the Kabat compendium, one follows the following steps. Generally, the candidate sequence is aligned with any immunoglobulin sequence or any consensus sequence in Kabat. Alignment may be done by hand, or by computer using commonly accepted computer programs; an example of such a program is the Align 2 program discussed in this description. Alignment may be facilitated by using some amino acid residues which are common to most Fab sequences. For example, the light and heavy chains each typically have two cysteines which have the same residue numbers; in V
[0068] Thus, in humanization of an import variable sequence, where one cuts out an entire human or consensus CDR and replaces it with an import CDR sequence, (a) the exact number of residues may be swapped, leaving the numbering the same, (b) fewer import amino acid residues may be introduced than are cut, in which case there will be a gap in the residue numbers, or (c) a larger number of amino acid residues may be introduced then were cut, in which case the numbering will involve the use of suffixes such as 100abcde.
[0069] The terms “consensus sequence” and “consensus antibody” as used herein refers to an amino acid sequence which comprises the most frequently occurring amino acid residues at each location in all immunoglobulins of any particular subclass or subunit structure. The consensus sequence may be based on immunoglobulins of a particular species or of many species. A “consensus” sequence, structure, or antibody is understood to encompass a consensus human sequence as described in certain embodiments of this invention, and to refer to an amino acid sequence which comprises the most frequently occurring amino acid residues at each location in all human immunoglobulins of any particular subclass or subunit structure. This invention provides consensus human structures and consensus structures which consider other species in addition to human.
[0070] The subunit structures of the five immunoglobulin classes in humans are as follows:
Class Heavy Chain Subclasses Light Chain Molecular Formula IgG γ γ1, γ2, γ3, γ4 κ or λ (γ IgA α α1, α2 κ or λ (α IgM μ none κ or λ (μ IgD δ none κ or λ (δ IgE ε none κ or λ (ε
[0071] In preferred embodiments of an IgGγ1 human consensus sequence, the consensus variable domain sequences are derived from the most abundant subclasses in the sequence compilation of Kabat et al., DIQMTQSPSSLSASVGDRVTITCRASQDVSSYLAWYQQKPGKAPKLLIYAASSLESGVPSRFSG (SEQ. ID NO. 3) SGSGTDFTLTISSLQPEDFATYYCQQYNSLPYTFGQGTKVEIKRT;
[0072] the VEVQLVESGGGLVQPGGSLRLSCAASGFTFSDYAMSWVRQAPGKGLEWVAVISENGGYTRYAD (SEQ. ID NO. 4) SVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAMDVWGQGTLVTVSS.
[0073] These sequences include consensus CDRs as well as consensus FR residues (see for example in
[0074] While not wishing to be limited to any particular theories, it may be that these preferred embodiments are less likely to be immunogenic in an individual than less abundant subclasses. However, in other embodiments, the consensus sequence is derived from other subclasses of human immunoglobulin variable domains. In yet other embodiments, the consensus sequence is derived from human constant domains.
[0075] Identity or homology with respect to a specified amino acid sequence of this invention is defined herein as the percentage of amino acid residues in a candidate sequence that are identical with the specified residues, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent homology, and not considering any conservative substitutions as part of the sequence identity. None of N-terminal, C-terminal or internal extensions, deletions, or insertions into the specified sequence shall be construed as affecting homology. All sequence alignments called for in this invention are such maximal homology alignments. While such alignments may be done by hand using conventional methods, a suitable computer program is the “Align 2” program for which protection is being sought from the U.S. Register of Copyrights (Align 2, by Genentech, Inc., application filed 9 Dec. 1991).
[0076] “Non-homologous” import antibody residues are those residues which are not identical to the amino acid residue at the analogous or corresponding location in a consensus sequence, after the import and consensus sequences are aligned.
[0077] The term “computer representation” refers to information which is in a form that can be manipulated by a computer. The act of storing a computer representation refers to the act of placing the information in a form suitable for manipulation by a computer.
[0078] This invention is also directed to novel polypeptides, and in certain aspects, isolated novel humanized anti-p185DIQMTQSPSSLSASVGDRVTITCRASQDVNTAVAWYQQKPGKAPKLLIYSASFLESGVP (SEQ. ID NO. 1 SRFSGSRSGTDFTLTISSLQPEDFATYYCQQHYTTPPTFGQGTKVEIKRT,
[0079] which is the light chain variable domain of huMAb4D5); or
EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARIYPTNGYTR (SEQ. ID NO. 2, which is the heavy chain variable domain YADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAMDVWGQGTLV of huMAb4D5) TVSS.
[0080] “Biological property”, as relates for example to anti-p185
[0081] Thus, the biologically active and antigenically active huMAb4D5 polypeptides that are the subject of certain embodiments of this invention include the sequence of the entire translated nucleotide sequence of huMAb4D5; mature huMAb4D5; fragments thereof having a consecutive sequence of at least 5, 10, 15, 20, 25, 30 or 40 amino acid residues comprising sequences from muMAb4D5 plus residues from the human FR of huMAb4D5; amino acid sequence variants of huMAb4D5 wherein an amino acid residue has been inserted N- or C-terminal to, or within, huMAb4D5 or its fragment as defined above; amino acid sequence variants of huMAb4D5 or its fragment as defined above wherein an amino acid residue of huMAb4D5 or its fragment as defined above has been substituted by another residue, including predetermined mutations by, e.g., site-directed or PCR mutagenesis; derivatives of huMAb4D5 or its fragments as defined above wherein huMAb4D5 or its fragments have been covalent modified, by substitution, chemical, enzymatic, or other appropriate means, with a moiety other than a naturally occurring amino acid; and glycosylation variants of huMAb4D5 (insertion of a glycosylation site or deletion of any glycosylation site by deletion, insertion or substitution of suitable residues). Such fragments and variants exclude any polypeptide heretofore identified, including muMAb4D5 or any known polypeptide fragment, which are anticipatory order 35 U.S.C. 102 as well as polypeptides obvious thereover under 35 U.S.C. 103.
[0082] An “isolated” polypeptide means polypeptide which has been identified and separated and/or recovered from a component of its natural environment. Contaminant components of its natural environment are materials which would interfere with diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or nonproteinaceous solutes. In preferred embodiments, for example, a polypeptide product comprising huMAb4D5 will be purified from a cell culture or other synthetic environment (1) to greater than 95% by weight of protein as determined by the Lowry method, and most preferably more than 99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a gas- or liquid-phase sequenator (such as a commercially available Applied Biosystems sequenator Model 470, 477, or 473), or (3) to homogeneity by SDS-PAGE under reducing or nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated huMAb4D5 includes huMAb4D5 in situ within recombinant cells since at least one component of the huMAb4D5 natural environment will not be present. Ordinarily, however, isolated huMAb4D5 will be prepared by at least one purification step.
[0083] In accordance with, this invention, huMAb4D5 nucleic acid is RNA or DNA containing greater than ten bases that encodes a biologically or antigenically active huMAb4D5, is complementary to nucleic acid sequence encoding such huMAb4D5, or hybridizes to nucleic acid sequence encoding such huMAb4D5 and remains stably bound to it under stringent conditions, and comprises nucleic acid from a muMAb4D5 CDR and a human FR region.
[0084] Preferably, the huMAb4D5 nucleic acid encodes a polypeptide sharing at least 75% sequence identity, more preferably at least 80%, still more preferably at least 85%, even more preferably at 90%, and most preferably 95%, with the huMAb4D5 amino acid sequence. Preferably, a nucleic acid molecule that hybridizes to the huMAb4D5 nucleic acid contains at least 20, more preferably 40, and most preferably 90 bases. Such hybridizing or complementary nucleic acid, however, is further defined as being novel under 35 U.S.C. 102 and unobvious under 35 U.S.C. 103 over any prior. art nucleic acid.
[0085] Stringent conditions are those that (1) employ low ionic strength and high temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium citrate/0/1% NaDodSO
[0086] The term “control sequences” refers to DNA sequences necessary for the expression of an operably linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, a ribosome binding site, and possibly, other as yet poorly understood sequences. Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers.
[0087] Nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, “operably linked” means that the DNA sequences being linked are contiguous and, in the case of a secretory leader, contiguous and in reading phase. However enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, then synthetic oligonucleotide adaptors or linkers are used in accord with conventional practice.
[0088] An “exogenous” element is defined herein to mean nucleic acid sequence that is foreign to the cell, or homologous to the cell but in a position within the host cell nucleic acid in which the element is ordinarily not found.
[0089] As used herein, the expressions “cell,” “cell line,” and “cell culture” are used interchangeably and all such designations include progeny. Thus, the words “transformants” and “transformed cells” include the primary subject cell and cultures derived therefrom without regard for the number of transfers. It is also understood that all progeny may not be precisely identical in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that have the same function or biological, activity as screened for in the originally transformed cell are included. Where distinct designations are intended, it will be clear from the context.
[0090] “Oligonucleotides” are short-length, single- or double-stranded polydeoxynucleotides that are chemically synthesized by known methods (such as phosphotriester, phosphite, or phosphoramidite chemistry, using solid phase techniques such as described in EP 266,032 published 4 May 1988, or via deoxynucleoside H-phosphonate intermediates as described by Froehler et al.,
[0091] The technique of “polymerase chain reaction,” or “PCR,” as used herein generally refers to a procedure wherein minute amounts of a specific piece of nucleic acid, RNA and/or DNA, are amplified as described in U.S. Pat. No. 4,683,195 issued 28 Jul. 1987. Generally, sequence information from the ends of the region of interest or beyond needs to be available, such that oligonucleotide primers can be designed; these primers will be identical or similar in sequence to opposite strands of the template to be amplified. The 5′ terminal nucleotides of the two primers may coincide with the ends of the amplified material. PCR can be used to amplify specific RNA sequences, specific DNA sequences from total genomic DNA, and cDNA transcribed from total cellular RNA, bacteriophage or plasmid sequences, etc. See generally Mullis et al.,
[0092] Some aspects of this invention include obtaining an import, non-human antibody variable domain, producing a desired humanized antibody sequence and for humanizing an antibody gene sequence are described below. A particularly preferred method of changing a gene sequence, such as gene conversion from a non-human or consensus sequence into a humanized nucleic acid sequence, is the cassette mutagenesis procedure described in Example 1. Additionally, methods are given for obtaining and producing antibodies generally, which apply equally to native non-human antibodies as well as to humanized antibodies.
[0093] Generally, the antibodies and antibody variable domains of this invention are conventionally prepared in recombinant cell culture, as described in more detail below. Recombinant synthesis is preferred for reasons of safety and economy, but it is known to prepare peptides by chemical synthesis and to purify them from natural sources; such preparations are included within the definition of antibodies herein.
[0094] Molecular Modeling
[0095] An integral step in our approach to antibody humanization is construction of computer graphics models of the import and humanized antibodies. These models are used to determine if the six complementarity-determining regions (CDRs) can be successfully transplanted from the import framework to a human one and to determine which framework residues from the import antibody, if any, need to be incorporated into the humanized antibody in order to maintain CDR conformation. In addition, analysis of the sequences of the import and humanized antibodies and reference to the models can help to discern which framework residues are unusual and thereby might be involved in antigen binding or maintenance of proper antibody structure.
[0096] All of the humanized antibody models of this invention are based on a single three-dimensional computer graphics structure hereafter referred to as the consensus structure. This consensus structure is a key distinction from the approach of previous workers in the field, who typically begin by selecting a human antibody structure which has an amino acid sequence which is similar to the sequence of their import antibody.
[0097] The consensus structure of one embodiment of this invention was built in five steps as described below.
[0098] Step 1: Seven Fab X-ray crystal structures from the Brookhaven Protein Data Bank were used (entries 2FB4, 2RHE, 3FAB, and 1 REI which are human structures, and 2MCP, 1 FBJ, and 2HFL which are murine structures). For each structure, protein mainchain geometry and hydrogen bonding patterns were used to assign each residue to one of three secondary structure types: alpha-helix, beta-strand or other (i.e. non-helix and non-strand). The immunoglobulin residues used in superpositioning and those included in the consensus structure are shown in Table 1.
TABLE I Immunoglobulin Residues Used in Superpositioning and Those Included in the Consensus Structure V Ig 2FB4 2RHE 2MCP 3FAB 1FBJ 2HFL 1REI Consensus 2-11 18-24 18-24 19-25 18-24 19-25 19-25 19-25 16-27 32-37 34-39 39-44 32-37 32-37 32-37 33-38 33-39 41-49 60-66 62-68 67-72 53-66 60-65 60-65 61-66 59-77 69-74 71-76 76-81 69-74 69-74 69-74 70-75 84-88 86-90 91-95 84-88 84-88 84-88 85-89 82-91 101-105 RMS 0.40 0.60 0.53 0.54 0.48 0.50 V Ig 2FB4 2MCP 3FAB 1FBJ 2HFL Consensus 3-8 18-25 18-25 18-25 18-25 18-25 17-23 34-39 34-39 34-39 34-39 34-39 33-41 46-52 46-52 46-52 46-52 46-52 45-51 57-61 59-63 56-60 57-61 57-61 57-61 68-71 70-73 67-70 68-71 68-71 66-71 78-84 80-86 77-83 78-84 78-84 75-82 92-99 94-101 91-98 92-99 92-99 88-94 102-108 RMS 0.43 0.85 0.62 0.91 RMS 0.91 0.73 0.77 0.92
[0099] Step 2: Having identified the alpha-helices and beta-strands in each of the seven structures, the structures were superimposed on one another using the INSIGHT computer program (Biosym Technologies, San Diego, Calif.) as follows: The 2FB4 structure was arbitrarily chosen as the template (or reference) structure. The 2FB4 was held fixed in space and the other six structures rotated and translated in space so that their common secondary structural elements (i.e. alpha-helices and beta-strands) were oriented such that these common elements were as close in position to one another as possible. (This superpositioning was performed using accepted mathematical formulae rather than actually physically moving the structures by hand.)
[0100] Step 3: With the seven structures thus superimposed, for each residue in the template (2FB4) Fab one calculates the distance from the template alpha-carbon atom (Cα) to the analogous Cα atom in each of the other six superimposed structures. This results in a table of Cα-Cα distances for each residue position in the sequence. Such a table is necessary in order to determine which residue positions will be included in the consensus model. Generally, if all Cα-Cα distances for a given residue position were ≦1.0 Å, that position was included in the consensus structure. If for a given position only one Fab crystal structure was >1.0 Å, the position was included but the outlying crystal structure was not included in the next step (for this position only). In general, the seven β-strands were included in the consensus structure while some of the loops connecting the β-strands, e.g. complementarity-determining regions (CDRs), were not included in view of Cα divergence.
[0101] Step 4: For each residue which was included in the consensus structure after step 3, the average of the coordinates for individual mainchain N, Cα, C, O and Cβ atoms were calculated. Due to the averaging procedure, as well as variation in bond length, bond angle and dihedral angle among the crystal structures, this “average” structure contained some bond lengths and angles which deviated from standard geometry. For purposes of this invention, “standard geometry” is understood to include geometries commonly accepted as typical, such as the compilation of bond lengths and angles from small molecule structures in Weiner, S. J. et. al.,
[0102] Step 5: In order to correct these deviations, the final step was to subject the “average” structure to 50 cycles of energy minimization (DISCOVER program, Biosym Technologies) using the AMBER (Weiner, S. J. et. al., TABLE II Average Bond Lengths and Angles for “Average” (Before) and Energy-Minimized Consensus (After 50 Cycles) Structures V V V V Standard before after before after Geometry (Å) (Å) (Å) (Å) (Å) N-Cα 1.459(0.012) 1.451(0.004) 1.451(0.023) 1.452(0.004) 1.449 Cα-C 1.515(0.012) 1.523(0.005) 1.507(0.033) 1.542(0.005) 1.522 O═C 1.208(0.062) 1.229(0.003) 1.160(0.177) 1.231(0.003) 1.229 C—N 1.288(0.049) 1.337(0.002) 1.282(0.065) 1.335(0.004) 1.335 Cα-Cβ 1.508(0.026) 1.530(0.002) 1.499(0.039) 1.530(0.002) 1.526 (*) (*) (*) (*) (*) C—N-Cα 123.5(4.2) 123.8(1.1) 125.3(4.6) 124.0(1.1) 121.9 N-Cα-C 110.0(4.0) 109.5(1.9) 110.3(2.8) 109.5(1.6) 110.1 Cα-C—N 116.6(4.0) 116.6(1.2) 117.6(5.2) 116.6(0.8) 116.6 O═C-N 123.1(4.1) 123.4(0.6) 122.2(4.9) 123.3(0.4) 122.9 N-Cα-Cβ 110.3(2.1) 109.8(0.7) 110.6(2.5) 109.8(0.6) 109.5 Cβ-Cα-C 111.4(2.4) 111.1(0.7) 111.2(2.2) 111.1(0.6) 111.1
[0103] The consensus structure might conceivably be dependent upon which crystal structure was chosen as the template on which the others were superimposed. As a test, the entire procedure was repeated using the crystal structure with the worst superposition versus 2FB4, i.e. the 2HFL Fab structure, as the new template (reference). The two consensus structures compare favorably (root-mean-squared deviation of 0.11 Å for all N, Cα and C atoms).
[0104] Note that the consensus structure only includes mainchain (N, Cα, C, O, Cβ atoms) coordinates for only those residues which are part of a conformation common to all seven X-ray crystal structures. For the Fab structures, these include the common β-strands (which comprise two β-sheets) and a few non-CDR loops which connect these β-strands. The consensus structure does not include CDRs or sidechains, both of which vary in their conformation among the seven structures. Also, note that the consensus structure includes only the V
[0105] This consensus structure is used as the archetype. It is not particular to any species, and has only the basic shape without side chains. Starting with this consensus structure the model of any import, human, or humanized Fab can be constructed as follows. Using the amino acid sequence of the particular antibody VL and VH domains of interest, a computer graphics program (such as INSIGHT, Biosym Technologies) is used to add sidechains and CDRs to the consensus structure. When a sidechain is added, its conformation is chosen on the basis of known Fab crystal structures (see the Background section for publications of such crystal structures) and rotamer libraries (Ponder, J. W. & Richards, F. M.,
[0106] CDRs are added to the model (now having the backbone plus side chains) as follows. The size (i.e. number of amino acids) of each import CDR is compared to canonical CDR structures tabulated by Chothia et al.,
[0107] However, if no matching canonical CDR can be assigned for the import CDR, then one of two options can be exercised. First, using a program such as INSIGHT (Biosym Technologies), the Brookhaven Protein Data Bank can be searched for loops with a similar size to that of the import CDR and these loops can be evaluated as possible conformations for the import CDR in the model. Minimally, such loops must exhibit a conformation in which no loop atom overlaps with other protein atoms. Second, one can use available programs which calculate possible loop conformations, assuming a given loop size, using methods such as described by Bruccoleri et al.,
[0108] When all CDRs and sidechains have been added to the consensus structure to give the final model (import, human or humanized), the model is preferably subjected to energy minimization using programs which are available commercially (e.g. DISCOVER, Biosym Technologies). This technique uses complex mathematical formulae to refine the model by performing such tasks as checking that all atoms are within appropriate distances from one another and checking that bond lengths and angles are within chemically acceptable limits.
[0109] Models of a humanized, import or human antibody sequence are used in the practice of this invention to understand the impact of selected amino acid residues of the activity of the sequence being modeled. For example, such a model can show residues which may be important in antigen binding, or for maintaining the conformation of the antibody, as discussed in more detail below. Modeling can also be used to explore the potential impact of changing any amino acid residue in the antibody sequence.
[0110] Methods for Obtaining a Humanized Antibody Sequence
[0111] In the practice of this invention, the first step in humanizing an import antibody is deriving a consensus amino acid sequence into which to incorporate the import sequences. Next a model is generated for these sequences using the methods described above. In certain embodiments of this invention, the consensus human sequences are derived from the most abundant subclasses in the sequence compilation of Kabat et al. (Kabat, E. A. et al.,
[0112] While these steps may be taken in different order, typically a structure for the candidate humanized antibody is created by transferring the at least one CDR from the non-human, import sequence into the consensus human structure, after the entire corresponding human CDR has been removed. The humanized antibody may contain human replacements of the non-human import residues at positions within CDRs as defined by sequence variability (Kabat, E. A. et al.,
[0113] Differences between the non-human import and the human consensus framework residues are individually investigated to determine their possible influence on CDR conformation and/or binding to antigen. Investigation of such possible influences is desirably performed through modeling, by examination of the characteristics of the amino acids at particular locations, or determined experimentally through evaluating the effects of substitution or mutagenesis of particular amino acids.
[0114] In certain preferred embodiments of this invention, a humanized antibody is made comprising amino acid sequence of an import, non-human antibody and a human antibody, utilizing the steps of:
[0115] a. obtaining the amino acid sequences of at least a portion of an import antibody variable domain and of a consensus human variable domain;
[0116] b. identifying Complementarity Determining Region (CDR) amino acid sequences in the import and the human variable domain sequences;
[0117] c. substituting an import CDR amino acid sequence for the corresponding human CDR amino acid sequence;
[0118] d. aligning the amino acid sequences of a Framework Region (FR) of the import antibody and the corresponding FR of the consensus antibody;
[0119] e. identifying import antibody FR residues in the aligned FR sequences that are non-homologous to the corresponding consensus antibody residues;
[0120] f. determining if the non-homologous import amino acid residue is reasonably expected to have at least one of the following effects:
[0121] 1. non-covalently binds antigen directly,
[0122] 2. interacts with a CDR; or
[0123] 3. participates in the V
[0124] g. for any non-homologous import antibody amino acid residue which is reasonably expected to have at least one of these effects, substituting that residue for the corresponding amino acid residue in the consensus antibody FR sequence.
[0125] Optionally, one determines if any non-homologous residues identified in step (e) are exposed on the surface of the domain or buried within it, and if the residue is exposed but has none of the effects identified in step (f), one may retain the consensus residue.
[0126] Additionally, in certain embodiments the corresponding consensus antibody residues identified in step (e) above are selected from the group consisting of 4L, 35L, 36L, 38L, 43L, 44L, 46L, 58L, 62L, 63L, 64L, 65L, 66L, 67L, 68L, 69L, 70L, 71L, 73L, 85L, 87L, 98L, 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 58H, 60H, 67H, 68H, 69H, 70H, 73H, 74H, 75L, 76H, 78H, 91H, 92H, 93H, and 103H (utilizing the numbering system set forth in Kabat, E. A. et al.,
[0127] In preferred embodiments, the method of this invention comprises the additional steps of searching either or both of the import, non-human and the consensus variable domain sequences for glycosylation sites, determining if the glycosylation is reasonably expected to be important for the desired antigen binding and biological activity of the antibody (i.e., determining if the glycosylation site binds to antigen or changes a side chain of an amino acid residue that binds to antigen, or if the glycosylation enhances or weakens antigen binding, or is important for maintaining antibody affinity). If the import sequence bears the glycosylation site, it is preferred to substitute that site for the corresponding residues in the consensus human sequence if the glycosylation site is reasonably expected to be important. If only the consensus sequence, and not the import, bears the glycosylation site, it is preferred to eliminate that glycosylation site or substitute therefor the corresponding amino acid residues from the import sequence.
[0128] Another preferred embodiment of the methods of this invention comprises aligning import antibody and the consensus antibody FR sequences, identifying import antibody FR residues which are non-homologous with the aligned consensus FR sequence, and for each such non-homologous import antibody FR residue, determining if the corresponding consensus antibody residue represents a residue which is highly conserved across all species at that site, and if it is so conserved, preparing a humanized antibody which comprises the consensus antibody amino acid residue at that site.
[0129] In certain alternate embodiments, one need not utilize the modeling and evaluation steps described above, and may instead proceed with the-steps of obtaining the amino acid sequence of at least a portion of an import, non-human antibody variable domain having a CDR and a FR, obtaining the amino acid sequence of at least a portion of a consensus human antibody variable domain having a CDR and a FR, substituting the non-human CDR for the human CDR in the consensus human antibody variable domain, and then substituting an amino acid residue for the consensus amino acid residue at at least one of the following sites:
[0130] a. (in the FR of the variable domain of the light chain) 4L, 35L, 36L, 38L, 43L, 44L, 58L, 46L, 62L, 63L, 64L, 65L, 66L, 67L, 68L, 69L, 70L, 71L, 73L, 85L, 87L, 98L, or
[0131] b. (in the FR of the variable domain of the heavy chain) 2H, 4H, 24H, 36H, 37H, 39H, 43H, 45H, 49H, 58H, 60H, 67H, 68H, 69H, 70H, 73H, 74H, 75H, 76H, 78H, 91H, 92H, 93H, and 103H.
[0132] Preferably, the non-CDR residue substituted at the consensus FR site is the residue found at the corresponding location of the non-human antibody. If desired, one may utilize the other method steps described above for determining whether a particular amino acid residue can reasonably be expected to have undesirable effects, and remedying those effects.
[0133] If after making a humanized antibody according to the steps above and testing its activity one is not satisfied with the humanized antibody, one preferably reexamines the potential effects of the amino acids at the specific locations recited above. Additionally, it is desirable to reinvestigate any buried residues which are reasonably expected to affect the V
[0134] In certain embodiments of this invention, amino acid residues in the consensus human sequence are substituted for by other amino acid residues. In preferred embodiments, residues from a particular non-human import sequence are substituted, however there are circumstances where it is desired to evaluate the effects of other amino acids. For example, if after making a humanized antibody according to the steps above and testing its activity one is not satisfied with the humanized antibody, one may compare the sequences of other classes or subgroups of human antibodies, or classes or subgroups of antibodies from the particular non-human species, and determine which other amino acid side chains and amino acid residues are found at particular locations and substituting such other residues.
[0135] Antibodies
[0136] Certain aspects of this invention are directed to natural antibodies and to monoclonal antibodies, as illustrated in the Examples below and by antibody hybridomas deposited with the ATCC (as described below). Thus, the references throughout this description to the use of monoclonal antibodies are intended to include the use of natural or native antibodies as well as humanized and chimeric antibodies. As used herein, the term “antibody” includes the antibody variable domain and other separable antibody domains unless specifically excluded.
[0137] In accordance with certain aspects of this invention, antibodies to be humanized (import antibodies) are isolated from continuous hybrid cell lines formed by the fusion of antigen-primed immune lymphocytes with myeloma cells.
[0138] In certain embodiments, the antibodies of this invention are obtained by routine screening. Polyclonal antibodies to an antigen generally are raised in animals by multiple subcutaneous (sc) or intraperitoneal (ip) injections of the antigen and an adjuvant. It may be useful to conjugate the antigen or a fragment containing the target amino acid sequence to a protein that is immunogenic in the species to be immunized, e.g., keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, or soybean trypsin inhibitor using a bifunctional or derivatizing agent, for example, maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteine residues), N-hydroxysuccinimide (through lysine residues), glutaraldehyde, succinic anhydride, SOCl
[0139] The route and schedule of the host animal or cultured antibody-producing cells therefrom are generally in keeping with established and conventional techniques for antibody stimulation and production. While mice are frequently employed as the test model, it is contemplated that any mammalian subject including human subjects or antibody-producing cells obtained therefrom can be manipulated according to the processes of this invention to serve as the basis for production of mammalian, including human, hybrid cell lines.
[0140] Animals are typically immunized against the immunogenic conjugates or derivatives by combining 1 mg or 1 μg of conjugate (for rabbits or mice, respectively) with 3 volumes of Freund's complete adjuvant and injecting the solution intradermally at multiple sites. One month later the animals are boosted with 1/5 to 1/10 the original amount of conjugate in Freund's complete adjuvant (or other suitable adjuvant) by subcutaneous injection at multiple sites. 7 to 14 days later animals are bled and the serum is assayed for antigen titer. Animals are boosted until the titer plateaus. Preferably, the animal is boosted with the conjugate of the same antigen, but conjugated to a different protein and/or through a different cross-linking agent. Conjugates also can be made in recombinant cell culture as protein fusions. Also, aggregating agents such as alum are used to enhance the immune response.
[0141] After immunization, monoclonal antibodies are prepared by recovering immune lymphoid cells—typically spleen cells or lymphocytes from lymph node tissue—from immunized animals and immortalizing the cells in conventional fashion, e.g. by fusion with myeloma cells or by Epstein-Barr (EB)-virus transformation and screening for clones expressing the desired antibody. The hybridoma technique described originally by Kohler and Milstein,
[0142] It is possible to fuse cells of one species with another. However, it is preferable that the source of the immunized antibody producing cells and the myeloma be from the same species.
[0143] The hybrid cell lines can be maintained in culture in vitro in cell culture media. The cell lines of this invention can be selected and/or maintained in a composition comprising the continuous cell line in hypoxanthine-aminopterin thymidine (HAT) medium. In fact, once the hybridoma cell line is established, it can be maintained on a variety of nutritionally adequate media. Moreover, the hybrid cell lines can be stored and preserved in any number of conventional ways, including freezing and storage under liquid nitrogen. Frozen cell lines can be revived and cultured indefinitely with resumed synthesis and secretion of monoclonal antibody. The secreted antibody is recovered from tissue culture supernatant by conventional methods such as precipitation, Ion exchange chromatography, affinity chromatography, or the like. The antibodies described herein are also recovered from hybridoma cell cultures by conventional methods for purification of IgG or IgM as the case may be that heretofore have been used to purify these immunoglobulins from pooled plasma, e.g. ethanol or polyethylene glycol precipitation procedures. The purified antibodies are sterile filtered, and optionally are conjugated to a detectable marker such as an enzyme or spin label for use in diagnostic assays of the antigen in test samples.
[0144] While routinely rodent monoclonal antibodies are used as the source of the import antibody, the invention is not limited to any species. Additionally, techniques developed for the production of chimeric antibodies (Morrison et al.,
[0145] Techniques for creating recombinant DNA versions of the antigen-binding regions of antibody molecules (known as Fab fragments) which bypass the generation of monoclonal antibodies are encompassed within the practice of this invention. One extracts antibody-specific messenger RNA molecules from immune system cells taken from an immunized animal, transcribes these into complementary DNA (cDNA), and clones the cDNA into a bacterial expressions system. One example of such a technique suitable for the practice of this invention was developed by researchers at Scripps/Stratagene, and incorporates a proprietary bacteriophage lambda vector system which contains a leader sequence that causes the expressed Fab protein to migrate to the periplasmic space (between the bacterial cell membrane and the cell wall) or to be secreted. One can rapidly generate and screen great numbers of functional FAb fragments for those which bind the antigen. Such FAb fragments with specificity for the antigen are specifically encompassed within the term “antibody” as it is defined, discussed, and claimed herein.
[0146] Amino Acid Sequence Variants
[0147] Amino acid sequence variants of the antibodies and polypeptides of this invention (referred to in herein as the target polypeptide) are prepared by introducing appropriate nucleotide changes into the DNA encoding the target polypeptide, or by in vitro synthesis of the desired target polypeptide. Such variants include, for example, humanized variants of non-human antibodies, as well as deletions from, or insertions or substitutions of, residues within particular amino acid sequences. Any combination of deletion, insertion, and substitution can be made to arrive at the final construct, provided that the final construct possesses the desired characteristics. The amino acid changes also may alter post-translational processes of the target polypeptide, such as changing the number or position of glycosylation sites, altering any membrane anchoring characteristics, and/or altering the intra-cellular location of the target polypeptide by inserting, deleting, or otherwise affecting any leader sequence of the native target polypeptide.
[0148] In designing amino acid sequence variants of target polypeptides, the location of the mutation site and the nature of the mutation will depend on the target polypeptide characteristic(s) to be modified. The sites for mutation can be modified individually or in series, e.g., by (1) substituting first with conservative amino acid choices and then with more radical selections depending upon the results achieved, (2) deleting the target residue, or (3) inserting residues of the same or a different class adjacent to the located site, or combinations of options 1-3. In certain embodiments, these choices are guided by the methods for creating humanized sequences set forth above.
[0149] A useful method for identification of certain residues or regions of the target polypeptide that are preferred locations for mutagenesis is called “alanine scanning mutagenesis” as described by Cunningham and Wells (
[0150] There are two principal variables in the construction of amino acid sequence variants: the location of the mutation site and the nature of the mutation. In general, the location and nature of the mutation chosen will depend upon the target polypeptide characteristic to be modified.
[0151] Amino acid sequence deletions of antibodies are generally not preferred, as maintaining the generally configuration of an antibody is believed to be necessary for its activity. Any deletions will be selected so as to preserve the structure of the target antibody.
[0152] Amino acid sequence insertions include amino- and/or carboxyl-terminal fusions ranging in length from one residue to polypeptides containing a hundred or more residues, as well as intrasequence insertions of single or multiple amino acid residues. Intrasequence insertions (i.e., insertions within the target polypeptide sequence) may range generally from about 1 to 10 residues, more preferably 1 to 5, most preferably 1 to 3. Examples of terminal insertions include the target polypeptide with an N-terminal methionyl residue, an artifact of the direct expression of target polypeptide in bacterial recombinant cell culture, and fusion of a heterologous N-terminal signal sequence to the N-terminus of the target polypeptide molecule to facilitate the secretion of the mature target polypeptide from recombinant host cells. Such signal sequences generally will be obtained from, and thus homologous to, the intended host cell species. Suitable sequences include STII or lpp for
[0153] Other insertional variants of the target polypeptide include the fusion to the N- or C-terminus of the target polypeptide of immunogenic polypeptides, e.g., bacterial polypeptides such as beta-lactamase or an enzyme encoded by the
[0154] Another group of variants are amino acid substitution variants. These variants have at least one amino acid residue in the target polypeptide molecule removed and a different residue inserted in its place. The sites of greatest interest for substitutional mutagenesis include sites identified as the active site(s) of the target polypeptide, and sites where the amino acids found in the target polypeptide from various species are substantially different in terms of side-chain bulk, charge, and/or hydrophobicity. Other sites for substitution are described infra, considering the effect of the substitution of the antigen binding, affinity and other characteristics of a particular target antibody.
[0155] Other sites of interest are those in which particular residues of the target polypeptides obtained from various species are identical. These positions may be important for the biological activity of the target polypeptide. These sites, especially those falling within a sequence of at least three other identically conserved sites, are substituted in a relatively conservative manner. If such substitutions result in a change in biological activity, then other changes are introduced and the products screened until the desired effect is obtained.
[0156] Substantial modifications in function or immunological identity of the target polypeptide are accomplished by selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues are divided into groups based on common side chain properties:
[0157] (1) hydrophobic: norleucine, met, ala, val, leu, ile;
[0158] (2) neutral hydrophilic: cys, ser, thr;
[0159] (3) acidic: asp, glu;
[0160] (4) basic: asn, gln, his, lys, arg;
[0161] (5) residues that influence chain orientation: gly, pro; and
[0162] (6) aromatic: trp, tyr, phe.
[0163] Non-conservative substitutions will entail exchanging a member of one of these classes for another. Such substituted residues may be introduced into regions of the target polypeptide that are homologous with other antibodies of the same class or subclass, or, more preferably, into the non-homologous regions of the molecule.
[0164] Any cysteine residues not involved in maintaining the proper conformation of target polypeptide also may be substituted, generally with serine, to improve the oxidative stability of the molecule and prevent aberrant crosslinking.
[0165] DNA encoding amino acid sequence variants of the target polypeptide is prepared by a variety of methods known in the art. These methods include, but are not limited to, isolation from a natural source (in the case of naturally occurring amino acid sequence variants) or preparation by oligonucleotide-mediated (or site-directed) mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier prepared variant or a non-variant version of the target polypeptide. A particularly preferred method of gene conversion mutagenesis is described below in Example 1. These techniques may utilized target polypeptide nucleic acid (DNA or RNA), or nucleic acid complementary to the target polypeptide nucleic acid.
[0166] Oligonucleotide-mediated mutagenesis is a preferred method for preparing substitution, deletion, and insertion variants of target polypeptide DNA. This technique is well known in the art as described by Adelman et al.,
[0167] Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide will have 12 to 15 nucleotides that are completely complementary to the template on either side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will hybridize properly to the single-stranded DNA template molecule. The oligonucleotides are readily synthesized using techniques known in the art such as that described by Crea et al. (
[0168] Single-stranded DNA template may also be generated by denaturing double-stranded plasmid (or other) DNA using standard techniques.
[0169] For alteration of the native DNA sequence (to generate amino acid sequence variants, for example), the oligonucleotide is hybridized to the single-stranded template under suitable hybridization conditions. A DNA polymerizing enzyme, usually the Klenow fragment of DNA polymerase I, is then added to synthesize the complementary strand of the template using the oligonucleotide as a primer for synthesis. A heteroduplex molecule is thus formed such that one strand of DNA encodes the mutated form of the target polypeptide, and the other strand (the original template) encodes the native, unaltered sequence of the target polypeptide. This heteroduplex molecule is then transformed into a suitable host cell, usually a prokaryote such as
[0170] The method described immediately above may be modified such that a homoduplex molecule is created wherein both strands of the plasmid contain the mutation(s). The modifications are as follows: The single-stranded oligonucleotide is annealed to the single-stranded template as described above. A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with a modified thio-deoxyribocytosine called dCTP-(aS) (which can be obtained from Amersham Corporation). This mixture is added to the template-oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a strand of DNA identical to the template except for the mutated bases is generated. In addition, this new strand of DNA will contain dCTP-(aS) instead of dCTP, which serves to protect it from restriction endonuclease digestion.
[0171] After the template strand of the double-stranded heteroduplex is nicked with an appropriate restriction enzyme, the template strand can be digested with ExoIII nuclease or another appropriate nuclease past the region that contains the site(s) to be mutagenized. The reaction is then stopped to leave a molecule that is only partially single-stranded. A complete double-stranded DNA homoduplex is then formed using DNA polymerase in the presence of all four deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex molecule can then be transformed into a suitable host cell such as
[0172] DNA encoding target polypeptide variants with more than one amino acid to be substituted may be generated in one of several ways. If the amino acids are located close together in the polypeptide chain, they may be mutated simultaneously using one oligonucleotide that codes for all of the desired amino acid substitutions. If, however, the amino acids are located some distance from each other (separated by more than about ten amino acids), it is more difficult to generate a single oligonucleotide that encodes all of the desired changes. Instead, one of two alternative methods may be employed.
[0173] In the first method, a separate oligonucleotide is generated for each amino acid to be substituted. The oligonucleotides are then annealed to the single-stranded template DNA simultaneously, and the second strand of DNA that is synthesized from the template will encode all of the desired amino acid substitutions.
[0174] The alternative method involves two or more rounds of mutagenesis to produce the desired mutant. The first round is as described for the single mutants: wild-type DNA is used for the template, an oligonucleotide encoding the first desired amino acid substitution(s) is annealed to this template, and the heteroduplex DNA molecule is then generated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of mutagenesis as the template. Thus, this template already contains one or more mutations. The oligonucleotide encoding the additional desired amino acid substitution(s) is then annealed to this template, and the resulting strand of DNA now encodes mutations from both the first and second rounds of mutagenesis. This resultant DNA can be used as a template in a third round of mutagenesis, and so on.
[0175] PCR mutagenesis is also suitable for making amino acid variants of target polypeptide. While the following discussion refers to DNA, it is understood that the technique also finds application with RNA. The PCR technique generally refers to the following procedure (see Erlich, supra, the chapter by R. Higuchi, p. 61-70): When small amounts of template DNA are used as starting material in a PCR, primers that differ slightly in sequence from the corresponding region in a template DNA can be used to generate relatively large quantities of a specific DNA fragment that differs from the template sequence only at the positions where the primers differ from the template. For introduction of a mutation into a plasmid DNA, one of the primers is designed to overlap the position of the mutation and to contain the mutation; the sequence of the other primer must be identical to a stretch of sequence of the opposite strand of the plasmid, but this sequence can be located anywhere along the plasmid DNA. It is preferred, however, that the sequence of the second primer is located within 200 nucleotides from that of the first, such that in the end the entire amplified region of DNA bounded by the primers can be easily sequenced. PCR amplification using a primer pair like the one just described results in a population of DNA fragments that differ at the position of the mutation specified by the primer, and possibly at other positions, as template copying is somewhat error-prone.
[0176] If the ratio of template to product material is extremely low, the vast majority of product DNA fragments incorporate the desired mutation(s). This product material is used to replace the corresponding region in the plasmid that served as PCR template using standard DNA technology. Mutations at separate positions can be introduced simultaneously by either using a mutant second primer, or performing a second PCR with different mutant primers and ligating the two resulting PCR fragments simultaneously to the vector fragment in a three (or more)-part ligation.
[0177] In a specific example of PCR mutagenesis, template plasmid DNA (1 μg) is linearized by digestion with a restriction endonuclease that has a unique recognition site in the plasmid DNA outside of the region to be amplified. Of this material, 100 ng is added to a PCR mixture containing PCR buffer, which contains the four deoxynucleotide tri-phosphates and is included in the GeneAmp® kits (obtained from Perkin-Elmer Cetus, Norwalk, Conn. and Emeryville, Calif.), and 25 pmole of each oligonucleotide primer, to a final volume of 50 μl. The reaction mixture is overlayed with 35 μl mineral oil. The reaction is denatured for 5 minutes at 100° C., placed briefly on ice, and then 1 μl Thermus aquaticus (Taq) DNA polymerase (5 units/μl purchased from Perkin-Elmer Cetus, Norwalk, Conn. and Emeryville, Calif.) is added below the mineral oil layer. The reaction mixture is then inserted into a DNA Thermal Cycler (purchased from Perkin-Elmer Cetus) programmed as follows: 2 min. at 55° C., then 30 sec. at 72° C., then 19 cycles of the following: 30 sec. at 94° C., 30 sec. at 55° C., and 30 sec. at 72° C.
[0178] At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous phase transferred to a new vial, extracted with phenol/chloroform (50:50:vol), and ethanol precipitated, and the DNA is recovered by standard procedures. This material is subsequently subjected to the appropriate treatments for insertion into a vector.
[0179] Another method for preparing variants, cassette mutagenesis, is based on the technique described by Wells et al. (
[0180] Insertion of DNA into a Cloning Vehicle
[0181] The cDNA or genomic DNA encoding the target polypeptide is inserted into a replicable vector for further cloning (amplification of the DNA) or for expression. Many vectors are available, and selection of the appropriate vector will depend on 1) whether it is to be used for DNA amplification or for DNA expression, 2) the size of the DNA to be inserted into the vector, and 3) the host cell to be transformed with the vector. Each vector contains various components depending on its function (amplification of DNA or expression of DNA) and the host cell for which it is compatible. The vector components generally include, but are not limited to, one or more of the following: a signal sequence, an origin of replication, one or more marker genes, an enhancer element, 8 promoter, and a transcription termination sequence.
[0182] (a) Signal Sequence Component
[0183] In general, the signal sequence may be a component of the vector, or it may be a part of the target polypeptide DNA that is inserted into the vector.
[0184] The target polypeptides of this invention may be expressed not only directly, but also as a fusion with a heterologous polypeptide, preferably a signal sequence or other polypeptide having a specific cleavage site at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of the vector, or it may be a part of the target polypeptide DNA that is inserted into the vector. Included within the scope of this invention are target polypeptides with any native signal sequence deleted and replaced with a heterologous signal sequence. The heterologous signal sequence selected should be one that is recognized and processed (i.e. cleaved by a signal peptidase) by the host cell. For prokaryotic host cells that do not recognize and process the native target polypeptide signal sequence, the signal sequence is substituted by a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion the native target polypeptide signal sequence may be substituted by the yeast invertase, alpha factor, or acid phosphatase leaders. In mammalian cell expression the native signal sequence is satisfactory, although other mammalian signal sequences may be suitable.
[0185] (b) Origin of Replication Component
[0186] Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate in one or more selected host cells. Generally, in cloning vectors this sequence is one that enables the vector to replicate independently of the host chromosomal DNA, and includes origins of replication or autonomously replicating sequences. Such sequences are well known for a variety of bacteria, yeast, and viruses. The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2 μ plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in mammalian cells. Generally, the origin of replication component is not needed for mammalian expression vectors (the SV40 origin may typically be used only because it contains the early promoter).
[0187] Most expression vectors are “shuttle” vectors, i.e. they are capable of replication in at least one class of organisms but can be transfected into another organism for expression. For example, a vector is cloned in
[0188] DNA may also be amplified by insertion into the host genome. This is readily accomplished using Bacillus species as hosts, for example, by including in the vector a DNA sequence that is complementary to a sequence found in Bacillus genomic DNA. Transfection of Bacillus with this vector results in homologous recombination with the genome and insertion of the target polypeptide DNA. However, the recovery of genomic DNA encoding the target polypeptide is more complex than that of an exogenously replicated vector because restriction enzyme digestion is required to excise the target polypeptide DNA.
[0189] (c) Selection Gene Component
[0190] Expression and cloning vectors should contain a selection gene, also termed a selectable marker. This gene encodes a protein necessary for the survival or growth of transformed host cells grown in a selective culture medium. Host cells not transformed with the vector containing the selection gene will not survive in the culture medium. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g. ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients not available from complex media, e.g. the gene encoding D-alanine racemase for Bacilli.
[0191] One example of a selection scheme utilizes a drug to arrest growth of a host cell. Those cells that are successfully transformed with a heterologous gene express a protein conferring drug resistance and thus survive the selection regimen. Examples of such dominant selection use the drugs neomycin (Southern et al.,
[0192] Another example of suitable selectable markers for mammalian cells are those that enable the identification of cells competent to take up the target polypeptide nucleic acid, such as dihydrofolate reductase (DHFR) or thymidine kinase. The mammalian cell transformants are placed under selection pressure which only the transformants are uniquely adapted to survive by virtue of having taken up the marker. Selection pressure is imposed by culturing the transformants under conditions in which the concentration of selection agent in the medium is successively changed, thereby leading to amplification of both the selection gene and the DNA that encodes the target polypeptide. Amplification is the process by which genes in greater demand for the production of a protein critical for growth are reiterated in tandem within the chromosomes of successive generations of recombinant cells. Increased quantities of the target polypeptide are synthesized from the amplified DNA.
[0193] For example, cells transformed with the DHFR selection gene are first identified by culturing all of the transformants in a culture medium that contains methotrexate (Mtx), a competitive antagonist of DHFR. An appropriate host cell when wild-type DHFR is employed is the Chinese hamster ovary (CHO) cell line deficient in DHFR activity, prepared and propagated as described by Urlaub and Chasin,
[0194] A suitable selection gene for use in yeast is the trp1 gene present in the yeast plasmid YRp7 (Stinchcomb et al.,
[0195] (d) Promoter Component
[0196] Expression and cloning vectors usually contain a promoter that is recognized by the host organism and is operably linked to the target polypeptide nucleic acid. Promoters are untranslated sequences located upstream (5′) to the start codon of a structural gene (generally within about 100 to 1000 bp) that control the transcription and translation of a particular nucleic acid sequence, such as that encoding the target polypeptide, to which they are operably linked. Such promoters typically fall into two classes, inducible and constitutive. Inducible promoters are promoters that initiate increased levels of transcription from DNA under their control in response to some change in culture conditions, e.g. the presence or absence of a nutrient or a change in temperature. At this time a large number of promoters recognized by a variety of potential host cells are well known. These promoters are operably linked to DNA encoding the target polypeptide by removing the promoter from the source DNA by restriction enzyme digestion and inserting the isolated promoter sequence into the vector. Both the native target polypeptide promoter sequence and many heterologous promoters may be used to direct amplification and/or expression of the target polypeptide DNA. However, heterologous promoters are preferred, as they generally permit greater transcription and higher yields of expressed target polypeptide as compared to the native target polypeptide promoter.
[0197] Promoters suitable for use with prokaryotic hosts include the β-lactamase and lactose promoter systems (Chang et al.,
[0198] Suitable promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate kinase (Hitzeman et al.,
[0199] Other yeast promoters, which are inducible promoters having the additional advantage of transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters for use in yeast expression are further described in Hitzeman et al., EP 73,657A. Yeast enhancers also are advantageously used with yeast promoters.
[0200] Promoter sequences are known for eukaryotes. Virtually all eukaryotic genes have an AT-rich region located approximately 25 to 30 bases upstream from the site where transcription is initiated. Another sequence found 70 to 80 bases upstream from the start of transcription of many genes is a CXCAAT region where X may be any nucleotide. At the 3′ end of most eukaryotic genes is an AATAAA sequence that may be the signal for addition of the poly A tail to the 3′ end of the coding sequence. All of these sequences are suitably inserted into mammalian expression vectors.
[0201] Target polypeptide transcription from vectors in mammalian host cells is controlled by promoters obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 Jul. 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and most preferably Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g. the actin promoter or an immunoglobulin promoter, from heat-shock promoters, and from the promoter normally associated with the target polypeptide sequence, provided such promoters are compatible with the host cell systems.
[0202] The early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction fragment that also contains the SV40 viral origin of replication. Fiers et a.,
[0203] (e) Enhancer Element Component
[0204] Transcription of DNA encoding the target polypeptide of this invention by higher eukaryotes is often increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10-300 bp, that act on a promoter to increase its transcription. Enhancers are relatively orientation and position independent having been found 5′ (Laimins et al.,
[0205] (f) Transcription Termination Component
[0206] Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5′ and, occasionally 3′ untranslated regions of eukaryotic or viral DNA or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding the target polypeptide. The 3′ untranslated regions also include transcription termination sites.
[0207] Construction of suitable vectors containing one or more of the above listed components the desired coding and control sequences employs standard ligation techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in the form desired to generate the plasmids required.
[0208] For analysis to confirm correct sequences in plasmids constructed, the ligation mixtures are used to transform
[0209] Particularly useful in the practice of this invention are expression vectors that provide for the transient expression in mammalian cells of DNA encoding the target polypeptide. In general, transient expression involves the use of an expression vector that is able to replicate efficiently in a host cell, such that the host cell accumulates many copies of the expression vector and, in turn, synthesizes high levels of a desired polypeptide encoded by the expression vector. Transient expression systems, comprising a suitable expression vector and a host cell, allow for the convenient positive identification of polypeptides encoded by cloned DNAs, as well as for the rapid screening of such polypeptides for desired biological or physiological properties. Thus, transient expression systems are particularly useful in the invention for purposes of identifying analogs and variants of the target polypeptide that have target polypeptide-like activity.
[0210] Other methods, vectors, and host cells suitable for adaptation to the synthesis of the target polypeptide in recombinant vertebrate cell culture are described in Gething et al.,
[0211] Selection and Transformation of Host Cells
[0212] Suitable host cells for cloning or expressing the vectors herein are the prokaryote, yeast, or higher eukaryote cells described above. Suitable prokaryotes include eubacteria, such as Gram-negative or Gram-positive organisms, for example,
[0213] In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable hosts for target polypeptide-encoding vectors.
[0214] Suitable host cells for the expression of glycosylated target polypeptide are derived from multicellular organisms. Such host cells are capable of complex processing and glycosylation activities. In principle, any higher eukaryotic cell culture is workable, whether from vertebrate or invertebrate culture. Examples of invertebrate cells include plant and insect cells. Numerous baculoviral strains and variants and corresponding permissive insect host cells from hosts such as
[0215] However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in culture (tissue culture) has become a routine procedure in recent years [
[0216] Host cells are transfected and preferably transformed with the above-described expression or cloning vectors of this invention and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences.
[0217] Transfection refers to the taking up of an expression vector by a host cell whether or not any coding sequences are in fact expressed. Numerous methods of transfection are known to the ordinarily skilled artisan, for example, CaPO
[0218] Transformation means introducing DNA into an organism so that the DNA is replicable, either as an extrachromosomal element or by chromosomal integrant. Depending on the host cell used, transformation is done using standard techniques appropriate to such cells. The calcium treatment employing calcium chloride, as described in section 1.82 of Sambrook et al., supra, is generally used for prokaryotes or other cells that contain substantial cell-wall barriers. Infection with
[0219] Culturing the Host Cells
[0220] Prokaryotic cells used to produce the target polypeptide of this invention are cultured in suitable media as described generally in Sambrook et al., supra.
[0221] The mammalian host cells used to produce the target polypeptide of this invention may be cultured in a variety of media. Commercially available media such as Ham's F10 (Sigma), Minimal Essential Medium ([MEM], Sigma), RPMI-1640 (Sigma), and Dulbecco's Modified Eagle's Medium ([DMEM], Sigma) are suitable for culturing the host cells. In addition, any of the media described in Ham and Wallace,
[0222] The host cells referred to in this disclosure encompass cells in in vitro culture as well as cells that are within a host animal.
[0223] It is further envisioned that the target polypeptides of this invention may be produced by homologous recombination, or with recombinant production methods utilizing control elements introduced into cells already containing DNA encoding the target polypeptide currently in use in the field. For example, a powerful promoter/enhancer element, a suppressor, or an exogenous transcription modulatory element is inserted in the genome of the intended host cell in proximity and orientation sufficient to influence the transcription of DNA encoding the desired target polypeptide. The control element does not encode the target polypeptide of this invention, but the DNA is present in the host cell genome. One next screens for cells making the target polypeptide of this invention, or increased or decreased levels of expression, as desired.
[0224] Detecting Gene Amplification/Expression
[0225] Gene amplification and/or expression may be measured in a sample directly, for example, by conventional Southern blotting, northern blotting to quantitate the transcription of mRNA (Thomas,
[0226] Gene expression, alternatively, may be measured by immunological methods, such as immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate directly the expression of gene product. With immunohistochemical staining techniques, a cell sample is prepared, typically by dehydration and fixation, followed by reaction with labeled antibodies specific for the gene product coupled, where the labels are usually visually detectable, such as enzymatic labels, fluorescent labels, luminescent labels, and the like. A particularly sensitive staining technique suitable for use in the present invention is described by Hsu et al.,
[0227] Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be prepared against a native target polypeptide or against a synthetic peptide based on the DNA sequences provided herein as described further in Section 4 below.
[0228] Purification of the Target Polypeptide
[0229] The target polypeptide preferably is recovered from the culture medium as a secreted polypeptide, although it also may be recovered from host cell lysates when directly expressed without a secretory signal.
[0230] When the target polypeptide is expressed in a recombinant cell other than one of human origin, the target polypeptide is completely free of proteins or polypeptides of human origin. However, it is necessary to purify the target polypeptide from recombinant cell proteins or polypeptides to obtain preparations that are substantially homogeneous as to the target polypeptide. As a first step, the culture medium or lysate is centrifuged to remove particulate cell debris. The membrane and soluble protein fractions are then separated. The target polypeptide may then be purified from the soluble protein fraction and from the membrane fraction of the culture lysate, depending on whether the target polypeptide is membrane bound. The following procedures are exemplary of suitable purification procedures: fractionation on immunoaffinity or ion-exchange columns; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, Sephadex G-75; and protein A Sepharose columns to remove contaminants such as IgG.
[0231] Target polypeptide variants in which residues have been deleted, inserted or substituted are recovered in the same fashion, taking account of any substantial changes in properties occasioned by the variation. For example, preparation of a target polypeptide fusion with another protein or polypeptide, e.g. a bacterial or viral antigen, facilitates purification; an immunoaffinity column containing antibody to the antigen (or containing antigen, where the target polypeptide is an antibody) can be used to adsorb the fusion. Immunoaffinity columns such as a rabbit polyclonal anti-target polypeptide column can be employed to absorb the target polypeptide variant by binding it to at least one remaining immune epitope. A protease inhibitor such as phenyl methyl sulfonyl fluoride (PMSF) also may be useful to inhibit proteolytic degradation during purification, and antibiotics may be included to prevent the growth of adventitious contaminants. One skilled in the art will appreciate that purification methods suitable for native target polypeptide may require modification to account for changes in the character of the target polypeptide or its variants upon expression in recombinant cell culture.
[0232] Covalent Modifications of Target Polypeptides
[0233] Covalent modifications of target polypeptides are included within the scope of this invention. One type of covalent modification included within the scope of this invention is a target polypeptide fragment. Target polypeptide fragments having up to about 40 amino acid residues may be conveniently prepared by chemical synthesis, or by enzymatic or chemical cleavage of the full-length target polypeptide or variant target polypeptide. Other types of covalent modifications of the target polypeptide or fragments thereof are introduced into the molecule by reacting specific amino acid residues of the target polypeptide or fragments thereof with an organic derivatizing agent that is capable of reacting with selected side chains or the N- or C-terminal residues.
[0234] Cysteinyl residues most commonly are reacted with α-haloacetates (and corresponding amines), such as chloroacetic acid or chloroacetamide, to give carboxymethyl or carboxyamidomethyl derivatives. Cysteinyl residues also are derivatized by reaction with bromotrifluoroacetone, α-bromo-β-(5-imidozoyl)propionic acid, chloroacetyl phosphate, N-alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl2-pyridyl disulfide, p-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-1,3-diazole.
[0235] Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5-7.0 because this agent is relatively specific for the histidyl side chain. Para-bromophenacyl bromide also is useful; the reaction is preferably performed in 0.1 M sodium cacodylate at pH 6.0.
[0236] Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid anhydrides. Derivatization with these agents has the effect of reversing the charge of the lysinyl residues. Other suitable reagents for derivatizing α-amino-containing residues include imidoesters such as methyl picolinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic acid; O-methylisourea; 2,4-pentanedione; and transaminase-catalyzed reaction with glyoxylate.
[0237] Arginyl residues are modified by reaction with one or several conventional reagents, among them phenylglyoxal, 2,3-butanedione, 1,2-cyclohexanedione, and ninhydrin. Derivatization of arginine residues requires that the reaction be performed in alkaline conditions because of the high pK
[0238] The specific modification of tyrosyl residues may be made, with particular interest in introducing spectral labels into tyrosyl residues by reaction with aromatic diazonium compounds or tetranitromethane. Most commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyl tyrosyl species and 3-nitro derivatives, respectively. Tyrosyl residues are iodinated using
[0239] Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with carbodiimides (R′—N═C═N—R′), where R and R′ are different alkyl groups, such as 1-cyclohexyl-3-(2-morpholinyl-4-ethyl) carbodiimide or 1-ethyl-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues are converted to asparaginyl and glutaminyl residues by reaction with ammonium ions.
[0240] Derivatization with bifunctional agents is useful for crosslinking target polypeptide to a water-insoluble support matrix or surface for use in the method for purifying anti-target polypeptide antibodies, and vice versa. Commonly used crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3′-dithiobis(succinimidylpropionate), and bifunctional maleimides such as bis-N-maleimido-1,8-octane. Derivatizing agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate yield photoactivatable intermediates that are capable of forming crosslinks in the presence of light. Alternatively, reactive water-insoluble matrices such as cyanogen bromide-activated carbohydrates and the reactive substrates described in U.S. Pat. Nos. 3,969,287; 3,691,016; 4,195,128; 4,247,642; 4,229,537; and 4,330,440 are employed for protein immobilization.
[0241] Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding glutamyl and aspartyl residues, respectively. Alternatively, these residues are deamidated under mildly acidic conditions. Either form of these residues falls within the scope of this invention.
[0242] Other modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the α-amino groups of lysine, arginine, and histidine side chains (T. E. Creighton.
[0243] Another type of covalent modification of the target polypeptide included within the scope of this invention comprises altering the native glycosylation pattern of the polypeptide. By altering is meant deleting one or more carbohydrate moieties found in the native target polypeptide, and/or adding one or more glycosylation sites that are not present in the native target polypeptide.
[0244] Glycosylation of polypeptides is typically either N-linked or O-linked. N-linked refers to the attachment of the carbohydrate moiety to the side chain of an asparagine residue. The tri-peptide sequences asparagine-X-serine and asparagine-X-threonine, where X is any amino acid except proline, are the recognition sequences for enzymatic attachment of the carbohydrate moiety to the asparagine side chain. Thus, the presence of either of these tri-peptide sequences in a polypeptide creates a potential glycosylation site. O-linked glycosylation refers to the attachment of one of the sugars N-acetylgalactosamine, galactose, or xylose, to a hydroxyamino acid, most commonly serine or threonine, although 5-hydroxyproline or 5-hydroxylysine may also be used.
[0245] Addition of glycosylation sites to the target polypeptide is conveniently accomplished by altering the amino acid sequence such that it contains one or more of the above-described tri-peptide sequences (for N-linked glycosylation sites). The alteration may also be made by the addition of, or substitution by, one or more serine or threonine residues to the native target polypeptide sequence (for O-linked glycosylation sites). For ease, the target polypeptide amino acid sequence is preferably altered through changes at the DNA level, particularly by mutating the DNA encoding the target polypeptide at preselected bases such that codons are generated that will translate into the desired amino acids. The DNA mutation(s) may be made using methods described above under the heading of “Amino Acid Sequence Variants of Target Polypeptide”.
[0246] Another means of increasing the number of carbohydrate moieties on the target polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. These procedures are advantageous in that they do not require production of the polypeptide in a host cell that has glycosylation capabilities for N- and O-linked glycosylation. Depending on the coupling mode used, the sugar(s) may be attached to (a) arginine and histidine, (b) free carboxyl groups, (c) free sulfhydryl groups such as those of cysteine, (d) free hydroxyl groups such as those of serine, threonine, or hydroxyproline, (e) aromatic residues such as those of phenylalanine, tyrosine, or tryptophan, or (f) the amide group of glutamine. These methods are described in WO 87/05330 published 11 Sep. 1987, and in Aplin and Wriston (
[0247] Removal of carbohydrate moieties present on the native target polypeptide may be accomplished chemically or enzymatically. Chemical deglycosylation requires exposure of the polypeptide to the compound trifluoromethanesulfonic acid, or an equivalent compound. This treatment results in the cleavage of most or all sugars except the linking sugar (N-acetylglucosamine or N-acetylgalactosamine), while leaving the polypeptide intact. Chemical deglycosylation is described by Hakimuddin et al. (
[0248] Glycosylation at potential glycosylation sites may be prevented by the use of the compound tunicamycin as described by Duskin et al. (
[0249] Another type of covalent modification of the target polypeptide comprises linking the target polypeptide to various nonproteinaceous polymers, e.g. polyethylene glycol, polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S. Pat. Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.
[0250] The target polypeptide also may be entrapped in microcapsules prepared, for example, by coacervation techniques or by interfacial polymerization (for example, hydroxymethylcellulose or gelatin-microcapsules and poly-[methylmethacrylate]microcapsules, respectively), in colloidal drug delivery systems (for example, liposomes, albumin microspheres, microemulsions, nano-particles and nanocapsules), or in macroemulsions. Such techniques are disclosed in
[0251] Target polypeptide preparations are also useful in generating antibodies, for screening for binding partners, as standards in assays for the target polypeptide (e.g. by labeling the target polypeptide for use as a standard in a radioimmunoassay, enzyme-linked immunoassay, or radioreceptor assay), in affinity purification techniques, and in competitive-type receptor binding assays when labeled with radioiodine, enzymes, fluorophores, spin labels, and the like.
[0252] Since it is often difficult to predict in advance the characteristics of a variant target polypeptide, it will be appreciated that some screening of the recovered variant will be needed to select the optimal variant. For example, a change in the immunological character of the target polypeptide molecule, such as affinity for a given antigen or antibody, is measured by a competitive-type immunoassay. The variant is assayed for changes in the suppression or enhancement of its activity by comparison to the activity observed for the target polypeptide in the same assay. Other potential modifications of protein or polypeptide properties such as redox or thermal stability, hydrophobicity, susceptibility to proteolytic degradation, stability in recombinant cell culture or in plasma, or the tendency to aggregate with carriers or into multimers are assayed by methods well known in the art.
[0253] Diagnostic and Related Uses of the Antibodies
[0254] The antibodies of this invention are useful in diagnostic assays for antigen expression in specific cells or tissues. The antibodies are detectably labeled and/or are immobilized on an insoluble matrix.
[0255] The antibodies of this invention find further use for the affinity purification of the antigen from recombinant cell culture or natural sources. Suitable diagnostic assays for the antigen and its antibodies depend on the particular antigen or antibody. Generally, such assays include competitive and sandwich assays, and steric inhibition assays. Competitive and sandwich methods employ a phase-separation step as an integral part of the method while steric inhibition assays are conducted in a single reaction mixture. Fundamentally, the same procedures are used for the assay of the antigen and for substances that bind the antigen, although certain methods will be favored depending upon the molecular weight of the substance being assayed. Therefore, the substance to be tested is referred to herein as an analyte, irrespective of its status otherwise as an antigen or antibody, and proteins that bind to the analyte are denominated binding partners, whether they be antibodies, cell surface receptors, or antigens.
[0256] Analytical methods for the antigen or its antibodies all use one or more of the following reagents: labeled analyte analogue, immobilized analyte analogue, labeled binding partner, immobilized binding partner and steric conjugates. The labeled reagents also are known as “tracers.”
[0257] The label used (and this is also useful to label antigen nucleic acid for use as a probe) is any detectable functionality that does not interfere with the binding of analyte and its binding partner. Numerous labels are known for use in immunoassay, examples including moieties that may be detected directly, such as fluorochrome, chemiluminescent and radioactive labels, as well as moieties, such as enzymes, that must be reacted or derivatized to be detected. Examples of such labels include the radioisotopes
[0258] Conventional methods are available to bind these labels covalently to proteins or polypeptides. For instance, coupling agents such as dialdehydes, carbodiimides, dimaleimides, bis-imidates, bis-diazotized benzidine, and the like may be used to tag the antibodies with the above-described fluorescent, chemiluminescent, and enzyme labels. See, for example, U.S. Pat. No. 3,940,475 (fluorimetry) and U.S. Pat. No. 3,645,090 (enzymes); Hunter et al.,
[0259] The conjugation of such label, including the enzymes, to the antibody is a standard manipulative procedure for one of ordinary skill in immunoassay techniques. See, for example, O'Sullivan et al., “Methods for the Preparation of Enzyme-antibody Conjugates for Use in Enzyme Immunoassay,” in
[0260] Immobilization of reagents is required for certain assay methods. Immobilization entails separating the binding partner from any analyte that remains free in solution. This conventionally is accomplished by either insolubilizing the binding partner or analyte analogue before the assay procedure, as by adsorption to a water-insoluble matrix or surface (Bennich et al., U.S. Pat. No. 3,720,760), by covalent coupling (for example, using glutaraldehyde cross-linking), or by insolubilizing the partner or analogue afterward, e.g., by immunoprecipitation.
[0261] Other assay methods, known as competitive or sandwich assays, are well established and widely used in the commercial diagnostics industry.
[0262] Competitive assays rely on the ability of a tracer analogue to compete with the test sample analyte for a limited number of binding sites on a common binding partner. The binding partner generally is insolubilized before or after the competition and then the tracer and analyte bound to the binding partner are separated from the unbound tracer and analyte. This separation is accomplished by decanting (where the binding partner was preinsolubilized) or by centrifuging (where the binding partner was precipitated after the competitive reaction). The amount of test sample analyte is inversely proportional to the amount of bound tracer as measured by the amount of marker substance. Dose-response curves with known amounts of analyte are prepared and compared with the test results to quantitatively determine the amount of analyte present in the test sample. These assays are called ELISA systems when enzymes are used as the detectable markers.
[0263] Another species of competitive assay, called a “homogeneous” assay, does not require a phase separation. Here, a conjugate of an enzyme with the analyte is prepared and used such that when anti-analyte binds to the analyte the presence of the anti-analyte modifies the enzyme activity. In this case, the antigen or its immunologically active fragments are conjugated with a bifunctional organic bridge to an enzyme such as peroxidase. Conjugates are selected for use with antibody so that binding of the antibody inhibits or potentiates the enzyme activity of the label; This method per se is widely practiced under the name of EMIT.
[0264] Steric conjugates are used in steric hindrance methods for homogeneous assay. These conjugates are synthesized by covalently linking a low-molecular-weight hapten to a small analyte so that antibody to hapten substantially is unable to bind the conjugate at the same time as anti-analyte. Under this assay procedure the analyte present in the test sample will bind anti-analyte, thereby allowing anti-hapten to bind the conjugate, resulting in a change in the character of the conjugate hapten, e.g., a change in fluorescence when the hapten is a fluorophore.
[0265] Sandwich assays particularly are useful for the determination of antigen or antibodies. In sequential sandwich assays an immobilized binding partner is used to adsorb test sample analyte, the test sample is removed as by washing, the bound analyte is used to adsorb labeled binding partner, and bound material is then separated from residual tracer. The amount of bound tracer is directly proportional to test sample analyte. In “simultaneous” sandwich assays the test sample is not separated before adding the labeled binding partner. A sequential sandwich assay using an anti-antigen monoclonal antibody as one antibody and a polyclonal anti-antigen antibody as the other is useful in testing samples for particular antigen activity.
[0266] The foregoing are merely exemplary diagnostic assays for the import and humanized antibodies of this invention. Other methods now or hereafter developed for the determination of these analytes are included within the scope hereof, including the bioassays described above.
[0267] Immunotoxins
[0268] This invention is also directed to immunochemical derivatives of the antibodies of this 5 invention such as immunotoxins (conjugates of the antibody and a cytotoxic moiety). Antibodies which carry the appropriate effector functions, such as with their constant domains, are also used to induce lysis through the natural complement process, and to interact with antibody dependent cytotoxic cells normally present.
[0269] For example, purified, sterile filtered antibodies are optionally conjugated to a cytotoxin such as ricin for use in AIDS therapy. U.S. patent application Ser. No. 07/350,895 illustrates methods for making and using immunotoxins for the treatment of HIV infection. The methods of this invention, for example, are suitable for obtaining humanized antibodies for use as immunotoxins for use in AIDS therapy.
[0270] The cytotoxic moiety of the immunotoxin may be a cytotoxic drug or an enzymatically active toxin of bacterial, fungal, plant or animal origin, or an enzymatically active fragment of such a toxin. Enzymatically active toxins and fragments thereof used are diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from
[0271] Immunotoxins can be made in a variety of ways, as discussed herein. Commonly known crosslinking reagents can be used to yield stable conjugates.
[0272] Advantageously, monoclonal antibodies specifically binding the domain of the antigen which is exposed on the infected cell surface, are conjugated to ricin A chain. Most advantageously the ricin A chain is deglycosylated and produced through recombinant means. An advantageous method of making the ricin immunotoxin is described in Vitetta et al.,
[0273] When used to kill infected human cells in vitro for diagnostic purposes, the conjugates will typically be added to the cell culture medium at a concentration of at least about 10 nM. The formulation and mode of administration for in vitro use are not critical. Aqueous formulations that are compatible with the culture or perfusion medium will normally be used. Cytotoxicity may be read by conventional techniques.
[0274] Cytotoxic radiopharmaceuticals for treating infected cells may be made by conjugating radioactive isotopes (e.g. I, Y, Pr) to the antibodies. Advantageously alpha particle-emitting isotopes are used. The term “cytotoxic moiety” as used herein is intended to include such isotopes.
[0275] In a preferred embodiment, ricin A chain is deglycosylated or produced without oligosaccharides, to decrease its clearance by irrelevant clearance mechanisms (e.g., the liver). In another embodiment, whole ricin (A chain plus B chain) is conjugated to antibody if the galactose binding property of B-chain can be blocked (“blocked ricin”).
[0276] In a further embodiment toxin-conjugates are made with Fab or F(ab′)
[0277] In another embodiment, fusogenic liposomes are filled with a cytotoxic drug and the liposomes are coated with antibodies specifically binding the particular antigen.
[0278] Antibody Dependent Cellular Cytotoxicity
[0279] Certain aspects of this invention involve antibodies which are (a) directed against a particular antigen and (b) belong to a subclass or isotype that is capable of mediating the lysis of cells to which the antibody molecule binds. More specifically, these antibodies should belong to a subclass or isotype that, upon complexing with cell surface proteins, activates serum complement and/or mediates antibody dependent cellular cytotoxicity (ADCC) by activating effector cells such as natural killer cells or macrophages.
[0280] Biological activity of antibodies is known to be determined, to a large extent, by the constant domains or Fc region of the antibody molecule (Uananue and Benacerraf,
[0281] In general, mouse antibodies of the IgG2a and IgG3 subclass and occasionally IgG1 can mediate ADCC, and antibodies of the IgG3, IgG2a, and IgM subclasses bind and activate serum complement. Complement activation generally requires the binding of at least two IgG molecules in close proximity on the target cell. However, the binding of only one IgM molecule activates serum complement.
[0282] The ability of any particular antibody to mediate lysis of the target cell by complement activation and/or ADCC can be assayed. The cells of interest are grown and labeled in vitro; the antibody is added to the cell culture in combination with either serum complement or immune cells which may be activated by the antigen antibody complexes. Cytolysis of the target cells is detected by the release of label from the lysed cells. In fact, antibodies can be screened using the patient's own serum as a source of complement and/or immune cells. The antibody that is capable of activating complement or mediating ADCC in the in vitro test can then be used therapeutically in that particular patient.
[0283] This invention specifically encompasses consensus Fc antibody domains prepared and used according to the teachings of this invention.
[0284] Therapeutic and Other Uses of the Antibodies
[0285] When used in vivo for therapy, the antibodies of the subject invention are administered to the patient in therapeutically effective amounts (i.e. amounts that have desired therapeutic effect). They will normally be administered parenterally. The dose and dosage regimen will depend upon the degree of the infection, the characteristics of the particular antibody or immunotoxin used, e.g., its therapeutic index, the patient, and the patient's history. Advantageously the antibody or immunotoxin is administered continuously over a period of 1-2 weeks, intravenously to treat cells in the vasculature and subcutaneously and intraperitoneally to treat regional lymph nodes. Optionally, the administration is made during the course of adjunct therapy such as combined cycles of radiation, chemotherapeutic treatment, or administration of tumor necrosis factor, interferon or other cytoprotective or immunomodulatory agent.
[0286] For parenteral administration the antibodies will be formulated in a unit dosage injectable form (solution, suspension, emulsion) in association with a pharmaceutically acceptable parenteral vehicle. Such vehicles are inherently nontoxic, and non-therapeutic. Examples of such vehicles are water, saline, Ringer's solution, dextrose solution, and 5% human serum albumin. Nonaqueous vehicles such as fixed oils and ethyl oleate can also be used. Liposomes may be used as carriers. The vehicle may contain minor amounts of additives such as substances that enhance isotonicity and chemical stability, e.g., buffers and preservatives. The antibodies will typically be formulated in such vehicles at concentrations of about 1 mg/ml to 10 mg/ml.
[0287] Use of IgM antibodies may be preferred for certain applications, however IgG molecules by being smaller may be more able than IgM molecules to localize to certain types of infected cells.
[0288] There is evidence that complement activation in vivo leads to a variety of biological effects, including the induction of an inflammatory response and the activation of macrophages (Uananue and Benecerraf,
[0289] Optionally, the antibodies of this invention are useful in passively immunizing patients, as exemplified by the administration of humanized anti-HIV antibodies.
[0290] The antibody compositions used in therapy are formulated and dosages established in a fashion consistent with good medical practice taking into account the disorder to be treated, the condition of the individual patient, the site of delivery of the composition, the method of administration and other factors known to practitioners. The antibody compositions are prepared for administration according to the description of preparation of polypeptides for administration, infra.
[0291] Deposit of Materials
[0292] As described above, cultures of the muMAb4D5 have been deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Md., USA (ATCC).
[0293] This deposit was made under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest Treaty). This assures maintenance of viable cultures for 30 years from the date of the deposit. The organisms will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of the cultures to the public upon issuance of the pertinent U.S. patent or upon laying open to the public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 USC 122 and the Commissioner's rules pursuant thereto (including 37 CFR §1.12 with particular reference to 886 OG 638).
[0294] In respect of those designations in which a European patent is sought, a sample of the deposited microorganism will be made available until the publication of the mention of the grant of the European patent or until the date on which the application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person requesting the sample. (Rule 28(4) EPC)
[0295] The assignee of the present application has agreed that if the cultures on deposit should die or be lost or destroyed when cultivated under suitable conditions, they will be promptly replaced on notification with a viable specimen of the same culture. Availability of the deposited strain is not to be construed as a license to practice the invention in contravention of the rights granted under the authority of any government in accordance with its patent laws.
[0296] The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice the invention. The present invention is not to be limited in scope by the constructs deposited, since the deposited embodiments are intended to illustrate only certain aspects of the invention and any constructs that are functionally equivalent are within the scope of this invention. The deposit of material herein does not constitute an admission that the written description herein contained is inadequate to enable the practice of any aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the claims to the specific illustrations that they represent. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description and fall within the scope of the appended claims.
[0297] It is understood that the application of the teachings of the present invention to a specific problem or situation will be within the capabilities of one having ordinary skill in the art in light of the teachings contained herein. Examples of the products of the present invention and representative processes for their isolation, use, and manufacture appear below, but should not be construed to limit the invention.
[0298] Here we report the chimerization of muMAb4D5 (chMAb4D5) and the rapid and simultaneous humanization of heavy (V
[0299] Cloning of Variable Region Genes. The muMAb4D5 V
[0300] Molecular Modelling. Models for muMAb4D5 V
[0301] In humanizing muMAb4D5, consensus human sequences were first derived from the most abundant subclasses in the sequence compilation of Kabat et al. (Kabat, E. A. et al.,
[0302] Construction of Chimeric Genes. Genes encoding chMAb4D5 light and heavy chains were separately assembled in previously described phagemid vectors containing the human cytomegalovirus enhancer and promoter, a 5′ intron and SV40 polyadenylation signal (Gorman, C. M. et al.,
[0303] Construction of Humanized Genes. Genes encoding chMAb4D5 light chain and heavy chain Fd fragment (V
[0304] Expression and Purification of MAb4D5 Variants. Appropriate MAb4D5 light and heavy chain cDNA expression vectors were co-transfected into an adenovirus transformed human embryonic kidney cell line, 293 (Graham, F. L. et al.,
[0305] Cell Proliferation Assay. The effect of MAb4D5 variants upon proliferation of the human mammary adenocarcinoma cell line, SK-BR-3, was investigated as previously described (Fendly, B. M. et at, Cancer Res. 50:1550-1558 (1990)) using saturating MAb4D5 concentrations.
[0306] Affinity Measurements. The antigen binding affinity of MAb4D5 variants was determined using a secreted form of the p185
[0307] Humanization of muMAb4D5. The muMAb4D5 V
[0308] Expression levels of huMAb4D5 variants were in the range of 7 to 15 μg /ml as judged by ELISA using immobilized p185
[0309] huMAb4D5 Variants. In general, the FR residues were chosen from consensus human sequences (Kabat, E. A. et al.,
[0310] The most potent humanized variant designed by molecular modeling, huMAb4D5-8, contains 5 FR residues from muMAb4D5. This antibody binds the p185
[0311] The anti-proliferative activity of huMAb4D5 variants against p185
[0312] The importance of V
[0313] V
[0314] From molecular modeling it appears that the tyrosyl side chain of muMAb4D5 V
[0315] Secondary Immune Function of huMAb4D5-8. MuMAb4D5 inhibits the growth of human breast tumor cells which overexpress p185
[0316] MuMAb4D5 is potentially useful for human therapy since it is cytostatic towards human breast and ovarian tumor lines overexpressing the HER2-encoded p185
[0317] Rapid humanization of huMAb4D5 was facilitated by the gene conversion mutagenesis strategy developed here using long preassembled oligonucleotides. This method requires less than half the amount of synthetic DNA as does total gene synthesis and does not require convenient restriction sites in the target DNA. Our method appears to be simpler and more reliable than a variant protocol recently reported (Rostapshov, V. M. et al.,
[0318] The crucial role of molecular modeling in the humanization of muMAb4D5 is illustrated by the designed variant huMAb4D5-8 which binds the p185
[0319] In addition to retaining tight receptor binding and the ability to inhibit cell growth, the huMAb4D5-8 also confers a secondary immune function (ADCC). This allows for direct cytotoxic activity of the humanized molecule in the presence of human effector cells. The apparent selectivity of the cytotoxic activity for cell types which overexpress p185TABLE 3 p185 proliferative activities of MAb4D5 variants MAb4D5 cell V V Variant 71 73 78 93 102 55 66 K proliferation FR3 FR3 FR3 FR3 CDR3 CDR2 FR3 nM Relative huMAb4D5-1 R D L A V E G 25 102 huMAb4D5-2 Ala D L A V E G 4.7 101 huMAb4D5-3 Ala Thr Ala Ser V E G 4.4 66 huMAb4D5-4 Ala Thr L Ser V E Arg 0.82 56 huMAb4D5-5 Ala Thr Ala Ser V E Arg 1.1 48 huMAb4D5-6 Ala Thr Ala Ser V Tyr Arg 0.22 51 huMAb4D5-7 Ala Thr Ala Ser Tyr E Arg 0.62 53 huMAb4D5-8 Ala Thr Ala Ser Tyr Tyr Arg 0.10 54 muMAb4D5 Ala Thr Ala Ser Tyr Tyr Arg 0.30 37
[0320]
TABLE 4 Selectivity of antibody dependent tumor cell cytotoxicity mediated by huMAb4D5-8 Effector:Target ratio WI-38* SK-BR-3 muMAb4D5 huMAb4D5-8 muMAb4D5 huMAb4D5-8 A. 25:1 <1.0 9.3 7.5 40.6 12.5:1 <1.0 11.1 4.7 36.8 6.25:1 <1.0 8.9 0.9 35.2 3.13:1 <1.0 8.5 4.6 19.6 B. 25:1 <1.0 3.1 6.1 33.4 12.5:1 <1.0 1.7 5.5 26.2 6.25:1 1.3 2.2 2.0 21.0 3.13:1 <1.0 0.8 2.4 13.4
[0321] This example illustrates one stepwise elaboration of the methods for creating a humanized sequence described above. It will be understood that not all of these steps are essential to the claimed invention, and that steps may be taken in different order.
[0322] 1. ascertain a consensus human variable domain amino acid sequence and prepare from it a consensus structural model.
[0323] 2. prepare model of import (the non-human domain to be humanized) variable domain sequences and note structural differences with respect to consensus human model.
[0324] 3. identify CDR sequences in human and in import, both by using Kabat (supra, 1987) and crystal structure criteria. If there is any difference in CDR identity from the different criteria, use of crystal structure definition of the CDR, but retain the Kabat residues as important framework residues to import.
[0325] 4. substitute import CDR sequences for human CDR sequences to obtain initial “humanized” sequence.
[0326] 5. compare import non-CDR variable domain sequence to the humanized sequence and note divergences.
[0327] 6. Proceed through the following analysis for each amino acid residue where the import diverges from the humanized.
[0328] a. If the humanized residue represents a residue which is generally highly conserved across all species, use the residue in the humanized sequence. If the residue is not conserved across all species, proceed with the analysis described in 6b.
[0329] b. If the residue is not generally conserved across all species, ask if the residue is generally conserved in humans.
[0330] i. If the residue is generally conserved in humans but the import residue differs, examine the structural models of the import and human sequences and determine if the import residue would be likely to affect the binding or biological activity of the CDRs by considering 1) could it bind antigen directly and 2) could it affect the conformation of the CDR. If the conclusion is that an affect on the CDRs is likely, substitute the import residue. If the conclusion is that a CDR affect is unlikely, leave the humanized residue unchanged.
[0331] ii. If the residue is also not generally conserved in humans, examine the structural models of the import and human sequences and determine if the import residue would be likely to affect the binding or biological activity of the CDRs be considering 1) could it bind antigen directly and 2) could it affect the conformation of the CDR. If the conclusion is that an affect on the CDRs is likely, substitute the import residue. If the conclusion is that a CDR affect is unlikely, proceed to the next step.
[0332] a) Examine the structural models of the import and human sequences and determine if the residue is exposed on the surface of the domain or is buried within. If the residue is exposed, use the residue in the humanized sequence. If the residue is buried, proceed to the next step.
[0333] (i) Examine the structural models of the import and human sequences and determine if the residue is likely to affect the V
[0334] 7. Search the import sequence, the consensus sequence and the humanized sequence for glycosylation sites outside the CDRs, and determine if this glycosylation site is likely to have any affect on antigen binding and/or biological activity. If no effect is likely, use the human sequence at that site; if some affect is likely, eliminate the glycosylation site or use the import sequence at that site.
[0335] 8. After completing the above analysis, determine the planned humanized sequence and prepare and test a sample. If the sample does not bind well to the target antigen, examine the particular residues listed below, regardless of the question of residue identity between the import and humanized residues.
[0336] a. Examine particular peripheral (non-CDR) variable domain residues that may, due to their position, possibly interact directly with a macromolecular antigen, including the following residues (where the * indicates residues which have been found to interact with antigen based on crystal structures):
[0337] i. Variable light domain: 36, 46, 49*, 63-70
[0338] ii. Variable heavy domain: 2, 47*, 68, 70, 73-76.
[0339] b. Examine particular variable domain residues which could interact with, or otherwise affect, the conformation of variable domain CDRs, including the following (not including CDR residues themselves, since it is assumed that, because the CDRs interact with one another, any residue in one CDR could potentially affect the conformation of another CDR residue) (L=LIGHT, H=HEAVY, residues appearing in bold are indicated to be structurally important according the Chothia et al., Nature 342:877 (1989), and residues appearing in italic were altered during humanization by Queen et al. (PDL), Proc. Natl. Acad. Sci. USA 86:10029 (1989) and Proc. Natl. Aced. Sci. USA 88:2869 (1991).):
[0340] i. Variable light domain:
[0341] a) CDR-1 (residues 24L-34L): 2L, 4L, 66L-69L, 71L
[0342] b) CDR-2 (residues 50L-56L): 35L, 46L, 47L, 48L, 49L, 58L, 62L, 64L-66L, 71L, 73L
[0343] c) CDR-3 (residues 89L-97L): 2L, 4L, 36L, 98L, 37H, 45H, 47H, 58H, 60H
[0344] ii. Variable heavy domain:
[0345] a) CDR-1 (residues 26H-35H): 2H, 4H, 24H, 36H, 71H, 73H, 76H, 78H, 92H, 94H
[0346] b) CDR-2 (residues 50H-55H): 49H, 69H, 69H, 71H, 73H, 78H
[0347] c) CDR-3 (residues 95H-102H): examine all residues as possible interaction partners with this loop, because this loop varies in size and conformation much more than the other CDRs.
[0348] 9. If after step 8 the humanized variable domain still is lacking in desired binding, repeat step 8. In addition, re-investigate any buried residues which might affect the V
[0349] This example demonstrates the construction of a humanized bispecific antibody (BsF(ab′)
[0350] Bispecific antibodies (BsAbs) with specificities for tumor-associated antigens and surface markers on immune effector cells have proved effective for retargeting effector cells to kill tumor targets both in vitro and in vivo (reviewed by Fanger, M. W. et al.,
[0351] BsF(ab′)
[0352] Recently a facile route to a fully humanized BsF(ab′)
[0353] Construction of mutations in the anti-CD3 variable region genes.
[0354] The construction of genes encoding humanized anti-CD3 variant 1 (v1) variable light (VHX11, 5′ GTAGATAAATCCtctAACACAGCCTAtCTGCAAATG 3′; (SEQ.ID. NO. 11) V HX12, 5′ GTAGATAAATCCAAAtctACAGCCTAtCTGCAAATG 3′; (SEQ.ID. NO. 12) V HX13, 5′ GTAGATAAATCCtcttctACAGCCTAtCTGCAAATG 3′; (SEQ.ID. NO. 13) V X14, 5′ CTTATAAAGGTGTTtCcACCTATaaCcAgAaatTCAA (SEQ.ID. NO. 14) V GGatCGTTTCACgATAtcCGTAGATAAATCC 3′; D61Q:S62K:V63F:G65D, v9; LX6, 5′ CTATACCTCCCGTCTgcatTCTGGAGTCCC 3′. (SEQ.ID. NO. 15) V+HL,l E55H, V11
[0355] Oligonucleotides HX11, HX12 and HX13 each remove a site for BspMI, whereas LX6 removes a site for XhoI and HX14 installs a site for EcoRV (bold). Anti-CD3 variant v10 was constructed from v9 by site-directed mutagenesis using oligonucleotide HX13. Mutants were verified by dideoxynucleotide sequencing (Sanger, F. et al.,
[0356]
[0357] The expression plasmid, pAK19, for the co-secretion of light chain and heavy chain Fd′ fragment of the most preferred humanized anti-p185
[0358] Construction of BsF(ab′)
[0359] Fab′ fragments were directly recovered from
[0360] Flow Cytometric Analysis of F(ab′)
[0361] The Jurkat human acute T cell leukemia cell line was purchased from the American Type Culture Collection (Rockville, Md.) (ATCC TIB 152) and grown as recommended by the ATCC. Aliquots of 10
[0362] Design of Humanized Anti-CD3 Variants
[0363] The most potent humanized anti-CD3 variant previously identified, v1, differs from the murine parent antibody, UCHT1 at 19 out of 107 amino acid residues within V
[0364] Preparation of BsF(ab′)
[0365] Soluble and functional anti-p185
[0366] SDS-PAGE analysis of this BsF(ab′)
[0367] Scanning LASER densitometry of a o-PDM coupled F(ab′)
[0368] Binding of BsF(ab′)
[0369] Binding of BsF(ab′)
[0370] A minimalistic strategy was chosen to humanize the anti-p185
[0371] Our humanization strategy was very successful for the anti-p185
[0372] Here we have restored T cell binding of the humanized anti-CD3 close to that of the chimeric variant by replacing six human residues in V
[0373] Our novel approach to the construction of BsF(ab′)
[0374] BsF(ab′)
[0375] A murine antibody directed against the leukocyte adhesion receptor β-chain (known as the H52 antibody) was humanized following the methods described above.