[0001] This application claims the benefit of U.S. provisional application No. 60/184,558, filed Feb. 24, 2000, and U.S. provisional application No. 60/218,856, filed Jul. 13, 2000, the entire contents of each of which are incorporated herein by reference.
[0002] The invention described herein relates to a novel gene and its encoded protein, termed 103P2D6, and to diagnostic and therapeutic methods and compositions useful in the management of various cancers that express 103P2D6.
[0003] Cancer is the second leading cause of human death next to coronary disease. Worldwide, millions of people die from cancer every year. In the United States alone, cancer causes the death of well over a half-million people annually, with some 1.4 million new cases diagnosed per year. While deaths from heart disease have been declining significantly, those resulting from cancer generally are on the rise. In the early part of the next century, cancer is predicted to become the leading cause of death.
[0004] Worldwide, several cancers stand out as the leading killers. In particular, carcinomas of the lung, prostate, breast, colon, pancreas, and ovary represent the primary causes of cancer death. These and virtually all other carcinomas share a common lethal feature. With very few exceptions, metastatic disease from a carcinoma is fatal. Moreover, even for those cancer patients who initially survive their primary cancers, common experience has shown that their lives are dramatically altered. Many cancer patients experience strong anxieties driven by the awareness of the potential for recurrence or treatment failure. Many cancer patients experience physical debilitations following treatment. Furthermore, many cancer patients experience a recurrence.
[0005] Worldwide, prostate cancer is the fourth most prevalent cancer in men. In North America and Northern Europe, it is by far the most common cancer in males and is the second leading cause of cancer death in men. In the United States alone, well over 40,000 men die annually of this disease—second only to lung cancer. Despite the magnitude of these figures, there is still no effective treatment for metastatic prostate cancer. Surgical prostatectomy, radiation therapy, hormone ablation therapy, surgical castration and chemotherapy continue to be the main treatment modalities. Unfortunately, these treatments are ineffective for many and are often associated with undesirable consequences.
[0006] On the diagnostic front, the lack of a prostate tumor marker that can accurately detect early-stage, localized tumors remains a significant limitation in the diagnosis and management of this disease. Although the serum prostate specific antigen (PSA) assay has been a very useful tool, its specificity and general utility are widely regarded as lacking in several important respects.
[0007] Progress in identifying additional specific markers for prostate cancer has been aided by the generation of prostate cancer xenografts that can recapitulate different stages of the disease in mice. The LAPC (Los Angeles Prostate Cancer) xenografts are prostate cancer xenografts that have survived passage in severe combined immune deficient (SCID) mice and have exhibited the capacity to mimic the transition from androgen dependence to androgen independence (Klein et al., 1997, Nat. Med. 3:402). More recently identified prostate cancer markers include PCTA-1 (Su et al., 1996, Proc. Natl. Acad. Sci. USA 93: 7252), prostate-specific membrane (PSM) antigen (Pinto et al., Clin Cancer Res September 1996;2(9):1445-51), STEAP (Proc Natl Acad Sci USA. Dec. 7, 1999;96(25):14523-8) and prostate stem cell antigen (PSCA) (Reiter et al., 1998, Proc. Natl. Acad. Sci. USA 95: 1735).
[0008] While previously identified markers such as PSA, PSM, PCTA and PSCA have facilitated efforts to diagnose and treat prostate cancer, there is need for the identification of additional markers and therapeutic targets for prostate and related cancers in order to further improve diagnosis and therapy.
[0009] The present invention relates to a novel gene, designated 103P2D6 that is over-expressed in multiple cancers listed in Table I. Northern blot expression analysis of 103P2D6 gene expression in normal tissues shows a restricted expression pattern in adult tissues. Analysis of 103P2D6 expression in normal prostate and prostate tumor xenografts shows over-expression in LAPC-4 and LAPC-9 prostate tumor xenografts. The nucleotide (
[0010] The invention provides polynucleotides corresponding or complementary to all or part of the 103P2D6 genes, mRNAs, and/or coding sequences, preferably in isolated form, including polynucleotides encoding 103P2D6-related proteins and fragments of 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or more than 25 amino acids; as well as the peptides/proteins themselves; DNA, RNA, DNA/RNA hybrids, and related molecules, polynucleotides or oligonucleotides complementary or having at least a 90% homology to the 103P2D6 genes or mRNA sequences or parts thereof, and polynucleotides or oligonucleotides that hybridize to the 103P2D6 genes, mRNAs, or to 103P2D6-encoding polynucleotides. Also provided are means for isolating cDNAs and the genes encoding 103P2D6. Recombinant DNA molecules containing 103P2D6 polynucleotides, cells transformed or transduced with such molecules, and host-vector systems for the expression of 103P2D6 gene products are also provided. The invention further provides antibodies that bind to 103P2D6 proteins and polypeptide fragments thereof, including polyclonal and monoclonal antibodies, murine and other mammalian antibodies, chimeric antibodies, humanized and fully human antibodies, and antibodies labeled with a detectable marker.
[0011] The invention further provides methods for detecting the presence and status of 103P2D6 polynucleotides and proteins in various biological samples, as well as methods for identifying cells that express 103P2D6. A typical embodiment of this invention provides methods for monitoring 103P2D6 gene products in a tissue or hematology sample having or suspected of having some form of growth dysregulation such as cancer.
[0012] The invention further provides various immunogenic or therapeutic compositions and strategies for treating cancers that express 103P2D6 such as prostate cancers, including therapies aimed at inhibiting the transcription, translation, processing or function of 103P2D6 as well as cancer vaccines.
[0029] I.) Definitions:
[0030] Unless otherwise defined, all terms of art, notations and other scientific terms or terminology used herein are intended to have the meanings commonly understood by those of skill in the art to which this invention pertains. In some cases, terms with commonly understood meanings are defined herein for clarity and/or for ready reference, and the inclusion of such definitions herein should not necessarily be construed to represent a substantial difference over what is generally understood in the art. Many of the techniques and procedures described or referenced herein are well understood and commonly employed using conventional methodology by those skilled in the art, such as, for example, the widely utilized molecular cloning methodologies described in Sambrook et al., Molecular Cloning: A Laboratory Manual 2nd. edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. As appropriate, procedures involving the use of commercially available kits and reagents are generally carried out in accordance with manufacturer defined protocols and/or parameters unless otherwise noted.
[0031] As used herein, the terms “advanced prostate cancer”, “locally advanced prostate cancer”, “advanced disease” and “locally advanced disease” mean prostate cancers that have extended through the prostate capsule, and are meant to include stage C disease under the American Urological Association (AUA) system, stage C1-C2 disease under the Whitmore-Jewett system, and stage T3-T4 and N+ disease under the TNM (tumor, node, metastasis) system. In general, surgery is not recommended for patients with locally advanced disease, and these patients have substantially less favorable outcomes compared to patients having clinically localized (organ-confined) prostate cancer. Locally advanced disease is clinically identified by palpable evidence of induration beyond the lateral border of the prostate, or asymmetry or induration above the prostate base. Locally advanced prostate cancer is presently diagnosed pathologically following radical prostatectomy if the tumor invades or penetrates the prostatic capsule, extends into the surgical margin, or invades the seminal vesicles.
[0032] “Altering the native glycosylation pattern” is intended for purposes herein to mean deleting one or more carbohydrate moieties found in native sequence 103P2D6 (either by removing the underlying glycosylation site or by deleting the glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are not present in the native sequence 103P2D6. In addition, the phrase includes qualitative changes in the glycosylation of the native proteins, involving a change in the nature and proportions of the various carbohydrate moieties present.
[0033] The term “analog” refers to a molecule that is structurally similar or shares similar or corresponding attributes with another molecule (e.g. a 103P2D6-related protein). For example an analog of the 103P2D6 protein can be specifically bound by an antibody or T cell that specifically binds to 103P2D6.
[0034] The term “antibody” is used in the broadest sense. Therefore an “antibody” can be naturally occurring or man-made such as monoclonal antibodies produced by conventional hybridoma technology. Anti-103P2D6 antibodies comprise monoclonal and polyclonal antibodies as well as fragments containing the antigen-binding domain and/or one or more complementarity determining regions of these antibodies.
[0035] As used herein, an “antibody fragment” is defined as at least a portion of the variable region of the immunoglobulin molecule that binds to its target, i.e., the antigen-binding region. In one embodiment it specifically covers single anti-103P2D6 antibodies and clones thereof (including agonist, antagonist and neutralizing antibodies) and anti-103P2D6 antibody compositions with polyepitopic specificity.
[0036] The term “codon optimized sequences” refers to nucleotide sequences that have been optimized for a particular host species by replacing any codons having a usage frequency of less than about 20%. Nucleotide sequences that have been optimized for expression in a given host species by elimination of spurious polyadenylation sequences, elimination of exon/intron splicing signals, elimination of transposon-like repeats and/or optimization of GC content, in addition to codon optimization, are referred to herein as “expression enhanced sequences.”
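The codon-replacement step described above can be illustrated with a minimal sketch. It is not the procedure used for 103P2D6. The usage table `CODON_USAGE`, its frequency values, and the function name `optimize_codons` are hypothetical placeholders chosen for illustration; a real host-specific usage table would be substituted. Each codon whose relative usage falls below the threshold (about 20%) is replaced by the most frequently used synonymous codon.

```python
# Hypothetical relative usage of synonymous codons in a host species.
# The numbers are illustrative placeholders, not measured values.
CODON_USAGE = {
    "L": {"CTG": 0.40, "CTC": 0.20, "CTT": 0.13, "TTG": 0.13, "CTA": 0.07, "TTA": 0.07},
    "R": {"AGA": 0.22, "AGG": 0.20, "CGG": 0.20, "CGC": 0.18, "CGA": 0.11, "CGT": 0.09},
    "M": {"ATG": 1.00},
}

# Reverse lookup: codon -> amino acid it encodes (for codons in the toy table).
CODON_TO_AA = {codon: aa for aa, table in CODON_USAGE.items() for codon in table}


def optimize_codons(cds: str, threshold: float = 0.20) -> str:
    """Replace codons used less often than `threshold` with the most
    frequently used synonymous codon; other codons are left unchanged."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    optimized = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3].upper()
        aa = CODON_TO_AA.get(codon)
        if aa is not None:
            usage = CODON_USAGE[aa]
            if usage[codon] < threshold:
                # Swap in the synonymous codon with the highest usage.
                codon = max(usage, key=usage.get)
        optimized.append(codon)
    return "".join(optimized)


print(optimize_codons("ATGTTACGT"))
```

Because only synonymous codons are exchanged, the encoded amino acid sequence is unchanged. The other measures named above (removal of spurious polyadenylation or splicing signals, GC adjustment) are not covered by this sketch.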
[0037] The term “cytotoxic agent” as used herein refers to a substance that inhibits or prevents the function of cells and/or causes destruction of cells. The term is intended to include radioactive isotopes, chemotherapeutic agents, and toxins such as small molecule toxins or enzymatically active toxins of bacterial, fungal, plant or animal origin, including fragments and/or variants thereof. Examples of cytotoxic agents include, but are not limited to, maytansinoids, yttrium, bismuth, ricin, ricin A-chain, doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide, teniposide, vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin, diphtheria toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, abrin A chain, modeccin A chain, alpha-sarcin, gelonin, mitogellin, restrictocin, phenomycin, enomycin, curcin, crotin, calicheamicin, Saponaria officinalis inhibitor, and glucocorticoid and other chemotherapeutic agents, as well as radioisotopes such as At
[0038] The term “homolog” refers to a molecule which exhibits homology to another molecule, by for example, having sequences of chemical residues that are the same or similar at corresponding positions.
[0039] As used herein, the terms “hybridize”, “hybridizing”, “hybridizes” and the like, used in the context of polynucleotides, are meant to refer to conventional hybridization conditions, preferably such as hybridization in 50% formamide/6×SSC/0.1% SDS/100 μg/ml ssDNA, in which temperatures for hybridization are above 37 degrees C. and temperatures for washing in 0.1×SSC/0.1% SDS are above 55 degrees C.
[0040] As used herein, a polynucleotide is said to be “isolated” when it is substantially separated from contaminant polynucleotides that correspond or are complementary to genes other than the 103P2D6 gene or that encode polypeptides other than 103P2D6 gene product or fragments thereof. A skilled artisan can readily employ nucleic acid isolation procedures to obtain an isolated 103P2D6 polynucleotide.
[0041] As used herein, a protein is said to be “isolated” when physical, mechanical or chemical methods are employed to remove the 103P2D6 protein from cellular constituents that are normally associated with the protein. A skilled artisan can readily employ standard purification methods to obtain an isolated 103P2D6 protein. Alternatively, an isolated protein can be prepared by chemical means.
[0042] The term “mammal” as used herein refers to any organism classified as a mammal, including mice, rats, rabbits, dogs, cats, cows, horses and humans. In one embodiment of the invention, the mammal is a mouse. In another embodiment of the invention, the mammal is a human.
[0043] As used herein, the terms “metastatic prostate cancer” and “metastatic disease” mean prostate cancers that have spread to regional lymph nodes or to distant sites, and are meant to include stage D disease under the AUA system and stage TxNxM+ under the TNM system. As is the case with locally advanced prostate cancer, surgery is generally not indicated for patients with metastatic disease, and hormonal (androgen ablation) therapy is a preferred treatment modality. Patients with metastatic prostate cancer eventually develop an androgen-refractory state within 12 to 18 months of treatment initiation. Approximately half of these androgen-refractory patients die within 6 months after developing that status. The most common site for prostate cancer metastasis is bone. Prostate cancer bone metastases are often osteoblastic rather than osteolytic (i.e., resulting in net bone formation). Bone metastases are found most frequently in the spine, followed by the femur, pelvis, rib cage, skull and humerus. Other common sites for metastasis include lymph nodes, lung, liver and brain. Metastatic prostate cancer is typically diagnosed by open or laparoscopic pelvic lymphadenectomy, whole body radionuclide scans, skeletal radiography, and/or bone lesion biopsy.
[0044] The term “monoclonal antibody” as used herein refers to an antibody obtained from a population of substantially homogeneous antibodies, i.e., the antibodies comprising the population are identical except for possible naturally occurring mutations that are present in minor amounts.
[0045] As used herein, “motif”, as in biological motif of a 103P2D6-related protein, refers to any set of amino acids forming part of the primary sequence of a protein, either contiguous or capable of being aligned to certain positions that are generally invariant, that is associated with a particular function (e.g. protein-protein interaction, protein-DNA interaction, etc.) or modification (e.g. that is phosphorylated, glycosylated or amidated), or localization (e.g. secretory sequence, nuclear localization sequence, etc.) or a sequence that is correlated with being immunogenic, either humorally or cellularly.
[0046] As used herein, the term “polynucleotide” means a polymeric form of nucleotides of at least 10 bases or base pairs in length, either ribonucleotides or deoxynucleotides or a modified form of either type of nucleotide, and is meant to include single and double stranded forms of DNA and/or RNA. In the art, this term is often used interchangeably with “oligonucleotide”. A polynucleotide can comprise a nucleotide sequence disclosed herein wherein thymidine (T) (as shown for example in SEQ ID NO: 1) can also be uracil (U); this definition pertains to the differences between the chemical structures of DNA and RNA, in particular the observation that one of the four major bases in RNA is uracil (U) instead of thymidine (T).
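The T-for-U equivalence and the 10-base minimum in this definition can be shown with a short sketch. The function names `as_rna` and `meets_length_definition` and the example sequence are hypothetical; the sequence is a made-up string and is not taken from SEQ ID NO: 1.

```python
def as_rna(dna_sequence: str) -> str:
    """Write a DNA-alphabet sequence in the RNA alphabet (T becomes U)."""
    return dna_sequence.upper().replace("T", "U")


def meets_length_definition(sequence: str, minimum: int = 10) -> bool:
    """Check the 'at least 10 bases' length criterion of the definition."""
    return len(sequence) >= minimum


example = "ATGGCTTACGT"  # hypothetical 11-base sequence
print(as_rna(example), meets_length_definition(example))
```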
[0047] As used herein, the term “polypeptide” means a polymer of at least about 4, 5, 6, 7, or 8 amino acids. Throughout the specification, standard three letter or single letter designations for amino acids are used. In the art, this term is often used interchangeably with “peptide” or “protein”.
[0048] As used herein, a “recombinant” DNA or RNA molecule is a DNA or RNA molecule that has been subjected to molecular manipulation in vitro.
[0049] “Stringency” of hybridization reactions is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower temperatures. Hybridization generally depends on the ability of denatured nucleic acid sequences to reanneal when complementary strands are present in an environment below their melting temperature. The higher the degree of desired homology between the probe and hybridizable sequence, the higher the relative temperature that can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions more stringent, while lower temperatures less so. For additional details and explanation of stringency of hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience Publishers, (1995).
[0050] “Stringent conditions” or “high stringency conditions”, as defined herein, are identified by, but not limited to, those that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50° C.; (2) employ during hybridization a denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42° C.; or (3) employ 50% formamide, 5×SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5× Denhardt's solution, sonicated salmon sperm DNA (50 μg/ml), 0.1% SDS, and 10% dextran sulfate at 42° C., with washes at 42° C. in 0.2×SSC (sodium chloride/sodium citrate) and 50% formamide at 55° C., followed by a high-stringency wash consisting of 0.1×SSC containing EDTA at 55° C. “Moderately stringent conditions” are described by, but not limited to, those in Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength and % SDS) less stringent than those described above. An example of moderately stringent conditions is overnight incubation at 37° C. in a solution comprising: 20% formamide, 5×SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5× Denhardt's solution, 10% dextran sulfate, and 20 mg/mL denatured sheared salmon sperm DNA, followed by washing the filters in 1×SSC at about 37-50° C. The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like.
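The salt concentrations quoted for the SSC buffers above scale linearly with the stated fold concentration. The following minimal sketch assumes 1×SSC is 0.15 M sodium chloride and 0.015 M sodium citrate, which agrees with the 5×SSC values (0.75 M NaCl, 0.075 M sodium citrate) given in condition (3). The names `SSC_1X` and `ssc_molarity` are hypothetical helpers introduced only for illustration.

```python
# Assumed composition of 1x SSC in mol/L, consistent with the 5x SSC
# values (0.75 M NaCl, 0.075 M sodium citrate) quoted in the text.
SSC_1X = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold: float) -> dict:
    """Return the molar concentration of each solute in `fold`x SSC."""
    return {solute: round(conc * fold, 6) for solute, conc in SSC_1X.items()}


# Buffers named in the stringency definitions above.
for fold in (5, 1, 0.2, 0.1):
    print(f"{fold}x SSC:", ssc_molarity(fold))
```

Under this assumption, the low ionic strength wash of condition (1), 0.015 M sodium chloride with 0.0015 M sodium citrate, corresponds to 0.1×SSC.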
[0051] A “transgenic animal” (e.g., a mouse or rat) is an animal having cells that contain a transgene, which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic stage. A “transgene” is a DNA that is integrated into the genome of a cell from which a transgenic animal develops.
[0052] The term “variant” refers to a molecule that exhibits a variation from a described type or norm, such as a protein that has one or more different amino acid residues in the corresponding position(s) of a specifically described protein (e.g. the 103P2D6 protein shown in
[0053] As used herein, the 103P2D6-related gene and 103P2D6-related protein include the 103P2D6 genes and proteins specifically described herein, as well as structurally and/or functionally similar variants or analogs of the foregoing. 103P2D6 peptide analogs generally share at least about 50%, 60%, 70%, 80%, 90% or more amino acid homology (using BLAST criteria). 103P2D6 nucleotide analogs preferably share 50%, 60%, 70%, 80%, 90% or more nucleic acid homology (using BLAST criteria). In some embodiments, however, lower homology is preferred so as to select preferred residues in view of species-specific codon preferences for optimized protein expression and production and/or immunogenicity-modulated peptide epitopes tailored to a particular target population, e.g. HLA type, as is appreciated by those skilled in the art.
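The percentage thresholds above can be made concrete with a simple percent-identity calculation over two sequences that have already been aligned. This is only a sketch: BLAST performs its own local alignment and scoring, which this code does not reproduce. The function name `percent_identity` and the example peptides are hypothetical and are not 103P2D6 sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding identical residues.

    Both inputs must be pre-aligned to equal length, with '-' marking gaps.
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Hypothetical 10-residue peptides differing at one position.
print(percent_identity("MKTAYIAKQR", "MKTAYLAKQR"))
```

A pair such as this one, identical at nine of ten aligned positions, would fall within the "90% or more" category.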
[0054] The 103P2D6-related proteins of the invention include those specifically identified herein, as well as allelic variants, conservative substitution variants, analogs and homologs that can be isolated/generated and characterized without undue experimentation following the methods outlined herein or readily available in the art. Fusion proteins that combine parts of different 103P2D6 proteins or fragments thereof, as well as fusion proteins of a 103P2D6 protein and a heterologous polypeptide, are also included. Such 103P2D6 proteins are collectively referred to as the 103P2D6-related proteins, the proteins of the invention, or 103P2D6. As used herein, the term “103P2D6-related protein” refers to a polypeptide fragment or a 103P2D6 protein sequence of 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or more than 25 amino acids.
[0055] II.) Properties of 103P2D6.
[0056] As disclosed herein, 103P2D6 exhibits specific properties that are analogous to those found in a family of molecules whose polynucleotides, polypeptides, reactive cytotoxic T cells (CTL), reactive helper T cells (HTL) and anti-polypeptide antibodies are used in well known diagnostic assays that examine conditions associated with dysregulated cell growth such as cancer, in particular prostate cancer (see, e.g., both its highly specific pattern of tissue expression as well as its overexpression in prostate cancers as described for example in Example 3). The best-known member of this class is PSA, the archetypal marker that has been used by medical practitioners for years to identify and monitor the presence of prostate cancer (see, e.g., Merrill et al., J. Urol. 163(2): 503-5120 (2000); Polascik et al., J. Urol. August;162(2):293-306 (1999) and Fortier et al., J. Nat. Cancer Inst. 91(19): 1635-1640(1999)). A variety of other diagnostic markers are also used in this context including p53 and K-ras (see, e.g., Tulchinsky et al., Int J Mol Med July 1999;4(1):99-102 and Minimoto et al., Cancer Detect Prev 2000;24(1):1-12). Therefore, this disclosure of the 103P2D6 polynucleotides and polypeptides (as well as the 103P2D6 polynucleotide probes and anti-103P2D6 antibodies used to identify the presence of these molecules) and their properties allows skilled artisans to utilize these molecules in methods that are analogous to those used, for example, in a variety of diagnostic assays directed to examining conditions associated with cancer.
[0057] Typical embodiments of diagnostic methods that utilize the 103P2D6 polynucleotides, polypeptides, reactive T cells and antibodies are analogous to those methods from well-established diagnostic assays that employ, e.g., PSA polynucleotides, polypeptides, reactive T cells and antibodies. For example, just as PSA polynucleotides are used as probes (for example in Northern analysis, see, e.g., Sharief et al., Biochem. Mol. Biol. Int. 33(3):567-74(1994)) and primers (for example in PCR analysis, see, e.g., Okegawa et al., J. Urol. 163(4): 1189-1190 (2000)) to observe the presence and/or the level of PSA mRNAs in methods of monitoring PSA overexpression or the metastasis of prostate cancers, the 103P2D6 polynucleotides described herein can be utilized in the same way to detect 103P2D6 overexpression or the metastasis of prostate and other cancers expressing this gene. Alternatively, just as PSA polypeptides are used to generate antibodies specific for PSA which can then be used to observe the presence and/or the level of PSA proteins in methods to monitor PSA protein overexpression (see, e.g., Stephan et al., Urology 55(4):560-3 (2000)) or the metastasis of prostate cells (see, e.g., Alanen et al., Pathol. Res. Pract. 192(3):233-7 (1996)), the 103P2D6 polypeptides described herein can be utilized to generate antibodies for use in detecting 103P2D6 overexpression or the metastasis of prostate cells and cells of other cancers expressing this gene.
[0058] Specifically, because metastasis involves the movement of cancer cells from an organ of origin (such as the lung or prostate gland etc.) to a different area of the body (such as a lymph node), assays which examine a biological sample for the presence of cells expressing 103P2D6 polynucleotides and/or polypeptides can be used to provide evidence of metastasis. For example, when a biological sample from tissue that does not normally contain 103P2D6-expressing cells (lymph node) is found to contain 103P2D6-expressing cells such as the 103P2D6 expression seen in LAPC4 and LAPC9, xenografts isolated from lymph node and bone metastasis, respectively, this finding is indicative of metastasis.
[0059] Alternatively 103P2D6 polynucleotides and/or polypeptides can be used to provide evidence of cancer, for example, when cells in a biological sample that do not normally express 103P2D6 or express 103P2D6 at a different level are found to express 103P2D6 or have an increased expression of 103P2D6 (see, e.g., the 103P2D6 expression in kidney, lung and colon cancer cells and in patient samples etc. shown in FIGS.
[0060] Just as PSA polynucleotide fragments and polynucleotide variants are employed by skilled artisans for use in methods of monitoring PSA, 103P2D6 polynucleotide fragments and polynucleotide variants are used in an analogous manner. In particular, typical PSA polynucleotides used in methods of monitoring PSA are probes or primers that consist of fragments of the PSA cDNA sequence. Illustrating this, primers used to PCR amplify a PSA polynucleotide must include less than the whole PSA sequence to function in the polymerase chain reaction. In the context of such PCR reactions, skilled artisans generally create a variety of different polynucleotide fragments that can be used as primers in order to amplify different portions of a polynucleotide of interest or to optimize amplification reactions (see, e.g., Caetano-Anolles, G. Biotechniques 25(3): 472-476, 478-480 (1998); Robertson et al., Methods Mol. Biol. 98:121-154 (1998)). An additional illustration of the use of such fragments is provided in Example 3, where a 103P2D6 polynucleotide fragment is used as a probe to show the expression of 103P2D6 RNAs in cancer cells. In addition, variant polynucleotide sequences are typically used as primers and probes for the corresponding mRNAs in PCR and Northern analyses (see, e.g., Sawai et al., Fetal Diagn. Ther. November-December 1996;11(6):407-13 and Current Protocols In Molecular Biology, Volume 2, Unit 2, Frederick M. Ausubel et al. eds., 1995). Polynucleotide fragments and variants are useful in this context where they are capable of binding to a target polynucleotide sequence (e.g. the 103P2D6 polynucleotide shown in SEQ ID NO: 1) under conditions of high stringency.
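The point that PCR primers are fragments shorter than the whole target sequence can be sketched as follows. This is an illustrative assumption about how a primer pair flanking a region might be derived, not a primer design method from this disclosure. The function names, the default primer length, and the toy template are hypothetical; real primer design also weighs melting temperature, specificity and secondary structure.

```python
# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def primer_pair(template: str, start: int, end: int, primer_length: int = 20):
    """Derive a (forward, reverse) primer pair flanking template[start..end].

    Positions are 1-based and inclusive. The forward primer copies the first
    `primer_length` bases of the region; the reverse primer is the reverse
    complement of the last `primer_length` bases.
    """
    region = template[start - 1:end].upper()
    if len(region) < primer_length:
        raise ValueError("region is shorter than the requested primer length")
    forward = region[:primer_length]
    reverse = reverse_complement(region[-primer_length:])
    return forward, reverse


toy_template = "ATGCGTACCTGAAGCTTGGA"  # hypothetical 20-base template
print(primer_pair(toy_template, 1, 20, primer_length=6))
```

Each primer here covers only 6 of the 20 template bases, which mirrors the statement that a primer includes less than the whole sequence being amplified.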
[0061] Furthermore, PSA polypeptides which contain an epitope that can be recognized by an antibody or T cell that specifically binds to that epitope are used in methods of monitoring PSA. 103P2D6 polypeptide fragments and polypeptide analogs or variants can also be used in an analogous manner. This practice of using polypeptide fragments or polypeptide variants to generate antibodies (such as anti-PSA antibodies or T cells) is typical in the art with a wide variety of systems such as fusion proteins being used by practitioners (see, e.g., Current Protocols In Molecular Biology, Volume 2, Unit 16, Frederick M. Ausubel et al. eds., 1995). In this context, each epitope functions to provide the architecture with which an antibody or T cell is reactive. Typically, skilled artisans create a variety of different polypeptide fragments that can be used in order to generate immune responses specific for different portions of a polypeptide of interest (see, e.g., U.S. Pat. No. 5,840,501 and U.S. Pat. No. 5,939,533). For example, it may be preferable to utilize a polypeptide comprising one of the 103P2D6 biological motifs discussed herein or available in the art. Polypeptide fragments, variants or analogs are typically useful in this context as long as they comprise an epitope capable of generating an antibody or T cell specific for a target polypeptide sequence (e.g. the 103P2D6 polypeptide shown in SEQ ID NO: 2).
[0062] As shown herein, the 103P2D6 polynucleotides and polypeptides (as well as the 103P2D6 polynucleotide probes and anti-103P2D6 antibodies or T cells used to identify the presence of these molecules) exhibit specific properties that make them useful in diagnosing cancers of the prostate. Diagnostic assays that measure the presence of 103P2D6 gene products, in order to evaluate the presence or onset of a disease condition described herein, such as prostate cancer, are used to identify patients for preventive measures or further monitoring, as has been done so successfully with PSA. Moreover, these materials satisfy a need in the art for molecules having similar or complementary characteristics to PSA in situations where, for example, a definite diagnosis of metastasis of prostatic origin cannot be made on the basis of a test for PSA alone (see, e.g., Alanen et al., Pathol. Res. Pract. 192(3): 233-237 (1996)), and consequently, materials such as 103P2D6 polynucleotides and polypeptides (as well as the 103P2D6 polynucleotide probes and anti-103P2D6 antibodies used to identify the presence of these molecules) must be employed to confirm metastases of prostatic origin.
[0063] Finally, in addition to their use in diagnostic assays, the 103P2D6 polynucleotides disclosed herein have a number of other specific utilities such as their use in the identification of oncogenetic associated chromosomal abnormalities in 2q34, the chromosomal region to which the 103P2D6 gene maps (see Example 7 below). Moreover, in addition to their use in diagnostic assays, the 103P2D6-related proteins and polynucleotides disclosed herein have other utilities such as their use in the forensic analysis of tissues of unknown origin (see, e.g., Takahama K Forensic Sci Int Jun. 28, 1996;80(1-2): 63-9).
[0064] Additionally, 103P2D6-related proteins or polynucleotides of the invention can be used to treat a pathologic condition characterized by the over-expression of 103P2D6. For example, the amino acid or nucleic acid sequence of
[0065] III.) 103P2D6 Polynucleotides
[0066] One aspect of the invention provides polynucleotides corresponding or complementary to all or part of a 103P2D6 gene, mRNA, and/or coding sequence, preferably in isolated form, including polynucleotides encoding a 103P2D6-related protein and fragments thereof, DNA, RNA, DNA/RNA hybrid, and related molecules, polynucleotides or oligonucleotides complementary to a 103P2D6 gene or mRNA sequence or a part thereof, and polynucleotides or oligonucleotides that hybridize to a 103P2D6 gene, mRNA, or to a 103P2D6 encoding polynucleotide (collectively, “103P2D6 polynucleotides”). In all instances when referred to in this section, T can also be U in
[0067] Embodiments of a 103P2D6 polynucleotide include: a 103P2D6 polynucleotide having the sequence shown in
[0068] (a) at least 10 contiguous nucleotides of a polynucleotide having the sequence as shown in
[0069] (b) at least 10 contiguous nucleotides of a polynucleotide having the sequence as shown in
[0070] (c) at least 10 contiguous nucleotides of a polynucleotide having the sequence as shown in
[0071] (d) a polynucleotide whose starting base is in the range of 1-804 of
[0072] (e) a polynucleotide whose starting base is in the range of 977-1036 of
[0073] (f) a polynucleotide whose starting base is in the range of 1414-1815 of
[0074] (g) a polynucleotide whose starting base is in the range of 805-976 of
[0075] (h) a polynucleotide whose starting base is in the range of 805-2493 of
[0076] (i) a polynucleotide of (d-g) that is at least 10 nucleotide bases in length; or
[0077] (j) a polynucleotide that selectively hybridizes under stringent conditions to a polynucleotide of (a)-(g);
[0078] wherein a range is understood to specifically disclose all whole unit positions thereof. Moreover, a peptide that is encoded by any of the foregoing is also within the scope of the invention.
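By way of illustration only, the way a starting-base range and a minimum length together define a family of fragments can be sketched in Python. The sequence below is a made-up 20-base placeholder, not the sequence of SEQ ID NO: 1, and the function name `fragments_from_range` is hypothetical.

```python
def fragments_from_range(seq, start_lo, start_hi, min_len=10):
    """Yield (start, end, fragment) for every fragment of at least min_len
    bases whose 1-based starting base lies in [start_lo, start_hi]."""
    n = len(seq)
    # Every whole-unit position in the range is a permitted starting base.
    for start in range(start_lo, min(start_hi, n) + 1):
        # The fragment may end at any higher position that gives >= min_len bases.
        for end in range(start + min_len - 1, n + 1):
            yield start, end, seq[start - 1:end]


# Placeholder sequence (assumption for illustration only).
toy = "ATGGCTAGCTAGGATCCGTA"
frags = list(fragments_from_range(toy, 1, 3, min_len=10))
```

For a real sequence the number of such fragments grows quickly, so a generator is used here rather than building the full list unless it is needed.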
[0079] Also within the scope of the invention is a nucleotide, as well as any peptide encoded thereby, that starts at any of the following positions or ranges, and ends at a higher position or range: 1, 804, a range of 1-804, 805, a range of 805-976; a range of 805-2493; a range of 977-1036, a range of 1037-1413; a range of 1414-1815; a range of 1816-2493; a range of 2494-4727; wherein a range as used in this section is understood to specifically disclose all whole unit positions thereof.
[0080] Another embodiment of the invention comprises a polynucleotide that encodes a 103P2D6-related protein whose sequence is encoded by the cDNA contained in the plasmids deposited with American Type Culture Collection as Accession No. PTA-1155 or PTA-1895. Another embodiment comprises a polynucleotide that hybridizes under stringent hybridization conditions, to the human 103P2D6 cDNA shown in SEQ ID NO: 1 or to a polynucleotide fragment thereof.
[0081] Typical embodiments of the invention disclosed herein include 103P2D6 polynucleotides that encode specific portions of the 103P2D6 mRNA sequence (and those which are complementary to such sequences) such as those that encode the protein and fragments thereof, for example of 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more contiguous amino acids.
[0082] For example, representative embodiments of the invention disclosed herein include: polynucleotides and their encoded peptides themselves encoding about amino acid 1 to about amino acid 10 of the 103P2D6 protein shown in
[0083] Polynucleotides encoding relatively long portions of the 103P2D6 protein are also within the scope of the invention. Additional illustrative embodiments of the invention disclosed herein include 103P2D6 polynucleotide fragments encoding one or more of the biological motifs contained within the 103P2D6 protein sequence, including one or more of the motif-bearing subsequences of the 103P2D6 protein set forth in Table XIX. In another embodiment, typical polynucleotide fragments of the invention encode one or more of the regions of 103P2D6 that exhibit homology to a known molecule. In another embodiment of the invention, typical polynucleotide fragments can encode one or more of the 103P2D6 N-glycosylation sites, cAMP and cGMP-dependent protein kinase phosphorylation sites, casein kinase II phosphorylation sites or N-myristoylation site and amidation sites.
[0084] III.A.) Uses of 103P2D6 Polynucleotides
[0085] III.A.1.) Monitoring of Genetic Abnormalities
[0086] The polynucleotides of the preceding paragraphs have a number of different specific uses. The human 103P2D6 gene maps to chromosome 4p12-p14 as determined using the GeneBridge4 radiation hybrid panel (see Example 7). For example, because the 103P2D6 gene maps to chromosome 4p12-p14, polynucleotides that encode different regions of the 103P2D6 protein are used to characterize cytogenetic abnormalities on chromosome 4, band p12-p14 that have been identified as being associated with various cancers. In particular, a variety of chromosomal abnormalities in 4p12-p14 including translocations and deletions have been identified as frequent cytogenetic abnormalities in a number of different cancers (see, e.g., Zimonjic, D. B. et al., 1999, Hepatology 29(4):1208-14; Wu, X. et al., 1995, Cancer Res. 55(3):557-61; Arribas, R. et al., 1999, Lab. Invest. 79(2):111-22). Thus, polynucleotides encoding specific regions of the 103P2D6 protein provide new tools that can be used to delineate, with greater precision than previously possible, cytogenetic abnormalities in this region of chromosome 4 that may contribute to the malignant phenotype. In this context, these polynucleotides satisfy a need in the art for expanding the sensitivity of chromosomal screening in order to identify more subtle and less common chromosomal abnormalities (see, e.g., Evans et al., Am. J. Obstet. Gynecol. 171(4): 1055-1057 (1994)).
[0087] Furthermore, as 103P2D6 was shown to be highly expressed in prostate and other cancers, 103P2D6 polynucleotides are used in methods assessing the status of 103P2D6 gene products in normal versus cancerous tissues. Typically, polynucleotides that encode specific regions of the 103P2D6 protein are used to assess the presence of perturbations (such as deletions, insertions, point mutations, or alterations resulting in a loss of an antigen, etc.) in specific regions of the 103P2D6 gene, such as regions containing one or more motifs. Exemplary assays include both RT-PCR assays and single-strand conformation polymorphism (SSCP) analysis (see, e.g., Marrogi et al., J. Cutan. Pathol. 26(8): 369-378 (1999)), both of which utilize polynucleotides encoding specific regions of a protein to examine these regions within the protein.
[0088] III.A.2.) Antisense Embodiments
[0089] Other specifically contemplated nucleic acid related embodiments of the invention disclosed herein are genomic DNA, cDNAs, ribozymes, and antisense molecules, as well as nucleic acid molecules based on an alternative backbone, or including alternative bases, whether derived from natural sources or synthesized, and include molecules capable of inhibiting the RNA or protein expression of 103P2D6. For example, antisense molecules can be RNAs or other molecules, including peptide nucleic acids (PNAs) or non-nucleic acid molecules such as phosphorothioate derivatives, that specifically bind DNA or RNA in a base pair-dependent manner. A skilled artisan can readily obtain these classes of nucleic acid molecules using the 103P2D6 polynucleotides and polynucleotide sequences disclosed herein.
[0090] Antisense technology entails the administration of exogenous oligonucleotides that bind to a target polynucleotide located within the cells. The term “antisense” refers to the fact that such oligonucleotides are complementary to their intracellular targets, e.g., 103P2D6. See for example, Jack Cohen, Oligodeoxynucleotides, Antisense Inhibitors of Gene Expression, CRC Press, 1989; and Synthesis 1:1-5 (1988). The 103P2D6 antisense oligonucleotides of the present invention include derivatives such as S-oligonucleotides (phosphorothioate derivatives or S-oligos, see, Jack Cohen, supra), which exhibit enhanced cancer cell growth inhibitory action. S-oligos (nucleoside phosphorothioates) are isoelectronic analogs of an oligonucleotide (O-oligo) in which a nonbridging oxygen atom of the phosphate group is replaced by a sulfur atom. The S-oligos of the present invention can be prepared by treatment of the corresponding O-oligos with 3H-1,2-benzodithiol-3-one-1,1-dioxide, which is a sulfur transfer reagent. See Iyer, R. P. et al., J. Org. Chem. 55:4693-4698 (1990); and Iyer, R. P. et al., J. Am. Chem. Soc. 112:1253-1254 (1990). Additional 103P2D6 antisense oligonucleotides of the present invention include morpholino antisense oligonucleotides known in the art (see, e.g., Partridge et al., 1996, Antisense & Nucleic Acid Drug Development 6: 169-175).
[0091] The 103P2D6 antisense oligonucleotides of the present invention typically can be RNA or DNA that is complementary to and stably hybridizes with the first 100 5′ codons or last 100 3′ codons of the 103P2D6 genomic sequence or the corresponding mRNA. Absolute complementarity is not required, although high degrees of complementarity are preferred. Use of an oligonucleotide complementary to this region allows for the selective hybridization to 103P2D6 mRNA and not to mRNA specifying other regulatory subunits of protein kinase. In one embodiment, 103P2D6 antisense oligonucleotides of the present invention are 15 to 30-mer fragments of the antisense DNA molecule that have a sequence that hybridizes to 103P2D6 mRNA. Optionally, 103P2D6 antisense oligonucleotide is a 30-mer oligonucleotide that is complementary to a region in the first 10 5′ codons or last 10 3′ codons of 103P2D6. Alternatively, the antisense molecules are modified to employ ribozymes in the inhibition of 103P2D6 expression, see, e.g., L. A. Couture & D. T. Stinchcomb;
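As a minimal sketch of the complementarity relationship described above, an antisense DNA oligonucleotide of 15 to 30 bases can be derived as the reverse complement of a chosen target region. The target sequence below is an arbitrary placeholder rather than any 103P2D6 sequence, and the helper name `antisense_dna` is hypothetical. Perfect complementarity is assumed here for simplicity, although the text notes it is not required.

```python
# Map each target base (DNA or RNA alphabet) to its complementary DNA base.
COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def antisense_dna(target, start, length):
    """Return the antisense DNA oligo, written 5'->3', that is fully
    complementary to target[start:start + length] (0-based start)."""
    if not 15 <= length <= 30:
        raise ValueError("this sketch only builds 15- to 30-mers")
    region = target[start:start + length].upper()
    if len(region) < length:
        raise ValueError("requested region runs past the end of the target")
    # Complement each base, then reverse so the result reads 5'->3'.
    return region.translate(COMPLEMENT)[::-1]


# Placeholder mRNA fragment (assumption for illustration only).
toy_mrna = "AUGGCCAAGUUCGACCUGAAGGGCAUC"
oligo = antisense_dna(toy_mrna, 0, 15)
```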
[0092] III.A.3.) Primers and Primer Pairs
[0093] Further specific embodiments of the nucleotides of the invention include primers and primer pairs, which allow the specific amplification of polynucleotides of the invention or of any specific parts thereof, and probes that selectively or specifically hybridize to nucleic acid molecules of the invention or to any part thereof. Probes can be labeled with a detectable marker, such as, for example, a radioisotope, fluorescent compound, bioluminescent compound, a chemiluminescent compound, metal chelator or enzyme. Such probes and primers are used to detect the presence of a 103P2D6 polynucleotide in a sample and as a means for detecting a cell expressing a 103P2D6 protein.
[0094] Examples of such probes include polynucleotides comprising all or part of the human 103P2D6 cDNA sequences shown in
[0095] The 103P2D6 polynucleotides of the invention are useful for a variety of purposes, including but not limited to their use as probes and primers for the amplification and/or detection of the 103P2D6 gene(s), mRNA(s), or fragments thereof; as reagents for the diagnosis and/or prognosis of prostate cancer and other cancers; as coding sequences capable of directing the expression of 103P2D6 polypeptides; as tools for modulating or inhibiting the expression of the 103P2D6 gene(s) and/or translation of the 103P2D6 transcript(s); and as therapeutic agents.
[0096] III.A.4.) Isolation of 103P2D6-Encoding Nucleic Acid Molecules
[0097] The 103P2D6 cDNA sequences described herein enable the isolation of other polynucleotides encoding 103P2D6 gene product(s), as well as the isolation of polynucleotides encoding 103P2D6 gene product homologs, alternatively spliced isoforms, allelic variants, and mutant forms of the 103P2D6 gene product as well as polynucleotides that encode analogs of 103P2D6-related proteins. Various molecular cloning methods that can be employed to isolate full length cDNAs encoding an 103P2D6 gene are well known (see, for example, Sambrook, J. et al., Molecular Cloning: A Laboratory Manual, 2d edition, Cold Spring Harbor Press, New York, 1989; Current Protocols in Molecular Biology, Ausubel et al., Eds., Wiley and Sons, 1995). For example, lambda phage cloning methodologies can be conveniently employed, using commercially available cloning systems (e.g., Lambda ZAP Express, Stratagene). Phage clones containing 103P2D6 gene cDNAs can be identified by probing with a labeled 103P2D6 cDNA or a fragment thereof. For example, in one embodiment, the 103P2D6 cDNA (
[0098] III.A.5.) Recombinant Nucleic Acid Molecules and Host-Vector Systems
[0099] The invention also provides recombinant DNA or RNA molecules containing an 103P2D6 polynucleotide, fragment, analog or homologue thereof, including but not limited to phages, plasmids, phagemids, cosmids, YACs, BACs, as well as various viral and non-viral vectors well known in the art, and cells transformed or transfected with such recombinant DNA or RNA molecules. Methods for generating such molecules are well known (see, for example, Sambrook et al, 1989, supra).
[0100] The invention further provides a host-vector system comprising a recombinant DNA molecule containing a 103P2D6 polynucleotide, fragment, analog or homologue thereof within a suitable prokaryotic or eukaryotic host cell. Examples of suitable eukaryotic host cells include a yeast cell, a plant cell, or an animal cell, such as a mammalian cell or an insect cell (e.g., a baculovirus-infectible cell such as an Sf9 or HighFive cell). Examples of suitable mammalian cells include various prostate cancer cell lines such as DU145 and TsuPr1, other transfectable or transducible prostate cancer cell lines, primary cells (PrEC), as well as a number of mammalian cells routinely used for the expression of recombinant proteins (e.g., COS, CHO, 293, 293T cells). More particularly, a polynucleotide comprising the coding sequence of 103P2D6 or a fragment, analog or homolog thereof can be used to generate 103P2D6 proteins or fragments thereof using any number of host-vector systems routinely used and widely known in the art.
[0101] A wide range of host-vector systems suitable for the expression of 103P2D6 proteins or fragments thereof are available (see, for example, Sambrook et al., 1989, supra; Current Protocols in Molecular Biology, 1995, supra). Preferred vectors for mammalian expression include but are not limited to pcDNA 3.1 myc-His-tag (Invitrogen) and the retroviral vector pSRαtkneo (Muller et al., 1991, MCB 11:1785). Using these expression vectors, 103P2D6 can be expressed in several prostate cancer and non-prostate cell lines, including for example 293, 293T, rat-1, NIH 3T3 and TsuPr1. The host-vector systems of the invention are useful for the production of a 103P2D6 protein or fragment thereof. Such host-vector systems can be employed to study the functional properties of 103P2D6 and 103P2D6 mutations or analogs.
[0102] Recombinant human 103P2D6 protein or an analog or homolog or fragment thereof can be produced by mammalian cells transfected with a construct encoding a 103P2D6-related nucleotide. For example, 293T cells can be transfected with an expression plasmid encoding 103P2D6 or fragment, analog or homolog thereof, the 103P2D6 or related protein is expressed in the 293T cells, and the recombinant 103P2D6 protein is isolated using standard purification methods (e.g., affinity purification using anti-103P2D6 antibodies). In another embodiment, a 103P2D6 coding sequence is subcloned into the retroviral vector pSRαMSVtkneo and used to infect various mammalian cell lines, such as NIH 3T3, TsuPr1, 293 and rat-1 in order to establish 103P2D6 expressing cell lines. Various other expression systems well known in the art can also be employed. Expression constructs encoding a leader peptide joined in frame to the 103P2D6 coding sequence can be used for the generation of a secreted form of recombinant 103P2D6 protein.
[0103] As discussed herein, redundancy in the genetic code permits variation in 103P2D6 gene sequences. In particular, it is known in the art that specific host species often have specific codon preferences, and thus one can adapt the disclosed sequence as preferred for a desired host. For example, preferred analog codon sequences typically have rare codons (i.e., codons having a usage frequency of less than about 20% in known sequences of the desired host) replaced with higher frequency codons. Codon preferences for a specific species are calculated, for example, by utilizing codon usage tables available on the Internet such as: http://www.dna.affrc.go.jp/~nakamura/codon.html.
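The replacement of rare codons can be sketched as follows. This is an illustration under stated assumptions: the usage fractions in `USAGE` are invented values for an imaginary host, not figures taken from any codon usage table, only three amino acids are covered, and the 20% threshold is interpreted here as a fraction among synonymous codons. The function name `replace_rare_codons` is hypothetical.

```python
# Hypothetical synonymous-codon usage fractions for an imaginary host.
USAGE = {
    "L": {"CTG": 0.40, "CTC": 0.20, "CTT": 0.13, "TTG": 0.13, "CTA": 0.07, "TTA": 0.07},
    "R": {"AGA": 0.22, "AGG": 0.20, "CGG": 0.20, "CGC": 0.19, "CGA": 0.11, "CGT": 0.08},
    "M": {"ATG": 1.00},
}
# Reverse lookup: codon -> amino acid, built from the toy table above.
CODON_TO_AA = {codon: aa for aa, table in USAGE.items() for codon in table}


def replace_rare_codons(cds, threshold=0.20):
    """Swap each codon used below `threshold` for the most frequent
    synonymous codon, leaving the encoded amino acids unchanged."""
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3].upper()
        aa = CODON_TO_AA.get(codon)
        if aa is not None and USAGE[aa][codon] < threshold:
            # Pick the highest-frequency synonymous codon for this residue.
            codon = max(USAGE[aa], key=USAGE[aa].get)
        out.append(codon)  # codons absent from the toy table pass through
    return "".join(out)


optimized = replace_rare_codons("ATGTTACGTCTG")
```

In this toy input the TTA and CGT codons fall below the threshold and are replaced, while ATG and CTG are kept.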
[0104] Additional sequence modifications are known to enhance protein expression in a cellular host. These include elimination of sequences encoding spurious polyadenylation signals, exon/intron splice site signals, transposon-like repeats, and/or other such well-characterized sequences that are deleterious to gene expression. The GC content of the sequence is adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. Where possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures. Other useful modifications include the addition of a translational initiation consensus sequence at the start of the open reading frame, as described in Kozak,
[0105] IV.) 103P2D6-Related Proteins
[0106] Another aspect of the present invention provides 103P2D6-related proteins. Specific embodiments of 103P2D6 proteins comprise a polypeptide having all or part of the amino acid sequence of human 103P2D6 as shown in
[0107] In general, naturally occurring allelic variants of human 103P2D6 share a high degree of structural identity and homology (e.g., 90% or more homology). Typically, allelic variants of the 103P2D6 protein contain conservative amino acid substitutions within the 103P2D6 sequences described herein or contain a substitution of an amino acid from a corresponding position in a homologue of 103P2D6. One class of 103P2D6 allelic variants are proteins that share a high degree of homology with at least a small region of a particular 103P2D6 amino acid sequence, but further contain a radical departure from the sequence, such as a non-conservative substitution, truncation, insertion or frame shift. In comparisons of protein sequences, the terms similarity, identity, and homology each have a distinct meaning as appreciated in the field of genetics. Moreover, orthology and paralogy can be important concepts describing the relationship of members of a given protein family in one organism to the members of the same family in other organisms.
[0108] Amino acid abbreviations are provided in Table II. Conservative amino acid substitutions can frequently be made in a protein without altering either the conformation or the function of the protein. Such changes include substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these hydrophobic amino acids; aspartic acid (D) for glutamic acid (E) and vice versa; glutamine (Q) for asparagine (N) and vice versa; and serine (S) for threonine (T) and vice versa. Other substitutions can also be considered conservative, depending on the environment of the particular amino acid and its role in the three-dimensional structure of the protein. For example, glycine (G) and alanine (A) can frequently be interchangeable, as can alanine (A) and valine (V). Methionine (M), which is relatively hydrophobic, can frequently be interchanged with leucine and isoleucine, and sometimes with valine. Lysine (K) and arginine (R) are frequently interchangeable in locations in which the significant feature of the amino acid residue is its charge and the differing pK's of these two amino acid residues are not significant. Still other changes can be considered “conservative” in particular environments (see, e.g. Table III herein; pages 13-15 “Biochemistry” 2
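The substitution classes named in this paragraph can be encoded in a small lookup, shown here as a sketch. Only the pairings stated in the text are included; the labels "conservative" and "context-dependent" and the function name `classify_substitution` are choices made for this illustration, and Table III is not reproduced.

```python
# Groups within which any member may replace any other (per the text).
ALWAYS = [frozenset("IVL"), frozenset("DE"), frozenset("QN"), frozenset("ST")]
# Pairs the text describes as interchangeable depending on environment.
CONTEXT = [frozenset("GA"), frozenset("AV"), frozenset("ML"),
           frozenset("MI"), frozenset("MV"), frozenset("KR")]


def classify_substitution(a, b):
    """Classify replacing residue `a` with residue `b` (one-letter codes)."""
    a, b = a.upper(), b.upper()
    if a == b:
        return "identical"
    pair = {a, b}
    if any(pair <= group for group in ALWAYS):
        return "conservative"
    if any(pair == group for group in CONTEXT):
        return "context-dependent"
    return "non-conservative"


examples = [classify_substitution(*p) for p in ("IL", "KR", "DK", "AA")]
```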
[0109] Embodiments of the invention disclosed herein include a wide variety of art-accepted variants or analogs of 103P2D6 proteins such as polypeptides having amino acid insertions, deletions and substitutions. 103P2D6 variants can be made using methods known in the art such as site-directed mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis (Carter et al.,
[0110] Scanning amino acid analysis can also be employed to identify one or more amino acids along a contiguous sequence that is involved in a specific biological activity such as a protein-protein interaction. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-chain conformation of the variant. Alanine is also typically preferred because it is the most common amino acid. Further, it is frequently found in both buried and exposed positions (Creighton,
[0111] As defined herein, 103P2D6 variants, analogs or homologs, have the distinguishing attribute of having at least one epitope that is “cross reactive” with a 103P2D6 protein having the amino acid sequence of SEQ ID NO: 2. As used in this sentence, “cross reactive” means that an antibody or T cell that specifically binds to an 103P2D6 variant also specifically binds to the 103P2D6 protein having the amino acid sequence of SEQ ID NO: 2. A polypeptide ceases to be a variant of the protein shown in SEQ ID NO: 2 when it no longer contains any epitope capable of being recognized by an antibody or T cell that specifically binds to the 103P2D6 protein. Those skilled in the art understand that antibodies that recognize proteins bind to epitopes of varying size, and a grouping of the order of about four or five amino acids, contiguous or not, is regarded as a typical number of amino acids in a minimal epitope. See, e.g., Nair et al., J. Immunol 2000 165(12): 6949-6955; Hebbes et al., Mol Immunol (1989) 26(9):865-73; Schwartz et al., J Immunol (1985) 135(4):2598-608.
[0112] Another class of 103P2D6-related protein variants share 70%, 75%, 80%, 85% or 90% or more similarity with the amino acid sequence of SEQ ID NO: 2 or a fragment thereof. Another specific class of 103P2D6 protein variants or analogs comprise one or more of the 103P2D6 biological motifs described herein or presently known in the art. Thus, encompassed by the present invention are analogs of 103P2D6 fragments (nucleic or amino acid) that have altered functional (e.g. immunogenic) properties relative to the starting fragment. It is to be appreciated that motifs now or which become part of the art are to be applied to the nucleic or amino acid sequences of
[0113] As discussed herein, embodiments of the claimed invention include polypeptides containing less than the 532 amino acid sequence of the 103P2D6 protein shown in
[0114] Moreover, representative embodiments of the invention disclosed herein include polypeptides consisting of about amino acid 1 to about amino acid 10 of the 103P2D6 protein shown in
[0115] 103P2D6-related proteins are generated using standard peptide synthesis technology or using chemical cleavage methods well known in the art. Alternatively, recombinant methods can be used to generate nucleic acid molecules that encode a 103P2D6-related protein. In one embodiment, nucleic acid molecules provide a means to generate defined fragments of the 103P2D6 protein (or variants, homologs or analogs thereof).
[0116] IV.A.) Motif-Bearing Protein Embodiments
[0117] Additional illustrative embodiments of the invention disclosed herein include 103P2D6 polypeptides comprising the amino acid residues of one or more of the biological motifs contained within the 103P2D6 polypeptide sequence set forth in
[0118] Motif bearing subsequences of the 103P2D6 protein are set forth and identified in Table XIX.
[0119] Table XX sets forth several frequently occurring motifs based on pfam searches (http://pfam.wustl.edu/). The columns of Table XX list (1) motif name abbreviation, (2) percent identity found amongst the different members of the motif family, (3) motif name or description and (4) most common function; location information is included if the motif is relevant for location.
[0120] Polypeptides comprising one or more of the 103P2D6 motifs discussed above are useful in elucidating the specific characteristics of a malignant phenotype in view of the observation that the 103P2D6 motifs discussed above are associated with growth dysregulation and because 103P2D6 is overexpressed in certain cancers (See, e.g., Table I). Casein kinase II, cAMP and cGMP-dependent protein kinase, and Protein Kinase C, for example, are enzymes known to be associated with the development of the malignant phenotype (see e.g. Chen et al., Lab Invest., 78(2): 165-174 (1998); Gaiddon et al., Endocrinology 136(10): 4331-4338 (1995); Hall et al., Nucleic Acids Research 24(6): 1119-1126 (1996); Peterziel et al., Oncogene 18(46): 6322-6329 (1999) and O'Brian, Oncol. Rep. 5(2): 305-309 (1998)). Moreover, both glycosylation and myristoylation are protein modifications also associated with cancer and cancer progression (see e.g. Dennis et al., Biochim. Biophys. Acta 1473(1):21-34 (1999); Raju et al., Exp. Cell Res. 235(1): 145-154 (1997)). Amidation is another protein modification also associated with cancer and cancer progression (see e.g. Treston et al., J. Natl. Cancer Inst. Monogr. (13): 169-175 (1992)).
[0121] In another embodiment, proteins of the invention comprise one or more of the immunoreactive epitopes identified in accordance with art-accepted methods, such as the peptides set forth in Tables V-XVIII. CTL epitopes can be determined using specific algorithms to identify peptides within an 103P2D6 protein that are capable of optimally binding to specified HLA alleles (e.g., Table IV (A) and Table IV (B); Epimatrix™ and Epimer™, Brown University, http://www.brown.edu/Research/TB-HIV_Lab/epimatrix/epimatrix.html; and BIMAS, http://bimas.dcrt.nih.gov/). Moreover, processes for identifying peptides that have sufficient binding affinity for HLA molecules and which are correlated with being immunogenic epitopes are well known in the art, and are carried out without undue experimentation. In addition, processes for identifying peptides that are immunogenic epitopes are well known in the art, and are carried out without undue experimentation either in vitro or in vivo.
[0122] Also known in the art are principles for creating analogs of such epitopes in order to modulate immunogenicity. For example, one begins with an epitope that bears a CTL or HTL motif (see, e.g., the HLA Class I motifs of Table IV (A) and the HTL motif of Table IV (B)). The epitope is analoged by substituting out an amino acid at one of the specified positions, and replacing it with another amino acid specified for that position.
[0123] A variety of references reflect the art regarding the identification and generation of epitopes in a protein of interest as well as analogs thereof. See, for example, WO 9733602 to Chestnut et al.; Sette, Immunogenetics 1999 50(3-4): 201-212; Sette et al., J. Immunol. 2001 166(2): 1389-1397; Sidney et al., Hum. Immunol. 1997 58(1): 12-20; Kondo et al., Immunogenetics 1997 45(4): 249-258; Sidney et al., J. Immunol. 1996 157(8): 3480-90; Falk et al., Nature 351: 290-6 (1991); Hunt et al., Science 255:1261-3 (1992); Parker et al., J. Immunol. 149:3580-7 (1992); Parker et al., J. Immunol. 152:163-75 (1994); Kast et al., 1994 152(8): 3904-12; Borras-Cuesta et al., Hum. Immunol. 2000 61(3): 266-278; Alexander et al., J. Immunol. 2000 164(3): 1625-1633; Alexander et al., PMID: 7895164, UI: 95202582; O'Sullivan et al., J. Immunol. 1991 147(8): 2663-2669; Alexander et al., Immunity 1994 1(9): 751-761; and Alexander et al., Immunol. Res. 1998 18(2): 79-92.
[0124] Related embodiments of the invention include polypeptides comprising combinations of the different motifs set forth in Table XIX, and/or one or more of the predicted CTL epitopes of Table V through Table XVIII, and/or one or more of the T cell binding motifs known in the art. Preferred embodiments contain no insertions, deletions or substitutions either within the motifs or the intervening sequences of the polypeptides. In addition, embodiments which include a number of N-terminal and/or C-terminal amino acid residues on either side of these motifs may be desirable (to, for example, include a greater portion of the polypeptide architecture in which the motif is located). Typically the number of N-terminal and/or C-terminal amino acid residues on either side of a motif is between about 1 and about 100 amino acid residues, preferably about 5 to about 50 amino acid residues.
[0125] 103P2D6-related proteins are embodied in many forms, preferably in isolated form. A purified 103P2D6 protein molecule will be substantially free of other proteins or molecules that impair the binding of 103P2D6 to antibody, T cell or other ligand. The nature and degree of isolation and purification will depend on the intended use. Embodiments of 103P2D6-related proteins include purified 103P2D6-related proteins and functional, soluble 103P2D6-related proteins. In one embodiment, a functional, soluble 103P2D6 protein or fragment thereof retains the ability to be bound by antibody, T cell or other ligand.
[0126] The invention also provides 103P2D6 proteins comprising biologically active fragments of the 103P2D6 amino acid sequence shown in
[0127] 103P2D6-related polypeptides that contain particularly interesting structures can be predicted and/or identified using various analytical techniques well known in the art, including, for example, the methods of Chou-Fasman, Garnier-Robson, Kyte-Doolittle, Eisenberg, Karplus-Schultz or Jameson-Wolf analysis, or on the basis of immunogenicity. Fragments that contain such structures are particularly useful in generating subunit-specific anti-103P2D6 antibodies, or T cells or in identifying cellular factors that bind to 103P2D6.
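Of the analytical techniques listed, Kyte-Doolittle hydropathy analysis is straightforward to sketch: each residue is assigned a hydropathy value and the values are averaged over a sliding window. The scale values below are those of the published Kyte-Doolittle scale; the window length of 7, the example peptides, and the function name `hydropathy_profile` are assumptions made for this illustration and are not taken from the disclosure.

```python
# Kyte-Doolittle hydropathy values per one-letter amino acid code.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=7):
    """Return the mean hydropathy of each window of `window` residues,
    one value per window start position, left to right."""
    seq = seq.upper()
    if window > len(seq):
        return []
    scores = [KD[aa] for aa in seq]
    return [sum(scores[i:i + window]) / window
            for i in range(len(seq) - window + 1)]


# Placeholder peptide (assumption for illustration only).
profile = hydropathy_profile("ACDEFGHIK", window=7)
```

Windows with strongly positive averages suggest hydrophobic stretches, while strongly negative averages suggest hydrophilic, potentially surface-exposed stretches.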
[0128] CTL epitopes can be determined using specific algorithms to identify peptides within an 103P2D6 protein that are capable of optimally binding to specified HLA alleles (e.g., Table IV (A) and Table IV (B); Epimatrix™ and Epimer™, Brown University (http://www.brown.edu/Research/TB-HIV_Lab/epimatrix/epimatrix.html); and BIMAS, http://bimas.dcrt.nih.gov/). Illustrating this, peptide epitopes from 103P2D6 that are presented in the context of human MHC class I molecules HLA-A1, A2, A3, A11, A24, B7 and B35 were predicted (Tables V-XVIII). Specifically, the complete amino acid sequence of the 103P2D6 protein was entered into the HLA Peptide Motif Search algorithm found in the Bioinformatics and Molecular Analysis Section (BIMAS) web site listed above. The HLA peptide motif search algorithm was developed by Dr. Ken Parker based on binding of specific peptide sequences in the groove of HLA Class I molecules and specifically HLA-A2 (see, e.g., Falk et al., Nature 351: 290-6 (1991); Hunt et al., Science 255:1261-3 (1992); Parker et al., J. Immunol. 149:3580-7 (1992); Parker et al., J. Immunol. 152:163-75 (1994)). This algorithm allows location and ranking of 8-mer, 9-mer, and 10-mer peptides from a complete protein sequence for predicted binding to HLA-A2 as well as numerous other HLA Class I molecules. Many HLA class I binding peptides are 8-, 9-, 10- or 11-mers. For example, for class I HLA-A2, the epitopes preferably contain a leucine (L) or methionine (M) at position 2 and a valine (V) or leucine (L) at the C-terminus (see, e.g., Parker et al., J. Immunol. 149:3580-7 (1992)). Selected results of 103P2D6 predicted binding peptides are shown in Tables V-XVIII herein. In Tables V-XVIII, the top 50 ranking candidates, 9-mers and 10-mers, for each family member are shown along with their location, the amino acid sequence of each specific peptide, and an estimated binding score. The binding score corresponds to the estimated half-time of dissociation of complexes containing the peptide at 37° C. at pH 6.5. Peptides with the highest binding score are predicted to be the most tightly bound to HLA Class I on the cell surface for the greatest period of time and thus represent the best immunogenic targets for T-cell recognition.
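The HLA-A2 anchor preference stated above (L or M at position 2; V or L at the C-terminus) can be applied to a protein sequence with a simple scan, sketched below. This is only a motif filter: it does not reproduce the BIMAS algorithm and computes no binding score or ranking. The input sequence is an invented placeholder, not the 103P2D6 sequence, and the function name `hla_a2_motif_candidates` is hypothetical.

```python
def hla_a2_motif_candidates(protein, lengths=(9, 10)):
    """Return (1-based start, peptide) for every peptide of the given
    lengths with L/M at position 2 and V/L at the C-terminus."""
    protein = protein.upper()
    hits = []
    for k in lengths:
        for i in range(len(protein) - k + 1):
            pep = protein[i:i + k]
            # pep[1] is position 2; pep[-1] is the C-terminal residue.
            if pep[1] in "LM" and pep[-1] in "VL":
                hits.append((i + 1, pep))
    return hits


# Placeholder sequence (assumption for illustration only).
candidates = hla_a2_motif_candidates("GALAKAAAAVG")
```

Candidates passing such a filter would still need scoring by a dedicated prediction tool and experimental confirmation of binding, as described in the paragraphs that follow.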
[0129] Actual binding of peptides to an HLA allele can be evaluated by stabilization of HLA expression on the antigen-processing defective cell line T2 (see, e.g., Xue et al., Prostate 30:73-8 (1997) and Peshwa et al., Prostate 36:129-38 (1998)). Immunogenicity of specific peptides can be evaluated in vitro by stimulation of CD8+ cytotoxic T lymphocytes (CTL) in the presence of antigen presenting cells such as dendritic cells.
[0130] It is to be appreciated that every epitope predicted by the BIMAS site or the Epimer™ and Epimatrix™ sites, or specified by the HLA class I or class II motifs available in the art or which become part of the art, such as those set forth in Table IV (A) and Table IV (B), is to be “applied” to the 103P2D6 protein. As used in this context, “applied” means that the 103P2D6 protein is evaluated, e.g., visually or by computer-based pattern-finding methods, as appreciated by those of skill in the relevant art. Every subsequence of the 103P2D6 protein of 8, 9, 10, or 11 amino acid residues that bears an HLA Class I motif, or a subsequence of 9 or more amino acid residues that bears an HLA Class II motif, is within the scope of the invention.
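By way of illustration only, a computer-based pattern-finding evaluation of the kind referred to above might be sketched as follows. The sketch assumes the HLA-A2 anchor preferences noted in paragraph [0128] (L or M at position 2; V or L at the C-terminus) and enumerates every 8-, 9-, 10-, and 11-residue window of a sequence. It is not the BIMAS, Epimer™ or Epimatrix™ algorithm and assigns no binding score; the names `Hit` and `scan_a2_motif` and the toy sequence are hypothetical and are not taken from this disclosure.

```python
from typing import Iterator, NamedTuple, Sequence


class Hit(NamedTuple):
    start: int    # 1-based position of the peptide's first residue
    length: int   # peptide length in residues
    peptide: str  # the matching subsequence


# HLA-A2 anchor preferences as stated in the text (an illustrative motif
# only; other alleles would need their own anchor sets).
P2_ANCHORS = frozenset("LM")     # position 2
CTERM_ANCHORS = frozenset("VL")  # C-terminal residue


def scan_a2_motif(protein: str,
                  lengths: Sequence[int] = (8, 9, 10, 11)) -> Iterator[Hit]:
    """Yield every window of the given lengths that carries both anchors."""
    seq = protein.upper()
    for k in lengths:
        # range() is empty when the sequence is shorter than k
        for i in range(len(seq) - k + 1):
            pep = seq[i:i + k]
            if pep[1] in P2_ANCHORS and pep[-1] in CTERM_ANCHORS:
                yield Hit(i + 1, k, pep)


if __name__ == "__main__":
    toy = "ALAAAAAAVGG"  # made-up sequence, NOT the 103P2D6 sequence
    for hit in scan_a2_motif(toy):
        print(hit.start, hit.length, hit.peptide)
```

A scan of this kind only flags candidate subsequences that bear the motif; ranking the candidates by predicted binding would require allele-specific coefficients such as those used by the BIMAS site, which are not reproduced here.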
[0131] IV.B.) Expression of 103P2D6-Related Proteins
[0132] In an embodiment described in the examples that follow, 103P2D6 can be conveniently expressed in cells (such as 293T cells) transfected with a commercially available expression vector such as a CMV-driven expression vector encoding 103P2D6 with a C-terminal 6×His and MYC tag (pcDNA3.1/mycHIS, Invitrogen or Tag5, GenHunter Corporation, Nashville, Tenn.). The Tag5 vector provides an IgGK secretion signal that can be used to facilitate the production of a secreted 103P2D6 protein in transfected cells. The secreted HIS-tagged 103P2D6 in the culture media can be purified, e.g., on a nickel column using standard techniques.
[0133] IV.C.) Modifications of 103P2D6-Related Proteins
[0134] Modifications of 103P2D6-related proteins such as covalent modifications are included within the scope of this invention. One type of covalent modification includes reacting targeted amino acid residues of a 103P2D6 polypeptide with an organic derivatizing agent that is capable of reacting with selected side chains or the N- or C-terminal residues of the 103P2D6. Another type of covalent modification of the 103P2D6 polypeptide included within the scope of this invention comprises altering the native glycosylation pattern of a protein of the invention. Another type of covalent modification of 103P2D6 comprises linking the 103P2D6 polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Pat. Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.
[0135] The 103P2D6-related proteins of the present invention can also be modified to form a chimeric molecule comprising 103P2D6 fused to another, heterologous polypeptide or amino acid sequence. Such a chimeric molecule can be synthesized chemically or recombinantly. A chimeric molecule can have a protein of the inventi