| QPTGRSWGQ | (SEQ ID NO 93) | |
| RSEGRTSWAQ | (SEQ ID NO 220) | |
| RTEGRTSWAQ | (SEQ ID NO 221) | |
| SRRQPIPRARRTEGRSWAQ | (SEQ ID NO 268) | |
| LEWRNTSGLYVL | (SEQ ID NO 83) | |
| VNYRNASGIYHI | (SEQ ID NO 126) | |
| QHYRNISGIYHV | (SEQ ID NO 127) | |
| EHYRNASGIYHI | (SEQ ID NO 128) | |
| IHYRNASGIYHI | (SEQ ID NO 224) | |
| VPYRNASGIYHV | (SEQ ID NO 84) | |
| VNYRNASGIYHI | (SEQ ID NO 225) | |
| VNYRNASGVYHI | (SEQ ID NO 226) | |
| VNYHNTSGIYHL | (SEQ ID NO 227) | |
| QHYRNASGIYHV | (SEQ ID NO 228) | |
| QHYRNVSGIYHV | (SEQ ID NO 229) | |
| IHYRNASDGYYI | (SEQ ID NO 230) | |
| LQVKNTSSSYMV | (SEQ ID NO 231) | |
| VYEADDVILHT | (SEQ ID NO 85) | |
| VYETEHHILHL | (SEQ ID NO 129) | |
| VYEADHHIMHL | (SEQ ID NO 130) | |
| VYETDHHILHL | (SEQ ID NO 131) | |
| VYEADNLILHA | (SEQ ID NO 86) | |
| VWQLRAIVLHV | (SEQ ID NO 232) | |
| VYEADYHILHL | (SEQ ID NO 233) | |
| VYETDNHILHL | (SEQ ID NO 234) | |
| VYETENHILHL | (SEQ ID NO 235) | |
| VFETVHHILHL | (SEQ ID NO 236) | |
| VFETEHHILHL | (SEQ ID NO 237) | |
| VFETDHHIMHL | (SEQ ID NO 238) | |
| VYETENHILHL | (SEQ ID NO 239) | |
| VYEADALILHA | (SEQ ID NO 240) | |
| VQDGNTSTCWTPV | (SEQ ID NO 87) | |
| VQDGNTSACWTPV | (SEQ ID NO 241) | |
| VRVGNQSRCWVAL | (SEQ ID NO 132) | |
| VRTGNTSRCWVPL | (SEQ ID NO 133) | |
| VRAGNVSRCWTPV | (SEQ ID NO 134) | |
| EEKGNISRCWIPV | (SEQ ID NO 242) | |
| VKTGNQSRCWVAL | (SEQ ID NO 243) | |
| VRTGNQSRCWVAL | (SEQ ID NO 244) | |
| VKTGNQSRCWIAL | (SEQ ID NO 245) | |
| VKTGNVSRCWIPL | (SEQ ID NO 247) | |
| VKTGNVSRCWISL | (SEQ ID NO 248) | |
| VRKDNVSRCWVQI | (SEQ ID NO 249) | |
| VRYVGATTAS | (SEQ ID NO 89) | |
| APYIGAPLES | (SEQ ID NO 135) | |
| APYVGAPLES | (SEQ ID NO 136) | |
| AVSMDAPLES | (SEQ ID NO 137) | |
| APSLGAVTAP | (SEQ ID NO 90) | |
| APSFGAVTAP | (SEQ ID NO 250) | |
| VSQPGALTKG | (SEQ ID NO 251) | |
| VKYVGATTAS | (SEQ ID NO 252) | |
| APYIGAPVES | (SEQ ID NO 253) | |
| AQHLNAPLES | (SEQ ID NO 254) | |
| SPYVGAPLEP | (SEQ ID NO 255) | |
| SPYAGAPLEP | (SEQ ID NO 256) | |
| APYLGAPLEP | (SEQ ID NO 257) | |
| APYLGAPLES | (SEQ ID NO 258) | |
| APYVGAPLES | (SEQ ID NO 259) | |
| VPYLGAPLTS | (SEQ ID NO 260) | |
| APHLRAPLSS | (SEQ ID NO 261) | |
| APYLGAPLTS | (SEQ ID NO 262) | |
| RPRRHQTVQT | (SEQ ID NO 91) | |
| QPRRHWTTQD | (SEQ ID NO 138) | |
| RPRRHWTTQD | (SEQ ID NO 139) | |
| RPRQHATVQN | (SEQ ID NO 92) | |
| RPRQHATVQD | (SEQ ID NO 263) | |
| SPQHHKFVQD | (SEQ ID NO 264) | |
| RPRRLWTTQE | (SEQ ID NO 265) | |
| PPRIHETTQD | (SEQ ID NO 266) | |
| TISYANGSGPSDDK | (SEQ ID NO 267) | |
[0002] The present invention relates to new nucleotide and amino acid sequences corresponding to the coding region of a new type
[0003] The technical problem underlying the present invention is to provide new type-specific sequences of the Core, the E
[0004] Hepatitis C viruses (HCV) have been found to be the major cause of non-A, non-B hepatitis. The sequences of cDNA clones covering the complete genome of several prototype isolates have been determined (Kato et al., 1990; Choo et al., 1991; Okamoto et al., 1991; Okamoto et al., 1992). Comparison of these isolates shows that the variability in nucleotide sequences can be used to distinguish at least 2 different genotypes, type
[0005] The identification of type
[0006] New sequences of the
[0007] The aim of the present invention is to provide new HCV nucleotide and amino acid sequences enabling the detection of HCV infection.
[0008] Another aim of the present infection is to provide new nucleotide and amino acid HCV sequences enabling the classification of infected biological fluids into different serological groups unambiguously linked to types and subtypes at the genome level.
[0009] Another aim of the present invention is to provide new nucleotide and amino acid HCV sequences ameliorating the overall HCV detection rate.
[0010] Another aim of the present invention is to provide new HCV sequences, useful for the design of HCV vaccine compositions.
[0011] Another aim of the present invention is to provide a pharmaceutical composition consisting of antibodies raised against the polypeptides encoded by these new HCV sequences, for therapy or diagnosis.
[0012] The present invention relates more particularly to a composition comprising or consisting of at least one polynucleic acid containing at least 5, and preferably 8 or more contiguous nucleotides selected from at least one of the following HCV sequences:
[0013] an HCV type
[0014] the region spanning positions
[0015] the region spanning positions
[0016] the region spanning positions
[0017] the region spanning positions
[0018] an HCV subtype
[0019] more particularly the coding regions of the above-specified regions;
[0020] an HCV subtype
[0021] an HCV type
[0022] an HCV type
[0023] with said nucleotide numbering being with respect to the numbering of HCV nucleic acids as shown in Table 1, and with said polynucleic acids containing at least one nucleotide difference with known HCV (type
[0024] It is to be noted that the nucleotide difference in the polynucleic acids of the invention may involve or not an amino acid difference in the corresponding amino acid sequences coded by said polynucleic acids.
[0025] According to a preferred embodiment, the present invention relates to a composition comprising or containing at least one polynucleic acid encoding an HCV polyprotein, with said polynucleic acid containing at least 5, preferably at least 8 nucleotides corresponding to at least part of an HCV nucleotide sequence encoding an HCV polyprotein, and with said HCV polyprotein containing in its sequence at least one of the following amino acid residues: L7, Q43, M44, S60, R67, Q70, T71, A79, A87, N106, K115, A127, A190, S130, V134, G142, 1144, E152, A157, V158, P165, S177 or Y177, I178, V180 or E180 or F182, R184, I186, H187, T189, A190, S191 or G191, Q192 or L192 or I192 or V192 or E192, N193 or H193 or P193, W194 or Y194, H195, A197 or I197 or V197 or T197, V202, I203 or L203, Q208, A210, V212, F214, T216, R217 or D217 or E217 or V217, H218 or N218, H219 or V219 or L219, L227 or I227, M231 or E231 or Q231, T232 or D232 or A232 or K232, Q235 or I235, A237 or T237, I242, I246, S247, S248, V249, S250 or Y250, I251 or V251 or M251 or F251, D252, T254 or V254, L255 or V255, E256 or A256, M258 or F258 or V258, A260 or Q260 or S260, A261, T264 or Y264, M265, I266 or A266, A267, G268 or T268, F271 or M271 or V271, I277, M280 or H280, I284 or A284 or L84, V274, V291, N292 or S292, R293 or I293 or Y293, Q294 or R294, L297 or I297 or Q297, A299 or K299 or Q299, N303 or T303, T308 or L308, T310 or F310 or A310 or D310 or V310, L313, G317 or Q317, L333, S351, A358, A359, A363, S364, A366, T369, L373, F376, Q386, I387, S392, I399, F402, I403, R405, D454, A461, A463, T464, K484, Q500, E501, S521, K522, H524, N528, S531, S532, V534, F536, F537, M539, I546, C1282, A1283, H1310, V1312, Q1321, P1368, V1372, V1373, K1405, Q1406, S1409, A1424, A1429, C1435, S1436, S1456, H1496, A1504, D1510, D1529, I1543, N1567, D1556, N1567, M1572. Q1579, L1581, S1583, F1585, V1595, E1606 or T1606, M1611, V1612 or L1612, P1630. C1636, P1651, T1656 or I1656, L1663, V1667, V1677, A1681, H1685, E1687, G1689, V1695, A1700, Q1704, Y1705, A1713, A1714 or S1714, M1718, D1719, A1721 or T1721, R1722, A1723 or V1723, H1726 or G1726, E1730, V1732, F1735, I1736, S1737, R1738, T1739, G1740, Q1741, K1742, Q1743, A1744, T1745, L1746, E1747 or K1747, I1749, A1750, T1751 or A1751, V1753, N1755, K1756, A1757, P1758, A1759, H1762, T1763, Y1764, P2645, A2647, K2650, K2653 or L2653, S2664, N2673, F2680, K2681, L2686, H2692, Q2695 or L2695 or I2695, V2712, F2715, V2719 or Q2719, T2722, T2724, S2725, R2726, G2729, Y2735, H2739, I2748, G2746 or I2746, I2748, P2752 or K2752, P2754 or T2754, T2757 or P2757, with said notation being composed of a letter representing the amino acid residue by its one-letter code, and a number representing the amino acid numbering according to Kato et al., 1990.
[0026] Each of the above-mentioned residues can be found in any of
[0027] More particularly, a polynucleic acid contained in the composition according to the present invention contains at least 5, preferably 8, or more contiguous nucleotides corresponding to a sequence of contiguous nucleotides selected from at least one of HCV sequences encoding the following new HCV amino acid sequences:
[0028] new sequences spanning amino acid positions
[0029] new sequences spanning amino acid positions
[0030] new sequences spanning amino acid positions
[0031] new sequences spanning amino acid positions
[0032] Using the LiPA system mentioned above, Brazilian blood donors with high titer type
[0033] The term “polynucleic acid” refers to a single stranded or double stranded nucleic acid sequence which may contain at least 5 contiguous nucleotides to the complete nucleotide sequence (f.i. at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more contiguous nucleotides). A polynucleic acid which is up till about 100 nucleotides in length is often also referred to as an oligonucleotide. A polynucleic acid may consist of deoxyribonucleotides or ribonucleotides, nucleotide analogues or modified nucleotides, or may have been adapted for therapeutic purposes. A polynucleic acid may also comprise a double stranded cDNA clone which can be used for cloning purposes, or for in vivo therapy, or prophylaxis.
[0034] The term “polynucleic acid composition” refers to any kind of composition comprising essentially said polynucleic acids. Said composition may be of a diagnostic or a therapeutic nature.
[0035] The expression “nucleotides corresponding to” refers to nucleotides which are homologous or complementary to an indicated nucleotide sequence or region within a specific HCV sequence.
[0036] The term “coding region” corresponds to the region of the HCV genome that encodes thy HCV polyprotein. In fact, it comprises the complete genome with the exception of the
[0037] The term “HCV polyprotein” refers to the HCV polyprotein of the HCV-J isolate (Kato et al., 1990). The adenine residue at position
[0038] This adenine is designated as position
TABLE 1 Positions Positions Positions Positions described in described for described for described for the HCV-J HCV-1 HC-J6, HC-J8 present (Kato et al., (Choo et al., (Okamoto et Region invention* 1990) 1991) al., 1992) Nucleotides NS5b 8023/8235 8352/8564 8026/8238 8433/8645 7932/8271 8261/8600 7935/8274 8342/8681 NS3/4 4664/5292 4993/5621 4664/5292 5017/5645 4664/4730 4993/5059 4664/4730 5017/5083 4892/5292 5221/5621 4892/5292 5245/5645 3856/4209 4185/4528 3856/4209 4209/4762 4936/5292 5265/5621 4936/5292 5289/5645 coding 330/9359 1/9033 342/9439 region of present invention Amino NS5b 2675/2745 2675/2745 2676/2746 2698/2768 Acids 2645/2757 2645/2757 2646/2758 2668/2780 NS3/4 1556/1764 1556/1764 1556/1764 1560/1768 1286/1403 1286/1403 1286/1403 1290/1407 1646/1764 1646/1764 1646/1764 1650/1768
[0039] Table 1: Comparison of the HCV nucleotide and amino acid numbering system used in the present invention (*) with the numbering used for other prototype isolates. For example,
[0040] The term “HCV type” corresponds to a group of HCV isolates of which the complete genome shows more than 74% homology at the nucleic acid level, or of which the NS
[0041] More preferably the definition of HCV types is concluded from the classification of HCV isolates according to their nucleotide distances calculated as detailed below
[0042] (1) based on phylogenetic analysis of nucleic acid sequences in the NS5b region between nucleotides
[0043] (2) based on phylogenetic analysis of nucleic acid sequences in the core/E
[0044] (3) based on phylogenetic analysis of nucleic acid sequences in the NS
TABLE 2 Molecular evolutionary distances Core/E1 E1 NS5B NS5B Region 579 bp 384 bp 340 bp 222 bp Isolates* 0.0017 − 0.1347 0.0026 − 0.2031 0.0003 − 0.1151 0.000 − 0.1323 (0.0750 ± 0.0245) (0.0969 ± 0.0289) (0.0637 ± 0.0229) (0.0607 ± 0.0205) Subtypes* 0.1330 − 0.3794 0.1645 − 0.4869 0.1384 − 0.2977 0.117 − 0.3538 (0.2786 ± 0.0363) (0.3761 ± 0.0433) (0.2219 ± 0.0341) (0.2391 ± 0.0399) Types* 0.3479 − 0.6306 0.4309 − 0.9561 0.3581 − 0.6670 0.3457 − 0.7471 (0.4703 ± 0.0525) (0.6308 ± 0.0928) (0.4994 ± 0.0495) (0.5295 ± 0.0627)
[0045] In a comparative phylogenetic analysis of available sequences, ranges of molecular evolutionary distances for different regions of the genome were calculated, based on 19,781 pairwise comparisons by means of the DNA DIST program of the phylogeny inference package PHYLIP version 3.5C (Felsenstein, 1993). The results are shown in Table 2 and indicate that although the majority of distances obtained in each region fit with classification of a certain isolate, only the ranges obtained in the 340 bp NS
[0046] Designation of a number to the different types of HCV and HCV types nomenclature is based on chronological discovery of the different types. The numbering system used in the present invention might still fluctuate according to international conventions or guidelines. For example, “type
[0047] The term “subtype” corresponds to a group of HCV isolates of which the complete polyprotein shows a homology of more than 90% both at the nucleic acid and amino acid levels, or of which the NS
[0048] The term “BR
[0049] It is to be understood that extremely variable regions like the E
[0050] Using these criteria, HCV isolates can be classified into at least 6 types. Several subtypes can clearly be distinguished in types
TABLE 3 HCV CLASSIFICATION OKA- NAKA MOTO MORI O CHA PROTOTYPE 1a I I Pt GI HCV-1, HCV-H, HC-J1 1b II II KI GIII HCV-J, HCV-BK, HCV-T, HC-JK1, HC- J4, HCV-CHINA 1c HC-G9 2a III III K2a GIII HC-J6 2b IV IV K2b GIII HC-J8 2c S83, ARG6, ARG8, I10, T983 2d NE92 3a V V K3 GIV E-b1, Ta, BR36, BR33, HD10, NZL1 3b VI K3 GIV HCV-TR, Tb 3c BE98 4a Z4, GB809-4 4b Z1 4c GB116, GB358, GB215, Z6, Z7 4d DK13 4e GB809-2, CAM600, CAM736 4f CAM622, CAM627 4g GB549 4h GB438 4i CAR4/1205 4j CAR1/501 4k EG29 5a GV SA3, SA4, SA1, SA7, SA11, BE95 6a HK1, HK2, HK3, HK4
[0051] The term “complement” refers to a nucleotide sequence which is complementary to an indicated sequence and which is able to hybridize to the indicated sequences.
[0052] The composition of the invention can comprise many combinations. By way of example, the composition of the invention can comprise:
[0053] two (or more) nucleic acids from the same region or,
[0054] two nucleic acids (or more), respectively from different regions, for the same isolate or for different isolates,
[0055] or nucleic acids from the same regions and from at least two different regions (for the same isolate or for different isolates).
[0056] The present invention relates more particularly to a polynucleic acid composition as defined above, wherein said polynucleic acid corresponds to a nucleotide sequence selected from any of the following HCV type
[0057] an HCV genomic sequence having a homology of at least 67%, preferably more than 69%, more preferably 71%, even more preferably more than 73%, or most preferably more than 76% to any of the sequences as represented in SEQ ID NO 13, 15, 17, 19, 21, 23, 25 or 27 (HD
[0058] an HCV genomic sequence having a homology of at least 65%, preferably more than 67%, preferably more than 69%, even preferably more than 70%, most preferably more than 74% to any of the sequences as represented in SEQ ID NO 13, 15, 17, 19, 21, 23, 25 or 27 (HD
[0059] an HCV genomic sequence as having a homology of at least 79%, more preferably at least 81%, most preferably more than 83% or more to any of the sequences as represented in SEQ ID NO 147 (representing positions
[0060] an HCV genomic sequence of HVC type
[0061] an HCV genomic sequence of HCV type
[0062] an HCV genomic sequence as having a homology of more than 73.5%, preferably more than 74%, most preferably 75% homology to the sequence as represented in SEQ ID NO 29 (HCC
[0063] an HCV genomic sequence having a homology of more than 70%, preferably more than 72%, most preferably more than 74% homology to any of the sequences as represented in SEQ ID NO 29, 31, 33, 35, 37 or 39 (HCC
[0064] an HCV genomic sequence of the BR
[0065] an HCV genomic sequence of the BR
[0066] an HCV genomic sequence of HCV type
[0067] Preferentially the above-mentioned genomic HCV sequences depict sequences from the coding regions of all the above-mentioned sequences.
[0068] According to the nucleotide distance classification system (with said nucleotide distances being calculated as explained above), said sequences of said composition are selected from:
[0069] an HCV genomic sequence being characterized as having a nucleotide distance of less than 0.44, preferably of less than 0.40, most preferably of less than 0.36 to any of the sequences as represented in SEQ ID NO 13, 15, 17, 19, 21, 23, 25 or 27 in the region spanning positions
[0070] an HCV genomic sequence being characterized having a nucleotide distance of less than 0.53, preferably less than 0.49, most preferably of less than 0.45 to any of the sequences as represented in SEQ ID NO 19, 21, 23, 25 or 27 in the region spanning positions
[0071] an HCV genomic sequence characterized having a nucleotide distance of less than 0.15, preferably less than 0.13, and most preferably less than 0.11 to any of the sequences as represented in SEQ ID NO 147 in the region spanning positions
[0072] an HCV genomic sequence of HVC type
[0073] an HCV genomic sequence of HCV type
[0074] an HCV genomic sequence of the BR
[0075] an HCV genomic sequence of HCV type
[0076] In the present application, the E
[0077] Also within the present invention are new subtype
[0078] Finally the present invention also relates to a new subtype
[0079] Also included within the present invention are sequence variants of the polynucleic acids as selected from any of the nucleotide sequences as given in any of the above mentioned SEQ ID numbers, with said sequence variants containing either deletions and/or insertions of one or more nucleotides, mainly at the extremities of oligonucleotides (either
[0080] According to another embodiment, the present invention relates to a polynucleic acid composition as defined above, wherein said polynucleic acids correspond to a nucleotide sequence selected from any of the following HCV type
[0081] an HCV genomic sequence as having a homology of more than 85%, preferably more than 86%, most preferably more than 87% homology to any of the sequences as represented in SEQ ID NO 41, 43, 45, 47, 49, 51, 53 (PC sequences) or 151 (BE
[0082] an HCV genomic sequence as having a homology of more than 61%, preferably more than 63%, more preferably more than 65% homology, even more preferably more than 66% homology and most preferably more than 67% homology (f.i. 69 and 71%) to any of the sequences as represented in SEQ ID NO 41, 43, 45, 47, 49, 51, 53 (PC sequences), 153 or 155 (BE
[0083] an HCV genomic sequence having a homology of more than 76.5%, preferably of more than 77%, most preferably of more than 78% homology with any of the sequences as represented in SEQ ID NO 55, 57, 197 or 199 (PC sequences) in the region spanning positions
[0084] an HCV genomic sequence having a homology of more than 68%, preferably of more than 70%, most preferably of more than 72% homology with the sequence as represented in SEQ ID NO 157 (BE
[0085] an HCV genomic sequence having a homology of more than 57%, preferably more than 59%, most preferably more than 61% homology to any of the sequences as represented in SEQ ID NO 59 or 61 (PC sequences) in the region spanning positions
[0086] an HCV genomic sequence as having a homology of more than 93%, preferably more than 93.5%, most preferably more than 94% homology to any of the sequences as represented in SEQ ID NO 159 or 161 (BE
[0087] Preferentially the above-mentioned genomic HCV sequences depict sequences from the coding regions of all the above-mentioned sequences.
[0088] According to the nucleotide distance classification system (with said nucleotide distances being calculated as explained above), said sequences of said composition are selected from:
[0089] a nucleotide distance of less than 0.53, preferably less than 0.51, more preferably less than 0.49 for the E
[0090] a nucleotide distance of less than 0.3, preferably less than 0.28, more preferably of less than 0.26 for the Core region to the type
[0091] a nucleotide distance of less than 0.072, preferably less than 0.071, more preferably less than 0.070 for the NS
[0092] Isolates with similar sequences in the
[0093] Also included within the present invention are sequence variants of the polynucleic acids as selected from any of the nucleotide sequences as given in any of the above given SEQ ID numbers with said sequence variants containing either deletion and/or insertions of one or more nucleotides, mainly at the extremities of oligonucleotides (either
[0094] Another group of isolates including BU