Protein biochemistry and structural biology
Reagents
Ubiquitin Rhodamine110 (UbRh, UbiQ Bio, UbiQ-002), isopropyl ß-D-1-thiogalactopyranoside (IPTG, Gold-Bio #I2481C100), β-mercaptoethanol (Sigma M3148), imidazole (Sigma 56749), lysozyme (Glentham Life Science GE8228)
DNase I (Roche 11284932001), tris(2-carboxyethyl)phosphine, (TCEP, GoldBio TCEP25), Bovine Serum Albumin (Sigma, A2153), Triton-X-100 (Sigma 9002-93-1)
L-Glutathione-Reduced (L-GSH, Sigma G4251), DMSO (Sigma-Aldrich, #472301-100 ML), doxycycline (dox, #D5207, Sigma Aldrich), tris(hydroxymethyl)aminomethane (Tris, Sigma Aldrich 9210-OP), L-glutathione (GSH, Sigma Aldrich 1294820), 4-(2-hydroxyethyl)piperazine-1-ethanesulfonic acid (HEPES Sigma Aldrich, H3375). Compound 5c was synthesised as previously58.
Molecular biology
Bacterial pOPIN-B expression vectors59 for SARS-CoV-2 PLpro (amino acids (aa) 1564-1878 of polyprotein 1ab, GenBank: QHD43415, with aa E1564 designated as residue 1, were reported previously18. SARS-CoV PLproWT (aa 1541-1855 of polyprotein 1ab, RefSeq: NP_828849.7), MERS-CoV PLproWT (aa 1482-1803 of polyprotein 1ab, RefSeq: YP_009047202), HKU1-CoV PLproWT (aa 1648-1958 of polyprotein 1ab, RefSeq: YP_009944268), OC43-CoV PLproWT (aa 1561-1872 of polyprotein 1ab, RefSeq: AY391777), 229E-CoV PL2proWT (aa 1599-1905 of polyprotein 1ab, RefSeq: NP_073549), NL63-CoV PL2proWT (aa 1578-1876 of polyprotein 1ab, RefSeq: YP_003766) were codon optimised for bacterial expression, synthesised (Integrated DNA Technologies) and cloned into pOPIN-B (pOPIN-S for OC43) digested with KpnI and HindIII using In-Fusion™ HD cloning (Takara Clontech). The SARS-CoV-2 PLpro BL2 mutant (SARS-CoV-2 PLproBL) was generated as described previously27. For SPR, constructs were ordered with an N-terminal AviTag™ (GLNDIFEAQKIEWHE) and cloned as above. A GSS or GSSG linker was placed preceding the 3 C cleavage site or protein CDS respectively. For crystallography, we matched a construct used previously27,34, which has a 1-aa shorter SARS-CoV-2 PLpro sequence (aa 1564-1878) preceded by a Ser-Asn-Ala sequence and includes a catalytic Cys111 mutation to Ser (SARS-CoV-2 PLproC111S). The coding sequence was cloned into pOPIN-S which features a His-SUMO-tag. SUMO protease (SENP1) was produced as per literature60.
Protein purification
All protein expression vectors were transformed into E. coli RosettaTM 2(DE3) competent cells (Novagen) and bacterial cells were grown in 2xYT medium at 37 °C. At OD600 = 0.8 the temperature was reduced to 18 °C and expression was induced with 0.3 mM IPTG. Cells were harvested 16 h post induction and stored at -80 °C until purification. For purification, cells were resuspended in lysis buffer/Buffer A (50 mM Tris pH 7.5, 500 mM NaCl, 5 mM b-ME, 10 mM imidazole) supplemented with lysozyme (2 mg/mL), DNaseI (100 µg/mL), MgCl2 (5 mM) and cOmplete EDTA-free protease inhibitor cocktail tablets (Roche) and lysed by sonication. Lysates were cleared by centrifugation at 40,000 g for 30 min at 4 °C. The clarified lysate was filtered through a 0.45 µM syringe filter and His-tagged proteins were captured using a HisTrap HP column (5 mL, Cytvia). The captured protein was washed with 10 CV of 30 mM imidazole wash buffer (Buffer A + 10% (v/v) Buffer B) and eluted using five column volumes of 100% Buffer B (Buffer A + 300 mM Imidazole). Pooled fractions were desalted into 100% Buffer A using a HiPrep 26/10 Desalting column (Cytiva) and then supplemented with His-3C or His-SENP1 protease for His-tag and His-SUMO tag cleavage respectively. Following overnight incubation at 4 °C, the cleaved His-tag, His-SUMO tag and His-tagged proteases were captured using a HisTrap HP column (5 mL, Cytvia). The extracted PLpro found in the flow-through was further purified by size exclusion chromatography using a HiLoad 16/600 Superdex 75 pg column (Cytiva) equilibrated with storage buffer (20 mM Tris pH 7.5, 150 mM NaCl, 1 mM TCEP).
For HTS, SARS-CoV-2 PLproWT was purified as above. For SPR storage buffer, 20 mM Tris pH 7.5 was replaced with 10 mM HEPES pH 7.5, for crystallisation, 150 mM NaCl was replaced with 50 mM NaCl. Protein samples were concentrated, and flash frozen in liquid nitrogen and stored at −80 °C.
PL1pro/PL2pro activity assay
Activity assays were performed as described previously18. In short, SARS-2-CoV PLpro activity was monitored in a fluorescence intensity assay using the substrate Ub-Rhodamine 110 (UbRh), that upon cleavage becomes fluorescent. The assay buffer contained 20 mM Tris (pH 8), 1 mM TCEP, 0.03% BSA (w/v) and 0.01% (v/v) Triton-X. Experiments were performed in 1536-well black non-binding plates (Greiner 782900) with a final reaction volume of 6 µL. SARS-2-CoV PLpro enzyme was added to the plates (50 nM or 5 nM) and incubated at ambient temperature for 10 min. UbRh (final concentration 100 nM) was added to start the reaction and incubated for 12 min (50 nM PLpro), or 2 h (5 nM PLpro), at room temperature. For endpoint assays the reaction was stopped by addition of citric acid (1 µL) at a final concentration of 10 mM. All additions were performed using the CERTUS FLEX (v2.0.1, Gyger). The reaction was monitored by an increase in fluorescence (excitation 485 nm and emission 520 nm) on a PHERAstar® (v5.41, BMG Labtech) using the FI 485 520 optic module. Data was normalised to 1% (v/v) DMSO (negative control, 0% inhibition) and 100 µM 5c58 (positive control, 100% inhibition).
High throughput screen
A total of 412,644 compounds (in-house library) were screened using the PLpro activity assay. Assay-ready plates were prepared at Compounds Australia. Compounds were dry spotted onto 1536-well non-binding black plates (Greiner 782900) to a final concentration of 29.17 µM in 2% (v/v) DMSO. Stock concentrations of compounds were either 10 mM or 5 mM. Reagents were dispensed using the CERTUS FLEX (v2.0.1, Gyger). Microplates were centrifuged using the Microplate Centrifuge (Agilent) and read on the PHERAstar® (v5.41, BMG Labtech) using the FI 485 520 optic module.
Data was normalised to 2% (v/v) DMSO (negative control, 0% inhibition) and 100 µM Compound 5c58 (positive control, 100% inhibition). Screen assay quality was monitored by calculation of robust Z’ by the following formula where (+) denotes the positive controls (low signal), (-) denotes the negative controls (high signal) and MAD is the median absolute deviation: robust Z’ = 1- (3*(MAD– + MAD+) / abs(median– – median+)) where MAD = 1.4826 * median(abs(x – median(x))). Plates were excluded from analysis if robust Z’ < 0.5. Hits were selected as >3* SD over the median of the negative control. For 36 plates, hits were selected as >1.5 SD over the median of the negative control due to an over dispense of DMSO (4% (v/v) DMSO final) during assay-ready plate preparation.
To determine the potency of the inhibitors, a series of 10-pt, 1:2 serial dilutions was performed from the highest starting concentration of 100 µM. The 10-point titration curves were fitted with a 4-parameter logistic nonlinear regression model and the IC50 reported is the inflection point of the curve. Data was analysed in TIBCO Spotfire® 7.11.2.
Counter screen
To confirm that the compounds were specifically inhibiting SARS-2-CoV PLpro rather than interfering with the fluorescence readout, human USP21 was used as the counter screen assay as previously described18. The same buffer, reagent dispenser and plate reader as in the PLpro assay was used. USP21 enzyme (final concentration 5 nM) was added to the plates and incubated at room temperature for 10 min. UbRh (final concentration 100 nM) was added to start the reaction and incubated for 2 min at room temperature. Reaction was stopped by the addition of citric acid (1 µL) at a final concentration of 10 mM. A series of 10-pt, 1:2 serial dilutions was performed from the highest starting concentration of 100 µM. The 10-point titration curves were fitted with a 4-parameter logistic nonlinear regression model and the IC50 reported is the inflection point of the curve. Data was analysed in TIBCO Spotfire® 7.11.2.
Comparative analysis of PLpro variants and human DUBs against inhibitors
Activity and inhibition of PL1pro and PL2pro from diverse viruses, as well as several human DUBs, were tested in PLpro activity assay as described, except that 1 mM GSH was used in place of 1 mM TCEP. The final concentration of UbRh was 100 nM, except for ATXN3 where it was adjusted to 2000 nM to account for low activity of the enzyme.
Experiments were performed in 384-well black non-binding plates (Greiner 784900 or Aurora ABA000000A) with a final reaction volume of 6 µL. A series of 10-pt, 1:3 serial dilutions was performed on test compounds using the Echo® Acoustic Dispenser (LabCyte) with the highest starting concentration of 100 µM of compounds. 5 µL of enzyme was added to the assay-ready plates and incubated for 10 min. UbRh was added to start the reaction and incubated for the required incubation time at room temperature. For endpoint assays, the reaction was stopped with the addition of citric acid (1 µL) at a final concentration of 10 mM. All reagents were dispensed using the MultidropTM Combi reagent dispenser (Thermo Fisher). Fluorescence was measured on a PHERAstar® (v5.41, BMG Labtech) using the FI 485 520 optic module. Data was normalised to 2% (v/v) DMSO (negative control, 0% inhibition) and 100% inhibition control (control compound was used where available, buffer excluding the enzyme was used if none were accessible).
Assay conditions were optimised to account for potent inhibitors WEHI-P4 and WEHI-P8. Here, PLpro enzyme (final concentration 5 nM) was added to the plates and incubated at room temperature for 10 min. UbRh (final concentration 100 nM, except for ATXN3 where 2000 nM was used) was added to start the reaction and incubated for 120 min before stopping the reaction. The assay conditions for each enzyme are as follows and in this format (enzyme: final concentration (nM) / reaction time (min)): SARS-CoV-2 PLpro: 50 or 5 nM, 12 or 120 min; SARS-CoV-2 PLproBL: 50 nM, 12 min; SARS-CoV-PLpro: 20 nM, 12 min; MERS-CoV PLpro: 10 nM, 12 min; HKU1 PLpro: 0.2 nM, 3 min; OC43 PLpro: 2.5 nM, 3 min; NL63 PL2pro: 0.025 nM, 10 min; 229E PL2pro: 0.1 nM, 30 min; USP21: 5 nM, 2 min; ATXN3: 50 nM, 2 min; Cezanne: 0.5 nM, 2 min; OTUD1: 10 nM, 2 min; USP10: 50 nM, 12 min; UCHL3: 0.005 nM, 2 min.
WEHI-P3 Specificity Assay (Ubiquigent)
WEHI-P3 was assayed using the commercial UbRh-based DUBprofiler™ drug discovery screening platform and results were analysed and provided by Ubiquigent (Dundee, Scotland). SARS-CoV-2 PLpro protein and compound WEHI-P3 were supplied to Ubiquigent.
Surface plasmon resonance
Experiments were performed on a BIAcore 8 K+ instrument (Cytiva, USA) PLpro proteins were diluted into HBS-P+ (see below) prior to immobilisation on a Sensor Chip SA (Cytiva, USA) by coupling. Compounds were spotted onto a Greiner 96-well U bottom plate (Item no. 650001) using the ECHO acoustic liquid dispenser from a 10 mM stock to desired concentrations and backfilled with DMSO to give a final DMSO of 2% (v/v). Compounds were further diluted into a dilution buffer consisting of 20 mM HEPES pH 7.4, 150 mM sodium chloride, 0.05% (v/v) P20 detergent (HBS-P + ) and 1 mM TCEP. Running buffer consisted of dilution buffer supplemented further with 2% (v/v) DMSO (HBS-P + , 2% (v/v) DMSO). Multi-cycle kinetics were performed with 270 sec associations and 1800 s dissociations with no further regeneration. Binding constants were determined in BIAcore insight evaluation (version 3.0.12) at equilibrium averaging response over 5 sec with a midpoint 5 sec before the end of the association phase. Final KD values were determined by averaging the values from two independent experiments.
SEC-MALS
Size-exclusion chromatography multi-angle light scattering (SEC–MALS) experiments were performed using a Superdex 200 Increase 10/300 GL column (Cytiva) coupled with DAWN HELEOS II light scattering detector and Optilab T-rEX refractive index detector (Wyatt Technology). The system was equilibrated in 50 mM Tris (pH 7.5), 50 mM NaCl, 2% (v/v) DMSO running buffer and calibrated using bovine serum albumin (2 mg/mL) before analysis of experimental samples. The system was then equilibrated again with running buffer where DMSO was replaced with 100 µM of the respective compound before each run. For each experiment, 50 µL of purified protein (2 mg/mL) was injected onto the column and eluted at a flow rate of 0.5 mL/min. Experimental data were collected and processed using ASTRA (Wyatt Technology, v.7.3.19)
FRET assay
A cellular assay to test compound efficacy was used as described30. For details on tissue culture and cell line verification see Part 3 below.
Briefly, we constructed a HEK293T cell line stably expressing a FRET biosensor composed of mClover3 donor and mRuby3 acceptor fluorophores separated by a linker containing PLpro cleavage motif TP5LP4KP3GP2GP1 ↓ AP-1PP-2TP-3KP-4VP-5. The cell line was then lentivirally transduced with PLpro coding sequencing embedded in a Tet-On expression vector, allowing PLpro expression to be controlled by the addition of dox. This cell line, expressing both the FRET bionsensor, and PLpro under dox control, was used for drug screening.
We prepared 7 titrations of compounds in 3-fold dilution and seeded 1.5 μL of each onto wells of a 96-well flat-bottom plate. We also seeded 1.5 μL DMSO as a no-treatment control. Next, 2.5 × 105 cells in 150 μL media with 300 ng/mL dox were added into those wells and incubated at 37˚C, 10% CO2 overnight. Additional wells were prepared with 1.5 μL DMSO and 2.5 × 105 cells in 150 μL media without dox as no dox control. Cells were detached and analysed by flow cytometry (WEHI FACS facility) to determine the FRET+ percentage.
We fit the dose-response curves in prism10 using Eq. 1 (below):
$${{{\rm{FRET}}}}\%= {{{\rm{Bottom}}}}+\frac{{{{\rm{Top}}}}-{{{\rm{Bottom}}}}}{\frac{{{{\rm{EC}}}}50\,}{[{{{\rm{inhibitor\; concentration}}}}]}+1}$$
(1)
FRET% from no treatment was assigned a concentration of 1 nM to capture the baseline without treatment. If complete inhibition was not observed from the top two concentrations, FRET% from no-dox treatment was assigned with a concentration of 0.1 mM and included for curve fitting. The reported EC50 was used to evaluate compound efficiency.
Crystallography
Crystallisation screening was performed at the CSIRO’s Collaborative Crystallisation Centre (C3) or the Monash Macromolecular Crystallisation Platform in Melbourne, Australia. SARS-CoV-2 PLpro complex crystals were generated by incubation of SARS-CoV-2 PLproC111S (13 mg/mL) with 0.44 mM inhibitor (0.88% (v/v) DMSO final), overnight at 4 °C and precipitate removed by centrifugation prior to dispensing.
Crystallisation of SARS-CoV-2 PLproC111S–WEHI-P1
Crystals grew from a reservoir containing 0.2 M sodium succinate, 0.1 M trisodium citrate-citric acid pH 5.72, 10% PEG 8 K (w/v) at 8 °C in a sitting-drop vapour-diffusion experiment (150 nL protein to 150 nL reservoir solution). Crystals were cryoprotected with reservoir solution supplemented with 17% (v/v) PEG400 and 0.44 mM inhibitor prior to flash freezing in liquid nitrogen.
Crystallisation of SARS-CoV-2 PLproC111S–WEHI-P2
Crystals grew from a reservoir containing 0.2 M sodium acetate, 0.1 M trisodium citrate-citric acid pH 5.4, 10% (w/v) PEG 8 K at 8 °C in a sitting-drop vapour-diffusion experiment (150 nL protein to 150 nL reservoir solution). Crystals were cryoprotected with reservoir solution supplemented with 17% (v/v) glycerol and 0.44 mM inhibitor prior to flash freezing in liquid nitrogen.
Crystallisation of SARS-CoV-2 PLproC111S–WEHI-P4
Crystals grew from a reservoir containing 0.3 M sodium malonate, 0.1 M tris pH 7.6, 6% (w/v) PGA-LM (poly-γ-glutamic acid low molecular weight polymer), 50 µM ZnCl2 at 4 °C in a sitting-drop vapour-diffusion experiment (150 nL protein to 150 nL reservoir solution). Crystals were cryoprotected with reservoir solution supplemented with 20% (v/v) glycerol and 0.44 mM inhibitor prior to flash freezing in liquid nitrogen.
Crystallisation of SARS-CoV-2 PLproC111S–WEHI-P24
Crystals grew from a reservoir containing 0.2 M lithium acetate, 0.1 M trisodium citrate-citric acid pH 5.97, 10% (w/v) PEG 8 K at 8 °C in a sitting-drop vapour-diffusion experiment (150 nL protein to 150 nL reservoir solution). Crystals were cryoprotected with reservoir solution supplemented with 17% (v/v) glycerol and 0.44 mM inhibitor prior to flash freezing in liquid nitrogen.
Data collection, phasing and refinement
Diffraction data were collected at the Australian Synchrotron (Australian Nuclear Science and Technology Organisation, ANSTO) beamline MX261 (wavelength: 0.9537 Å, temperature: 100 K). Datasets were either auto-processed at the synchrotron using XDS62, Aimless and Pointless63,64 or using similar methods in the ccp4i2 ‘Data reduction task’65. Datasets were solved by molecular replacement in Phaser66. In the case of WEHI-P4 and WEHI-P1, the apo structure of SARS-CoV-2 PLpro was used as a search model (PDB: 6WRH)34. For WEHI-P2 and WEHI-P24, WEHI-P1 with the ligand removed was used as a search model and its FreeR flags were used in the working dataset for refinement. Two rounds of simulated annealing were also conducted prior to model refinement to minimise model bias. Refinement and model building was performed in Phenix67 and Coot68. TLS parameters were set to one TLS group per chain where appropriate. Additional NCS refinement was utilised in each refinement cycle. Geometric restrains for compounds were generated by the GRADE web server (http://grade.globalphasing.org) or using Phenix eLBOW69. Models were validated using MolProbity70. Final Ramachandran statistics for; WEHI-P1 were 0.00% outliers, 2.21% allowed and 97.79% favoured; WEHI-P2 were 0.00% outliers, 1.37% allowed and 98.63% favoured; WEHI-P4 were 0.00% outliers, 2.93% allowed and 97.07% and WEHI-P24 were 0.00% outliers, 2.04% allowed and 97.96% favoured. Structural figures were generated using ChimeraX71. Data collection and refinement statistics can be found in Supplementary Table 1.
AlphaFold2
Alphafold237 was used to generate a model for OC43-CoV PLproWT (aa 1561-1872 of polyprotein 1ab, RefSeq: AY391777), 229E-CoV PL2proWT (aa 1599-1905 of polyprotein 1ab, RefSeq: NP_073549) and NL63-CoV PL2proWT (aa 1578-1876 of polyprotein 1ab, RefSeq: YP_003766). ColabFold72 (ver1.5.5) was used to output five predicted models (relaxed) of which the number one ranked model for each protein was used in this manuscript. Source code was downloaded and run on internal servers.
Molecular modelling and molecular dynamics simulations
Modelling was performed using the Schrödinger suite (Release 2024-2: Maestro, Schrödinger, LLC, New York, NY, 2024). The crystal structure of WEHI-P4 was used as the starting model. Prior to simulations all solvent molecules were removed and S111 mutated to the WT cysteine before the structure was prepared for simulations using the Protein Preparation Wizard73. The structure was prepared for molecular dynamics using the System Builder wizard with a TIP3P water model and an orthorhombic water box buffered to 15 Å in all directions. Sodium and Chloride Ions were placed to neutralise the model and ions added to a concentration of 0.15 M. Molecular dynamics simulations were performed using an OPLS474 forcefield in Desmond75. Initial simulations were run for 1.2 ns for 250 frames at a constant temperature and pressure of 300 K and 1.01325 bar (NPT ensemble), using the Relax model system before the simulation protocol. After the initial 1.2 ns simulation, 40 ns, 1000 frame simulations were performed on the relaxed model in triplicate. In the case of WEHI-P70, the WEHI-P4 crystal structure pose was edited to include the pyrazole moiety in place of the cyclohexanol, then the structure minimised using Prime76 prior to performing molecular dynamics simulations using the protocol previously described. For NL63 PL2pro an AlphaFold2 prediction37 was used (see above). The PL2pro AF2 model was prepared by modifying the apo Zinc Finger domain to coordinate a Zn2+ ion, and the blocking loop (residues 251-257) was removed. The WEHI-P4 crystal structure was aligned on the PL2pro compound binding site and the blocking loop conformation from WEHI-P4 merged into the PL2pro structure and mutated to NL63 sequence. The WEHI-P4 pose was merged into the PL2pro model, and the model was minimised in Prime76 prior to performing the simulation protocols described for PLpro above. Chosen frames from the 40 ns simulations represent a consensus conformation from the simulations.
PLpro sequence alignments
Annotated PLpro domains from Orf1ab (see Molecular Biology for accession codes) were extracted and a multiple sequence alignment (MSA) performed using CLUSTAL Omega (EMBL-EBI). The outputted sequence identity table and MSA appears in Supplementary Figs. 3 and 4 respectively. To generate the MSA Figure the alignment was inputted into ESPript 3.0 server77 and annotated in Adobe Illustrator CC 2024. Sequences were visually assessed using the MSA and predicted structured (AF2) to determine appropriately aligned residues.
Medicinal chemistry and DMPK studies
Details on medicinal chemistry including synthesis and compound characterisation used in this study can be found in the Supplementary Information.
Kinetic solubility
Kinetic solubility of compounds was determined based on a method described previously78. Test compounds prepared at 10 mg/mL in DMSO were diluted into buffer (pH 2.0 or pH 6.5) to give a 1% (v/v) final DMSO concentration. After standing for 30 min at ambient temperature, samples were analysed via nephelometry to determine a solubility range. The maximum value of the assay is 100 μg/mL and the minimum value is 1.6 μg/mL.
Partition co-efficient estimation and physicochemical properties
Partition coefficient values (LogD) were estimated at pH 7.4 by correlation of their chromatographic retention properties against the characteristics of a series of standard compounds with known partition coefficient values. The method employed gradient HPLC based on a previously published method79. Physicochemical properties for drug-likeness calculated using the ChemAxon for Excel software (ver. 20.21.0.768).
Microsome Stability
Mouse (lot #2210246) and human liver microsomes (lot # 1910096) were sourced from XenoTech LLC, Kansas City, KS. The microsomal stability assay was performed by incubating compounds (0.5 μM) with human or mouse liver microsomes (0.5 mg/mL), suspended in 0.1 M potassium phosphate buffer (pH 7.4) containing 3.3 mg/mL MgCl2 at 37 °C. The metabolic reaction was initiated by the addition of NADPH (to give 1.3 mM). Control samples in the absence of cofactor were also included. Samples were mixed and maintained at 37 °C using a microplate incubator (THERMOstar ®, BMG Labtech GmbH, Offenburg, Germany) and quenched at various time points over 60 min by the addition of MeCN containing an internal standard. Quenched samples were centrifuged, and the supernatant removed and analysed by LC/MS (Waters Xevo G2 QToF MS coupled to an Acquity UPLC) using a Supelco Ascentis Express RP C8 column (5 cm × 2.1 mm, 2.7 μm) and a mobile phase consisting of 0.05% (v/v) FA in H2O and 0.05% (v/v) FA in MeCN and mixed under gradient conditions. The flow rate was 0.4 mL/min and injection volume was 5 μL. The in vitro intrinsic clearance was calculated from the first-order degradation rate constant for substrate depletion.
Hepatocyte Stability
Mouse (lot # 2310051) and human cryopreserved hepatocytes (lot # 2310092) were sourced from XenoTech LLC, Kansas City, KS. The hepatocyte stability assay was performed on a plate shaker (900 rpm) placed in a humidified incubator set at 37 °C with 7.5% CO2 atmosphere and ~95% RH. Cryopreserved hepatocytes were suspended in protein-free Krebs-Henseleit buffer (KHB; pH 7.4) at a concentration of 0.5 million viable cells/mL. The hepatocyte cell viability was assessed using Trypan blue dye exclusion method. The metabolic reaction was initiated by addition of compounds (test or QC cocktail) to aliquots of hepatocyte suspension that were pre-equilibrated (for 10 min) at 37 °C and 7.5% CO2. At various time points over 240 min, samples were quenched by addition of MeCN containing an internal standard. Quenched samples were left on ice for approximately 15 min, centrifuged and the supernatant removed and analysed by tandem quadrupole-Time of Flight MS (Waters G2 QToF) with a mass range scan of 50-1200 Da. The in vitro intrinsic clearance (µL/min/106 cells) was calculated from the first order degradation rate constant.
Protein plasma binding
Mouse (CD1, pooled, mixed gender, Na Heparin as anticoagulant; lot # MSE433327) and human plasma (pooled, mixed gender, Na Heparin as anticoagulant; lot # HMN921520) was sourced from BioIVT, Hicksville, NY. Plasma protein binding was conducted by rapid equilibrium dialysis (RED) using a modification of a method published previously80. Briefly, plasma was spiked with compound, mixed, and aliquots taken to plasma. The remaining spiked plasma was equilibrated at 37 °C ( ~ 10 min) prior to adding to the RED inserts (300 µL per insert). Inserts (n = 4) were placed in a teflon holding plate and dialysed against protein-free buffer (500 µL per insert) at 37 °C on an orbital plate shaker (ThermoMixer C, Eppendorf; 800 rpm). To control the pH of the assay matrix, the dialysis was performed in an incubator under a humidified CO2-enriched atmosphere; the pH of post-dialysis plasma and dialysate was confirmed to be within pH 7.4 ± 0.1. At the end of the dialysis period, aliquots were taken from the donor and dialysate chambers to obtain measures of the total and free compound concentration, respectively. To allow quantification using a single calibration curve, each sample was mixed with an equivalent volume of the opposite medium (i.e. blank assay matrix for dialysate samples and blank dialysate medium for donor samples). The matrix-matched samples were stored at -80 °C until analysis by LC-MS. For the stability assessment, residual spiked plasma was incubated at 37 °C in parallel to the RED samples. Aliquots were taken at 3 and 6 h, mixed with an equivalent volume of blank dialysate medium, snap frozen on dry ice and at -80 °C until analysis by LC-MS.
Quantitation was performed following protein precipitation with MeCN (2 to 1 volume ratio relative to the matched matrix samples) and separation of the supernatant. Samples were analysed by mass spectrometry using a SCIEX Triple Quad 6500+ mass spectrometer coupled to an Echo module for sample ejection. Detection was by positive electrospray ionisation with multiple reaction monitoring. The carrier solvent was 0.1% (v/v) FA, 1 mM ammonium fluoride, 0.5 mM citric acid in 70% (v/v) MeCN/H2O with a 400 µL/min flow rate. Quantitation was by comparison of the response to calibration standards prepared in the same matrix and processed using the same method. Assay acceptance was based on the calibration range (2.5-2000 ng/mL) and accuracy and precision at low, mid and high concentrations.
Mouse exposure after oral dosing of 100 mg/kg
The systemic exposure of WEHI-P8 was studied in non-fasted male C57BL/6 mice weighing 20.5 – 22.6 g. Mice had access to food and water ad libitum throughout the pre- and post-dose sampling period. On the day of dosing, solid compound was dispersed in pre-mixed 0.5% (w/v) methylcellulose (Methocel A4M) in Milli-Q water using vortexing and sonication, creating a uniform fine white suspension with an apparent pH of 7.2. The bulk formulation was mixed by inverting the tube prior to drawing each dosing volume. Dosing was by oral gavage (10 mL/kg) and blood samples were collected up to 24 h (n = 2-3 mice per time point) with a maximum of three samples from each mouse via submandibular bleed (approximately 120 μL; conscious sampling). Blood was collected into polypropylene Eppendorf tubes containing heparin as anticoagulant, centrifuged immediately, supernatant plasma was removed, and stored at -80 °C until analysis by LC-MS. Just prior to analysis, proteins were precipitated using MeCN at a 1:3 volume ratio (plasma to MeCN) and samples were centrifuged and the supernatant injected onto the LC/MS system.
Processed samples were analysed using a Waters Xevo TQD coupled to a Waters Acquity UPLC with positive electrospray ionisation and multiple reaction monitoring. The column was a Supelco Ascentis Express RP C8 column (50 × 2.1 mm, 2.7 μm) maintained at 40 °C and the mobile phase was 0.05% (v/v) FA in H2O and 0.05% (v/v) FA in MeCN mixed by gradient elution from 15 to 75% (v/v) MeCN over 2 min with a flow rate of 0.8 mL/min. Quantitation was by comparison to calibration standards prepared in blank mouse plasma and processed as for the samples. Diazepam was included as an internal standard in both samples and calibration standards. The assay was validated for calibration range (1-5000 ng/mL), lower limit of quantitation (1 ng/mL), accuracy, (within ± 10%) and precision (relative standard deviation of <10%). Plasma concentration versus time data were analysed using non-compartmental methods.
CYP inhibition
The CYP inhibition assay was performed with human liver microsomes utilising a substrate-specific interaction approach which relies on the formation of a metabolite that is mediated by a specific CYP isoform. The assay conditions employed for each CYP isoform are based on that previously reported81. Phosphate buffer (0.1 M) was prepared by dissolving monobasic potassium phosphate (KH2PO4) and dibasic potassium phosphate (K2HPO4) in 500 mL deionised water and adjusting pH to 7.4. Magnesium chloride was added at 3.3 mM to prepare the final incubation buffer.
A suspension of human liver microsomes was prepared in the incubation buffer at the required protein concentration. Multiple concentrations of test compound and positive control inhibitors were incubated with human liver microsomes at 37 °C concomitantly with each substrate. The total organic solvent concentration was kept at 0.5% (v/v). The reactions were initiated by the addition of NADPH (final concentration 1.3 mM) and the samples were quenched by the addition of ice-cold acetonitrile containing internal standard (0.15 µg/mL of diazepam). Concentrations of the substrate-specific metabolites in quenched samples were determined by UPLC-MS relative to calibration standards prepared in quenched microsomal matrix. Control samples were included to assess whether the UPLC-MS assay of the specific metabolites was affected in the presence of each test compound (and potential metabolites). Positive control compounds for each CYP are outlined in the Supplementary Information.
Time dependent inhibition
The time-dependent CYP inhibition assay was performed with human liver microsomes utilising a substrate-specific interaction approach which relies on the formation of a metabolite that is mediated by CYP3A4/5. The assay conditions employed are based on that previously reported82.
hERG study
hERG binding assessment was carried out with WEHI-P8 in 10-pt dose IC50 mode at Reaction Biology (Malvern, PA) using a PredictorTM hERG Fluorescence Polarisation Assay83.
Studies in cells and in vivo
Reagents include antibodies against CD3 (1:500, Agilent A045201), MPO (1:1000, Agilent A039829), F4/80 (1:1000, WEHI in-house antibody) or SARS-CoV-2 nucleocapsid (1:4000, abcam ab271180) using the automated Omnis EnVision G2 template (Dako, Glostrup, Denmark). Chemical reagents include the Pgp inhibitor CP-100356 (Sigma Aldrich PZ0171), 4% paraformaldehyde (PFA) in PBS (Thermo J61899-AK) and 2-chloroacetamide (Sigma Aldrich C0267).
Tissue culture and cell line verification
HEK293T cells (FRET assay) were authenticated and sourced from CellBank Australia.
VERO cells were purchased from ATCC (clone CCL-81). Calu-3 and Vero (CCL-81) cells displayed expected cell morphologies and were sent for validation to Garvan Molecular Genetics facility (on 15 June 2020). Cell lines were screened on a monthly basis for mycoplasma contamination using the PlasmoTest kit (Invitrogen) as per the manufacturer’s instructions. All used cells were mycoplasma free.
Measurement of Calu-3 in vitro 50% tissue culture infectious dose (TCID50)
For infection assays Calu-3 cells were seeded in a volume of 100 μL DMEM F12 into tissue culture-treated flat-bottom 96-well plates (Falcon) at a density of 3.5 × 104 cells/well and incubated over night before infection and/or treatment at confluency. On day of infection and/or treatment cells were washed twice with serum-free DMEM medium and infected with SARS-CoV-2 clinical isolate VIC00184 and MOI of 0.1 in 25 μL of serum-free medium containing TPCK trypsin (0.5 μg/mL working concentration, ThermoFisher). Cells were cultured at 37 °C and 5% CO2 for 30 min. Cells were topped up with 150 µL of medium containing PLpro inhibitor compounds at indicated concentrations in 6 replicates per concentration. At 48 h post infection/treatment, 100 μL of supernatant was harvested from each well and kept frozen at -80 °C. For TCID50 assays, Vero cells were seeded in a volume of 100 µL DMEM medium into tissue culture-treated flat-bottom 96-well plates (Falcon) at a density of 1 × 104 cells/well and incubated overnight. The next day, Vero plates were washed twice with PBS and 125 µL of DMEM + 100 U/mL penicillin and 100 mg/mL streptomycin (serum free) + TPCK trypsin (0.5 µg/mL working conc) was added and kept at 37 °C, 5% CO2. Calu-3 cell supernatants were thawed and serial 1:7 dilutions were prepared in 96-well round bottom plates at 6 replicates per dilution. 25 µL of serially diluted calu-3 supernatant were added onto Vero cells and plates incubated for 4 days at 37 °C, 5% CO2 before measuring cytopathic effect under a light microscope. The TCID50 calculation was performed using the Spearman and Kärber method85.
Plaque assay
Plaque assay was adapted and performed based on protocols previously described86. Briefly, African green monkey kidney epithelial Vero cells, purchased from ATCC (clone CCL-81), were seeded in flat bottom 24-well plates (8 × 104 cells/well) and left to adhere overnight at 37°C/5% CO2. Cells were washed twice with PBS and transferred to serum-free DMEM containing TPCK trypsin (0.5 µg/mL working concentration). Cells were infected with 150 µL of SARS-CoV-2 clinical isolate VIC001 (TCID50 2.6 × 103/mL) and incubated at 37°C/5% for 30 min. Next, 150 µL of 1:2 serial dilutions of the hit compounds ranging from final concentrations of 5 µM to 0.0098 µM with or without 2 µM of the P-glycoprotein inhibitor CP100356 were transferred to the infected cells and incubated at 37°C/5% CO2 for 30 min. Cells were then overlayed with 1.5% (w/v) methylcellulose and 4% FCS (v/v) in DMEM and incubated at 37°C/5% CO2 for 4 days. At 4 dpi the overlay was removed, and cells were washed once with PBS before fixation with 4% paraformaldehyde (PFA) (v/v) in PBS for 40 min at room temperature. Wells were then stained with 0.2% crystal violet (w/v) in 20% methanol (v/v) for 10 min, then washed twice with MilliQ water and air dried before plaque counting and calculation of antiviral EC50 for each compound using four-parameter logistic regression using GraphPad Prism 8.0 (GraphPad Software Inc).
Ethics statement
In vivo efficacy and long COVID studies were performed at The Walter and Eliza Hall Institute of Medical Research (WEHI). Procedures and mouse strains were reviewed and approved by the WEHI Animal Ethics Committee (ethics number 2020.016 and 2024.006). Mouse exposure studies were conducted at Monash Institute using established procedures in accordance with the Australian Code of Practice for the Care and Use of Animals for Scientific Purposes, and were reviewed and approved by the Monash Institute of Pharmaceutical Sciences Animal Ethics Committee (ethics protocol number 26789). All animal experiments were conducted in accordance with the Prevention of Cruelty to Animals Act (1986) and the Australian National Health and Medical Research Council Code of Practice for the Care and Use of Animals for Scientific Purposes (1997).
Mice
Male or female WT C57BL/6 J mice were bred and maintained in the Specific Pathogen Free (SPF) Physical Containment Level 2 (PC2) Bioresources Facility at The Walter and Eliza Hall Institute of Medical Research (WEHI).
All procedures involving animals and live SARS-CoV-2 strains were conducted in an OGTR-approved Physical Containment Level 3 (PC3) facility at WEHI (Cert-3621). Mice were transferred to the PC3 laboratory for all SARS-CoV-2 infection experiments at least 4 days prior to the start of experiments. Animals were age- and sex-matched within experiments (both sexes were used). Experimental mice were housed in individually ventilated microisolator cages under level 3 biological containment conditions with a 12-h light/dark cycle and provided standard rodent chow and sterile acidified water ad libitum.
SARS-CoV-2 strains
SARS-CoV-2 VIC2089 clinical isolate (hCoV-19/Australia/VIC2089/2020) was obtained from the Victorian Infectious Disease Reference Laboratory (VIDRL). Viral passages were achieved by serial passage of VIC2089 through successive cohorts of young C57BL/6 J (WT) mice38. Briefly, mice were infected with SARS-CoV-2 clinical isolate intranasally. At 3 dpi, mice were euthanised and lungs were harvested and homogenised in a Bullet Blender (Next Advance Inc) in 1 mL Dulbecco’s modified Eagle’s medium (DMEM) (Gibco/ThermoFisher) containing steel homogenisation beads (Next Advance Inc). Samples were clarified by centrifugation at 10,000 x g for 5 min before intranasal delivery of 30 µL lung homogenate into a new cohort of naïve C57BL/6 J mice. This process was repeated a further 20 times to obtain the SARS-CoV-2 VIC2089 P21 isolate. Lung homogenates from all passages were stored at -80°C.
Infection of mice with SARS-CoV-2
Mice were anesthetised with methoxyflurane and inoculated intranasally with 30 μL SARS-CoV-2. Virus stocks were diluted in serum-free DMEM to a final concentration of 104 TCID50/mouse. After infection, animals were visually checked and weighed daily for a minimum of 10 days. Mice were euthanised at the indicated times post-infection by CO2 asphyxiation. For histological analysis, animals were euthanised by cervical dislocation. Lungs were collected and stored at -80°C in serum-free DMEM until further processing.
SARS-CoV-2 P21 infections were performed with animals of 3 different aged groups depending on experimental outcome: young = 6-8 week-old; adult = 9-12 week-old; aged > 6 month-old. All animals were monitored and weighed daily, for a minimum of 10 days post-infection. Animals older than 10 weeks infected with SARS-CoV-2 P21 may require euthanasia due to excessive weight loss or extreme signs of disease. Humane points include weight loss greater than 20% of initial weight, and signs of lack of grooming, decreased body condition score, sustained weight loss ( > 15% over 3 consecutive days), laboured breathing, lethargy and/or decreased mobility. For PASC cohorts, where animal survival beyond the acute infection phase is essential for experimental outcomes, supportive care measures were implemented to minimize losses. These included saline administration upon reaching 15% weight loss and providing mashed food to prevent further deterioration. For the analysis of PASC phenotypes, animals were euthanized between 1 and 3 months post-infection (mpi). The 1 mpi cohorts correspond to 30–45 days post-infection (dpi), while the 3 mpi cohorts correspond to 75–90 dpi.
In vivo antiviral treatment
C57BL/6 (WT) mice were treated with either vehicle (10% DMSO in corn oil), Paxlovid-like treatment (56 mg/kg nirmatrelvir (MedChemExpress, HY-138687) + 19 mg/kg ritonavir (MedChemExpress, HY-90001)), 100 mg/kg or 150 mg/kg WEHI-P8. Acute infection experiments were performed either with a post-infection regime (6, 24 and 48 h post-infection) or starting 2 h pre-infection, followed by 6 and 24 h post-infection. PASC experiments were performed following the pre-infection regime.
Measurement of lung viral loads via 50% tissue culture infectious dose (TCID50)
TCID50 was performed as previously described85. Briefly, African green monkey kidney epithelial Vero cells, purchased from ATCC (clone CCL-81), were seeded in flat bottom 96-well plates (1.75 × 104 cells/well) and left to adhere overnight at 37°C/5% CO2. Cells were washed twice with PBS and transferred to serum-free DMEM containing TPCK trypsin (0.5 µg/mL working concentration). Infected organs were defrosted, homogenised, clarified by centrifugation at 10,000 x g for 5 min at 4°C and supernatant was added to the first row of cells at a ratio of 1:7, followed by 9 rounds of 1:7 serial dilutions in the other rows. Cells were incubated at 37°C/5% CO2 for 4 days until virus-induced cytopathic effect (CPE) was scored. TCID50 was calculated using the Spearman & Kärber algorithm85.
Histological analysis and immunohistochemical staining
Organs were harvested and fixed in 4% paraformaldehyde (PFA) (v/v) for 24 h, followed by 70% ethanol (v/v) dehydration, paraffin embedding and sectioning. Slides were stained with either haematoxylin and eosin (H&E), or immunohistochemically with antibodies against CD3 (1:500, Agilent A045201), MPO (1:1000, Agilent A039829), F4/80 (1:1000, WEHI in-house antibody) or SARS-CoV-2 nucleocapsid (1:4000, abcam ab271180) using the automated Omnis EnVision G2 template (Dako, Glostrup, Denmark). Dewaxing was performed with Clearify Clearing Agent (Dako) and antigen retrieval with EnVision FLEX TRS, High pH (Dako) at 97 °C for 30 min. Primary antibodies were diluted in EnVision Flex Antibody Diluent (Dako) and incubated at 32 °C for 60 min. HRP-labelled secondary antibodies (Invitrogen, Waltham, USA) were applied at 32 °C for 30 min. Slides were counter-stained with Mayer Haematoxylin, dehydrated, cleared, and mounted with MM24 Mounting Medium (Surgipath-Leica, Buffalo Grove, IL, USA). Slides were scanned with an Aperio ScanScope AT slide scanner (Leica Microsystems, Wetzlar, Germany). An American board-certified pathologist (Smitha Rose Georgy) performed a qualitative analysis of H&E staining.
For Sirius Red staining, lung sections of 4 µm were dewaxed and rehydrated, followed by fixation in Bouin’s fixative with heat for 1 h. Weigert’s haematoxylin was used to stain nuclei before placing the sections into Picro-sirius red solution for 1 h, to stain for collagen. Stained sections were differentiated in acidified water, before dehydration, clearing and mounting with DPX. Images were acquired using an Olympus SlideView VS200 whole slide scanner, under 20x objective in brightfield mode.
Scoring of H&E stained intestine sections
Blinded histopathology scoring87 of hematoxylin and eosin (H&E)-stained lungs and intestine sections was performed. For lungs, areas of haemorrhage and inflammation were scored by a researcher blinded to the experimental groups. Scoring ranged from 0, indicating no observable pathology, to 5, representing extensive pathology affecting the majority of the lung tissue. For intestines, scores were recorded for the proximal, middle and distal colon, and proximal and distal small intestine. No epithelial damage was scored as 0, hyperproliferation of crypts was scored as 1, less than 50% crypt loss was scored as 2, more than 50% crypt loss was scored as 3, 100% crypt loss was scored as 4 and the presence of an ulcer was scored as 5. The presence of inflammatory cells in the mucosa, submucosa and muscle was scored separately. The presence of occasional inflammatory cells was scored as 0, increased mild numbers of inflammatory cells was scored as 1, moderate inflammatory cell presence was scored as 2 and severe inflammation was scored as 3. The scores of proximal and distal were summed to reveal the total histological score of small intestines, and for large intestines, the whole length was analysed, and the score is a sum of proximal, middle and distal sections. Average scores for an individual animal are presented +/- SEM.
Lung cytokine and chemokine analysis
Lungs were thawed, homogenised and clarified by centrifugation at 10,000 x g for 5 min at 4°C. Supernatants were pre-treated for 20 min with 1% Triton-X-100 (v/v) for viral deactivation and the Cytokine & Chemokine 26-Plex Mouse ProcartaPlex Panel 1 (EPX260-26088-901) was used as described in the manufacturer’s manual. 25 µL of clarified lung samples were diluted with 25 µL universal assay buffer, incubated with magnetic capture beads, washed, incubated with detection antibodies and SA-PE. Cytokines were recorded on a Luminex 200 Analyser (Luminex) and quantitated by comparison to a standard curve. Analysis was performed using R Studio.
Proteomics
Lungs of mock and infected animals were washed three times with ice-cold TBS, lysed in 2% sodium deoxycholate (SDC) (v/v) and 100 mM Tris-HCl (pH 8.5), and boiled immediately. After sonication, protein amounts were adjusted to 20 μg using the BCA protein assay kit. Samples were reduced with 10 mM (TCEP), alkylated with 40 mM 2-chloroacetamide, and digested with trypsin and lysC (1:50, enzyme/protein, w/w) overnight. Peptides were desalted using SDB-RPS-stage tips. Peptides were resolubilised in 5 μL 2% (v/v) acetonitrile (ACN) and 0.3% (v/v) trifluoroacetic acid (TFA) and 200 ng were injected into the mass spectrometer.
LC-MS
Samples were loaded onto a C18 fused silica column (inner diameter 75 µm, OD 360 µm × 15 cm length, 1.6 µm C18 beads) packed into an emitter tip (IonOpticks) using pressure-controlled loading with a maximum pressure of 1500 bar with the Neo Vanquish liquid chromatography system (Thermo Fisher Scientific) coupled to the MS (Orbitrap Astral, Thermo Fisher Scientific). Peptides were introduced onto the column with buffer A (0.1% FA) and 4% buffer B (80% ACN, 0.1% FA) followed by an increase of buffer B to 34% for 20 min, and 100% for 3 min at a flow rate of 400 nL/min.
A data-independent acquisition MS method was used in which one full scan (380–980 m/z, R = 240,000) at a target of 5 × 106 ions was first performed, followed by 300 windows with a resolution of 80,000 (at m/z 524) where precursor ions were fragmented with higher-energy collisional dissociation (collision energy 25%) and analysed with an AGC target of 8 × 104 ions and a maximum injection time of 3 ms in profile mode using positive polarity.
Novel-object recognition test (NORT)
NORT88 was performed to study object memory and preference for novelty. In short, mice were individually habituated to an empty testing arena (50 cm × 50 cm) for 10 min. On the next day, mice were placed in the same arena with two identical objects and allowed to freely roam for 10 min. A second trial was performed after an interval of 1 h in which mice were placed back into the testing area containing one of the familiar objects from trial 1 and one novel object. Mice were allowed to explore the testing arena for 5 min. The recognition index from trial 2 was calculated as a proportion of the time exploring the novel object over the total time spent exploring both objects.
Quantification and statistical analysis
Statistical analyses were performed using Prism v10.2.3 software (GraphPad Software, Inc.). Unpaired two-tailed t-tests were used for normally distributed data for comparisons between two independent groups. Data that violated the assumption of normality were transformed by generating log10 prior to statistical analysis. Bars in figures represent the mean or median ( ± SD or ±SEM) of normally or non-normally distributed datasets, respectively and as indicated in the Figure legends, and each symbol represents one mouse. Sample sizes (n), replicate numbers and significance can be found in the figures and figure legends.
Statistical analysis of cytokine data consisted of Wilcoxon rank sum test between group medians, with Bonferroni adjustment for multiple comparisons. Boxplots in figures depict the median and interquartile ranges. Loess smoothing was applied to the infection time course data, with the shaded area indicating 95% confidence intervals.
For proteome analysis, MS raw files were processed by the Spectronaut software version 1989. Mouse uniport FASTA databases (25,367 entries, 2021) were used as forward databases. Cysteine carbamidomethylation was included as fixed modification and N-terminal acetylation and methionine oxidations were included as variable modifications. The false discovery rate (FDR) and PEP cutoff were set to less than 1% at the peptide and protein levels and a minimum length of seven amino acids for peptides was specified. Enzyme specificity was set as C-terminal to arginine and lysine as expected using trypsin and LysC as proteases and a maximum of two missed cleavages. Statistical tests were performed with Perseus90. The 1D annotation-enrichment analysis detects whether expression values of proteins belonging to an enrichment term (here we used keywords, GOCC, GOMF, GOBP, and KEGG name) show a systematic enrichment or de-enrichment compared with the distribution of all expression values.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.