Glossary

Following is the glossary for the terms used in MICHA drug screening protocol. Highlighted fields can be automatically extracted by MICHA and users may not need to enter that information.


Samples (cell lines)

If data is focusing on cell lines, users have to enter following cell line information. Highlighted fields can be automatically extracted by MICHA and users do not need to enter.

  1. Name: Name of the cell line (e.g. MOLM-13, DU4475).
  2. Cellosaurus ID: ID for cell lines.
  3. Cell type: It shows the tissue type for cell line e.g. acute myeloid leukemia or breast cancer.
  4. Organism (BAO_0000551): Cell line specie e.g. Homo sapiens.

  5. Cell line modifications (BAO_0000238): Modification to actual cell line.


Compounds

Comprehensive Information on screened compounds. Most of this information is automatically extracted from public resources such as ChEMBL, DrugComb and ClinicalTrials. Users just need to upload Compound names and standard inchikeys. For example standard inchiKey for imatinib is: XDXDZDZNSLXDNA-TZNDIEGXSA-N. Highlighted fields can be automatically extracted by MICHA and users do not need to enter.

  1. Clinical Phase: This is the max clinical phase for compound that has been clinically tested. This is automatically extracted by MICHA using ChEMBL API.
  2. Links to clinical studies: Links to the ongoing or completed clinical studies.
  3. Cross referencing drug databases: Provide hyperlinks to other compound databases such as: Pubchem, ChEMBL, DTC, BindingDB, Drugbank, Zinc, emolecules, atlas, gtopDB, Chebi, PharmGKB, sure ChEMBL , Lincs.
  4. Molecule type: For example, Small molecular, antibody
  5. Physiochemical properties: Nearly 20 physiochemical properties are automatically extracted by MICHA such as: ALOGP, number hydrogen bond acceptors (HBA) and number of aromatic rings etc. Explanation of each of these properties is as follows:
    1. Mw freebase: Molecular weight of parent compound
    2. Mw freebase: Molecular weight of parent compound
    3. HBA: Number hydrogen bond acceptors
    4. HDB: Number hydrogen bond donors
    5. PSA: Polar surface area
    6. RTB: Number rotatable bonds
    7. Num lipinski ro5 violations: Indicates whether the compound passes the rule-of-three (mw < 300, logP < 3 etc)
    8. ACD most APKA: The most acidic pKa calculated using ACDlabs v12.01
    9. ACD most BPKA: The most basic pKa calculated using ACDlabs v12.01
    10. ACD LOGP: The calculated octanol/water partition coefficient using ACDlabs v12.01
    11. ACD LOGD: The calculated octanol/water distribution coefficient at pH7.4 using ACDlabs v12.01
    12. Full MWT: Molecular weight of the full compound including any salts
    13. AROMATIC_RINGS: Number of aromatic rings
    14. HEAVY_ATOMS: Number of heavy (non-hydrogen) atoms
    15. QED Weighted: Weighted quantitative estimate of drug likeness (as defined by Bickerton et al., Nature Chem 2012)
    16. MW monoisotopic: Monoisotopic parent molecular weight
    17. FULL Mol formula: Molecular formula for the full compound (including any salt)
    18. HBA lipinski: Number of hydrogen bond acceptors calculated according to Lipinski's original rules (i.e., N + O count))
    19. HBD lipinski: Number of hydrogen bond donors calculated according to Lipinski's original rules (i.e., NH + OH count)
  6. Disease indication: Disease indications which are currently tested or under trial. This can be automatically extracted based on inchiKey.
  7. 2D structure: Can be extracted automatically from PubChem.
  8. Primary target: Primary protein targets for the compounds can be automatically extracted, if available in public databases (ChEMBL, Drug Target Commons, PubChem and DrugBank).
  9. Off targets: Potent protein targets other than primary targets can also be automatically extracted from public databases such as: (ChEMBL, Drug Target Commons, PubChem and DrugBank).
  10. Concentration range: Users must have to provide concentration range (1-1000nM) or maximum concentration used for their experiments.
  11. Dilution steps: e.g. ten-fold, half-log
  12. Vehicle of compound: e.g. DMSO/H2O

Experiments

Following is the description about experimental terms used in MICHA.

  1. Medium (BAO_0000596): RPMI1640, 10% FBS, pen/strep, glutamine

  2. Plate type (BAO_0000508): Corning 384-well # 3764

  3. Surviving cells (%): Percentage of surviving cells.
  4. Volume/well: This information must be provide in ul.
  5. Time of treatment: Such as 72 hours
  6. Cell density (cells/well) (BAO_0000572): Number of cells per well e.g 3000.

  7. Method of dispensing: Such as Biotek Multiflo FX with RAD cassette (1 channel).
  8. Measurement type: Its shows the experimental data metric such as EC50, IC50, DSS etc
  9. Measurement value: This must be entered into Nano molar units.
  10. Assay format (BAO_0000019): An assay format is a conceptualization of assays based on the biological and/or chemical features of the experimental system. For example, assay formats include 1) biochemical, assays with purified protein, 2) cell-based, assays in whole cells, 3) cell-free, assays in cell derivatives, 4) organism-based, assays performed in an organism, 5) physiochemical, assays which measure physical or chemical properties, and 6) tissue-based, assays using tissue derived from a living organism. This information must be selected from drop down list in MICHA excel template.

    1. Biochemical (BAO_0000217): A biochemical assay format is an in vitro format used to measure the activity of a biological macromolecule, e.g. a purified protein or nucleic acid. It is most often a homogeneous assay, but can be heterogeneous if a solid phase, such as beads, is used to immobilize the macromolecule.

    2. Cell-based (BAO_0000219): A cell-based assay format involves the use of living eukaryotic cells and is a heterogeneous assay.

    3. Cell-free (BAO_0000366): A cell-free assay format originates from cells, but does not use intact, live cells. This format is distinct from biochemical assays. It is most often a homogeneous assay, but can be heterogeneous if a solid phase, such as beads, is used to immobilize the components.

    4. Organism-based (BAO_0000218): An organism-based assay format involves the use of a living organism and is a heterogeneous assay.

    5. Physiochemical (BAO_0000684,BAO_0010009): A physiochemical assay format involves the measurement of physical and chemical properties of perturbagens, namely aqueous solubility, octanol/water partition, or cell permeability models e.g. parallel artificial membrane permeability assay (PAMPA).

    6. Tissue based (BAO_0000221): A tissue-based assay format involves the use of a tissue derived from a living organism and is a heterogeneous assay type.

  11. Detection technology (BAO_0000035): A detection technology is the physical method or technique readout used to measure an effect caused by a perturbagen in an assay environment. This information must be selected from drop down list in MICHA excel template.

    1. Fluorescence (BAO_0000046): Fluorescence detection methods use the principles of fluorescence, whereby incident light excites a fluorophore that then emits light at lower energy and higher wavelength, typically in the visible portion of the UV-Visible spectrum.

    2. Fluorescence polarization (FP) (BAO_0000003): Fluorescence polarization (FP) measurements are based on the assessment of size-dependent rotational motions of species and used to measure binding interactions.

    3. Alpha Screen (BAO_0000130): Amplified Luminescent Proximity Homogeneous Assay Screen is a subtype of fluorescence detection technologies.

    4. Time Resolved Fluorescence (TRF) (BAO_0000004): Time Resolved Fluorescence (TRF) is a subtype of fluorescence detection technologies. One commercial name is DELFIA.

    5. Time -Resolved Fluorescence Energy Transfer (TR-FRET) (BAO_0000004): Time-Resolved Fluorescence Energy Transfer (TR-FRET) is a subtype of fluorescence detection technologies. Commercial names are LANCE and HTRF.

    6. Label-free technology (BAO_0000427): Label-free detection technologies measure binding interactions and cell-based reactions in the absence of conventional labels, e.g., fluorescent probes. Advantages include the ability to measure a) functional activity without modifying the binding partners with labels, b) binding interactions independent of functional activity, and c) cell-based assays without the need to engineer cell-lines to over-express given targets, such as GPCRs.

    7. Luminescence (BAO_0000045): Luminescence detection technologies make use of light emission that occurs from an electronically excited state reached by a physical, mechanical, or chemical mechanism.

    8. Microscopy (BAO_0000452): Microscopic detection technologies use microscopes to see objects that cannot be seen with an unaided eye.

    9. Quantitative PCR (qPCR) (BAO_0002089): Quantitative PCR, sometimes referred to as RT-PCR, detection technologies use DNA labeled protein in a binding assay and are detected by quantifying the amount of DNA by PCR. A commercial assay called KINOMEscan was specifically developed for screening purposes.

    10. Radiometry (BAO_0000657): Radiometry detection technologies use radioactive tracers. Examples of assays that use radiometry are filter assays and Scintillation Proximity Assay (SPA).

    11. Spectrophotometry (BAO_0000049): Spectrophotometry detection technologies measure the amount of light that a sample absorbs. A spectrophotometer operates by passing a beam of light through a sample and measuring the intensity of light reaching a detector.

    12. Thermal shift (BAO_0000058): Thermal shift detection technologies detect temperature shifts using a fluorescent dye that is sensitive to a protein environment. Upon heating, a protein unfolds and loses native conformation. Binding of a small molecule can often stabilize the protein conformation, resulting in a higher unfolding temperature. Endpoint mode of action:

  12. An assay endpoint mode of action refers to the effect of the perturbagen on the assay target, e.g. inhibition, activation, cytotoxicity etc. This information must be selected from drop down list in MICHA excel template.
    1. Activation: An activation endpoint action is the initiation, enhancement, or increase of a biological action, such as increasing the rate of an enzymatic reaction or positively regulating gene expression through an increased rate of transcription.
    2. Cytotoxicity: A cytotoxicity endpoint action is a toxic or cell death-inducing property of a perturbagen. Known cytotoxic agents mediate cell death commonly by inducing necrosis, apoptosis, or autophagy. Cytotoxicity can be detected by measuring various aspects of the death process, e.g. membrane permeability, ATP concentration, DNA fragmentation, etc.
    3. Growth inhibition: A growth inhibition endpoint action refers to growth reduction of a predefined stimulus. The unit of measure is always percent, when normalized to the dynamic range of the assay. Cell viability or proliferation can be measured in a variety of ways, e.g. 1) protein content by sulforhodamine B staining followed by absorbance measurement, 2) mitochondrial dehydrogenase activity by 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyl tetrasodium bromide, MTT staining, followed by absorbance measurement, 3) expression of proliferation-associated antigens by immunostaining for Ki-67, and 4) ATP content by using CellTiter-Glo reagent (Promega) followed by luminescence measurement.
    4. Inhibition: An inhibition endpoint action refers to reduction of a predefined stimulus.
    5. Inverse agonist: An inverse agonist endpoint action refers to a modulator that binds to the same receptor as an agonist, but induces a response opposite to that of an agonist.
  13. Assay test type: It shows the test type of assay such as In-vitro, In-vivo, Ex-vivo etc. This information must be selected from drop down list in MICHA excel template.