Skip to content

Latest commit

 

History

History
479 lines (345 loc) · 25.4 KB

target-identification.md

File metadata and controls

479 lines (345 loc) · 25.4 KB

Target Identification

Via the Target Identification panel users can collect identification information about targets of interest (genes), store this information and use it for their own research purposes.

Target identification panel can be opened via the VIEWS menu and includes sub-tabs:

  • Targets (opened by default) - sub-panel where user can manage targets
  • Identifications - sub-panel where user can view and manage target identification reports

NGB GUI

Manage targets

Targets management sub-tab contains the following objects:

  • Table of previously created (added) targets. For each target in that table, there are columns:
    • Name - target name. Name is displayed as a hyperlink. By click it, the target details form will be opened
    • Genes - genes/transcripts that are included in this target
    • Species - species that are included in this target
    • Diseases - diseases that are associated with the target
    • Products - products that are associated with the target
    • Owner - name of the user who created the target
    • Column that contains buttons to launch identification process for the selected target
  • Button to add a new target

Note: if there are more than 2 items in any column of the targets table, additional items will be hidden and only their summary number is shown. Click that number to view details, e.g.:
NGB GUI
NGB GUI

To sort the Targets table by the Name column - click the corresponding header. To invert the sorting order, click the header again, e.g.:
NGB GUI

Targets table supports filtering. To filter displayed targets - use one or more parameters to filter items in the table.
To enable filters - use the menu NGB GUI in the right upper corner of the panel. The approach is fully the same as described for the Variants panel:
NGB GUI

NGB GUI

NGB GUI

To clear all filters - use the menu NGB GUI in the right upper corner of the panel.

Target creation

To create a new target, there are following ways:

  • Via GUI:
    • Add a new target manually from the Targets sub-tab
    • Create a target from any found orthogroup (using the Homologs panel)
  • Via CLI

Creation from "Targets" sub-panel

To create a new target:

  1. Click the "Create target" button at the Targets sub-panel:
    NGB GUI
  2. The following form will appear:
    NGB GUI
    This form contains:
    • button to return to the targets table form
    • field to specify the name of the creating target
    • button to add a gene/transcript to the target
    • button to add an associated disease to the target
    • button to add an associated product to the target
    • button to save the creating target
  3. Specify the new target name.
  4. Click the button "Add gene" to add a gene/transcript to the target.
    A new row with empty fields will appear - for the specifying info for a gene - Gene ID, Gene Name, Tax ID (taxonomy ID of species), Species Name and Priority (priority of the adding gene in the current target's genes list):
    NGB GUI
  5. Specify all fields for a gene/transcript (Priority field is non-required, others - mandatory).
    There are abilities for specifying genes:
    • fully manual way when you shall specify the value in each field as a plain text
    • "autofilling" with automatically suggested values of genes using appearing ones during the input Gene ID/Gene Name - these values are being suggested from the existing genes database, e.g.:
      NGB GUI
      When the gene/gene ID is selected from the suggested list - all other fields (except Priority) will be autofilled in:
      NGB GUI
    • mix of the manual and "autofilling" ways
  6. Priority can be left empty or manually selected from the list:
    NGB GUI
  7. By the described way (steps 4-6), add desired number of genes/transcripts. Near each gene/transcript, there is a button to remove gene from the list, e.g.:
    NGB GUI
  8. Add disease(s) associated with the target (optionally).
    To add a disease, click the corresponding field and specify the disease manually. After the input, press Enter key, e.g.:
    NGB GUI
    You may add any count of associated diseases by the described way. To remove a disease from the list - use the cross button near the disease name.
  9. Add product(s) associated with the target (optionally).
    To add a product, click the corresponding field and specify the product manually. After the input, press Enter key, e.g.:
    NGB GUI
    You may add any count of associated products by the described way. To remove a product from the list - use the cross button near the product name.
  10. Once all desired values are added - click the Create button.
    Target will be saved, the Remove button will appear, the Save button will become disabled:
    NGB GUI
    Click the button NGB GUI to return to the Targets table. Just created target will appear in the list:
    NGB GUI

Creation from "Homologs" panel

To create a new target from the "Homologs" panel:

  1. Open the Homologs panel and any sub-tab with the found list of homologous orthogroups ("Homologene" or "Orthologs & Paralogs")
  2. Right-click any orthogroup from the search results list to view the context menu, e.g.:
    NGB GUI
    Select the item "Create target" from the context menu.
  3. The pre-filled form of the target creation will appear:
    NGB GUI
    Here, list of all genes from the selected orthogroup will be autofilled in.
  4. Specify the name for the creating target.
  5. Specify another missing info, if necessary.
  6. Otherwise, the procedure is the same as in the section above.

Edit/remove target

To edit previously created target:

  1. Click the target row/name in the Targets table, e.g.:
    NGB GUI
  2. The form similar to the target's creation form will be opened:
    NGB GUI

Here, you can edit the target (edit target name, add/edit/remove genes from the list, add/remove diseases and products) and then save changes or remove the target - by the corresponding buttons.

Launch identification

To launch a new identifying process:

  1. Find the target in the Targets table and click the button to launch identification process, e.g.:
    NGB GUI
  2. The pop-up will appear according to the selected target - to define parameters (species of interest and translational species):
    NGB GUI
  3. Here, the following items shall be specified:
    • Genes of interest - list of species of interest. Corresponds to species for which gene (target) the identification will be performed. Species can be selected only from ones that were added to the current target. Multiselect is supported
    • Translational genes - list of species to which the gene comparison shall be performed during the identification. Species can be selected only from ones that were added to the current target. Multiselect is supported
  4. Once all fields are specified, click the Identify button to launch the identification, e.g.:
    NGB GUI
    NGB GUI

Identification results

Identifications sub-tab contains identification results. Here, users can:

  • view summary results of the current/selected identification process
  • view the current/selected identification details
  • go up a level to view a list of previously identifications with the ability to open any of them

Once the new identifying process is finished, the Identifications sub-tab will be automatically opened with the identification results (report):
NGB GUI

Identification report form contains:

  • Button to go up a level to view identification reports' list form
  • Label with the target name
  • List of species of interest and list of translational species used for the identification
  • Button to store the report. Note: if this button is not clicked for a report - it will be removed once closed
  • Description section with short overview of the main target gene with link(s) to the source(s) from which the description is
  • Set of expandable sections - these sections contain detailed info of the target identification

Below, there are details by each expandable block.

Description block

This block contains summary description of the target gene(s) with the link(s) to the source(s) from which the description is constructed.
Description is being obtained for all species of interest and merged.

Example of view:
NGB GUI
NGB GUI

Known drugs block

This block contains info about drugs with investigational or approved indications targeting the current target gene(s) according to their curated mechanism of action.

When the block is collapsed - its header contains short info about the count of associated drugs to the target gene(s):
NGB GUI

When the block is expanded - it contains a table with known drugs that treat diseases associated with the current target:
NGB GUI

Information for this block is obtained from different sources (different databases).
Therefore information from each database can vary - switching between sources is being performed by the corresponding dropdown list in the right upper corner of the block:
NGB GUI

For Open Targets source (see the image above), table includes columns:

  • Target - gene/transcript name. Allows to filter the table by the specific gene from the target
  • Drug - drug name. Each drug is presented as a hyperlink to the corresponding page in Open Targets platform
  • drug details columns:
    • Type - drug type (antibody, cell, enzyme, molecule, protein, etc.)
    • Mechanism of action - drug action to the target (includes action type and target name)
    • Action type - drug action type
  • Disease - disease name treated by the drug. Each disease is a hyperlink to the corresponding page in Open Targets platform
  • clinical trials info columns:
    • Phase - current stage of a clinical trial studying a drug
    • Status - state of the current clinical trial phase
    • Source - hyperlink to the corresponding page of the clinical trial

For DGIdb source, table includes columns:

  • Target - gene/transcript name. Allows to filter the table by the specific gene from the target
  • Drug - drug name. Each drug is presented as a hyperlink to the corresponding page in DGIdb platform
  • Interaction claim source - source (knowledge base) from which the info about drug is received
  • Interaction types - type(s) of the drug interaction

NGB GUI


For PharmGKB source, table includes columns:

  • Target - gene/transcript name. Allows to filter the table by the specific gene from the target
  • Drug - drug name. Each drug is presented as a hyperlink to the corresponding page in PharmGKB platform
  • Source - source (knowledge base) from which the info about drug is received

NGB GUI

By default (no matter which source is selected), table is sorted by the Drug column.
Table supports sorting by any column. Click the column header to sort by this column. To invert the sorting order, click the header again.

Table supports filtering. To filter displayed drugs - specify the desired value(s) to the filter field(s) under headers row.

Associated diseases block

This block contains info about diseases associated with the current target gene(s).

When the block is collapsed - its header contains short info about the count of associated diseases to the target gene(s):
NGB GUI

When the block is expanded - it contains info about diseases associated with the target gene:
NGB GUI

Information for this block is obtained from different sources (different databases).
Therefore information from each database can vary - switching between sources is being performed by the corresponding dropdown list in the right upper corner of the block:
NGB GUI

For different source databases, there could be several views of data representation - each view can be selected by the special control (tab selector) inside the block, on the upper side.

Available views for Open Targets source:

Table view (default)

In this view, diseases associated with the target are presented as a table.
The table includes columns:

  • Target - gene/transcript name. Allows to filter the table by the specific gene from the target
  • Disease - disease name associated with the target. Each disease is the hyperlink to the corresponding page in Open Targets platform. Allows to filter the table by the specific disease associated with the target
  • Overall score - summary score of all target-disease association scores
  • other columns - specific target-disease association scores. For more details see Associations info page.

Table itself is colorized as a heatmap - according to the score value in each cell.

NGB GUI

By default, table is sorted descending by the Overall score column.
Table supports sorting by any column. Click the column header to sort by this column. To invert the sorting order, click the header again.

Bubbles view

This view provides a hierarchical representation of the associated diseases using encapsulated bubbles:

  • Each associated disease is presented as a bubble colored by its overall association score.
  • Big circles represent all the therapeutic areas (from the Open Targets platform). Such circles contain associated diseases bubbles inside.
  • As the same disease can belong to more than one therapeutic area, diseases bubbles can be duplicated.
  • There is the ability to select a gene from target genes for which the representation of the associated diseases will be displayed.

NGB GUI

There is the ability to adjust the view of bubbles using a minimum score filter in the upper side of the view (slider control).
By setting this score, only bubbles with overall score above it will be displayed at the diagram, e.g.:
NGB GUI

Hovering over any disease bubble, a tooltip appears with details:

  • associated disease name and ID

  • overall association score of this disease

  • hyperlink to the corresponding disease page in Open Targets platform

    NGB GUI

Graph view

This view shows the subset of the directed acyclic graph containing the therapeutic areas (from the Open Targets platform) on the left side and associated diseases for the current target on the right.
Disease nodes are colored by their overall association scores.
There is the ability to select a gene from target genes for which the representation of the associated diseases will be displayed.

NGB GUI

There is the ability to adjust the view of nodes using a minimum score filter in the upper side of the view (slider control).
By setting this score, only nodes with overall score above it will be displayed at the diagram, e.g.:
NGB GUI

Hovering over any disease node, a tooltip appears with details:

  • associated disease name and ID

  • overall association score of this disease

  • hyperlink to the corresponding disease page in Open Targets platform

    NGB GUI


For PharmGKB source, only Table view is available.
In this view, diseases associated with the target are presented as a table.
The table includes columns:

  • Target - gene/transcript name. Allows to filter the table by the specific gene from the target
  • Disease - disease name associated with the target. Each disease is the hyperlink to the corresponding page in PharmGKB platform. Allows to filter the table by the specific disease associated with the target

NGB GUI

Sequences block

This block contains list of sequence(s) corresponding to the target - DNA, RNA, proteins.

When the block is collapsed - its header contains short summary info about the number of found related sequences to the target gene(s):
NGB GUI

When the block is expanded - it contains list of sequences related to the specific target gene, e.g.:
NGB GUI

Switching between target genes is being performed by the corresponding dropdown list in the right upper corner of the block:
NGB GUI

In the top of the block, there is an info about DNA sequence of the gene (if it is found), e.g.:
NGB GUI

Such block includes:

  • gene DNA accession. If this gene is presented in any dataset of the current NGB deployment - accession is shown as a hyperlink, user may click such link - the gene track will be opened for that gene, e.g.:
    NGB GUI
  • reference to the corresponding NCBI page

List of transcripts related to the specific target gene is presented as a table:
NGB GUI

Such table includes columns:

  • Target - gene/transcript name
  • Transcript - transcript accession. Each transcript is the hyperlink to the corresponding page in NCBI platform
  • Length (nt) - transcript length (number of nucleotides in the sequence)
  • Protein - protein accession. Each protein is the hyperlink to the corresponding page in NCBI platform
  • Length (aa) - protein length (number of amino acid items in the sequence)
  • Protein name - name of the protein

Comparative genomics block

This block contains information about homologous genes for the target genes:

  • paralogs of the current target gene(s)/transcrition(s) in the species of interest
  • orthologs of the current target gene(s) in the translational species

Structure block

This block contains structures related to the target gene(s)/transcription(s) and allows to view found structure models.

When the block is collapsed - its header contains short summary info about the number of found structures related to the target gene(s):
NGB GUI

When the block is expanded - it contains list of structures related to the target:
NGB GUI

Information for this block is obtained from different sources (databases), local sources are also supported.
Switching between sources is being performed by the corresponding dropdown list in the right upper corner of the block:
NGB GUI

List of structures related to the target is presented as a table:
NGB GUI

Such table includes columns (set of columns may vary due to selected source):

  • ID - structure ID in a database. It is presented as a hyperlink to the corresponding page of the structure in the selected database (e.g., PDB)
  • Name - structure name
  • Method - method by which the structure is obtained
  • Source - source database of the structure
  • Resolution - a measure of the quality of the data (level of details presented in the diffraction pattern)
  • Chains - name of macromolecule's polymeric chain of the current gene/protein
  • Owner - owner user name of the structure file (only for local sources)

Table supports filtering by ID and Name columns. To filter by any of these columns - specify the filter text to the field under the header of the column, e.g.:
NGB GUI

To open the structure model in a viewer - click its row in the table, e.g.:
NGB GUI
Structure model is being opened in the 3D viewer.
This viewer is identical to the one using as NGB Molecular Viewer.

Bibliography block

This block contains references corresponding to publications, literature, articles that mentioned the target gene(s) in the species of interest.
Main source of these publications - PubMed.

When the block is collapsed - its header contains short info about the summary count of all publications mentioned the target gene(s):
NGB GUI

When the block is expanded - it contains list of references to publications mentioned the target gene(s).
Each reference contains:

  • publication title - as the hyperlink to the corresponding publication on PubMed
  • list of publication authors
  • publication source title and date

NGB GUI

There is the ability to generate a summary of the found bibliography using AI capabilities.

Note: this summary is generated using pretrained AI language model and its text generation functionality.
For the summary, only abstracts of the several "top" (most relevant) publications are used.
As the summary is generated by AI and this process is poorly controlled, the summary block can vary for each request for the same list of publications.

To generate a summary of the found bibliography - click the corresponding button in the upper side of the block:
NGB GUI

It takes some time to generate the summary. Once the summary is prepared - it will appear in the upper side of the block:
NGB GUI

You can (re)generate the summary in the described way as many times as you need.

Note: you can change the AI model that will be used for the Bibliography summary generation - select the model from the dropdown list of AI settings - see details in the section below.
Once the model is changed - the summary will be regenerated with that AI model, e.g.:
NGB GUI

Chat with AI

Additional ability, available for users at the identification results' form - AI chat.
There is a simple build-in chat like ChatGPT where user can try to find answer for medical or bioinformatics questions .

To open chat:

  1. Click the corresponding hyperlink in the right upper corner of the Identifications tab:
    NGB GUI
  2. Chat will be opened in the left bottom corner of the page:
    NGB GUI
  3. To start chat - just specify your question in the field and press Enter key, e.g.:
    NGB GUI
  4. The answer will appear in a few seconds:
    NGB GUI
  5. You may ask any count of questions here by the described way.

AI model settings

You can change the language model that will be used in chat - select the desired model from the dropdown list of AI settings in the right upper corner of the Identifications tab:
NGB GUI
NGB GUI

Notes:

  • not all models are supported for the chat
  • changing of the model here affects the process of the summary generation of the Bibliography

Also, you can configure each AI model used here:

  1. Open the dropdown list of AI models in the right upper corner of the Identifications tab.
  2. Click the Configure model item in the list:
    NGB GUI
  3. The pop-up with model's configuration will appear:
    NGB GUI
    Here, the following options are available:
    • Model - select the AI model for which settings will be configured.
    • Temperature - defines the degree of randomness in model responses. A higher temperature makes the model's answers more varied and creative. A lower temperature makes the answers more focused and consistent.
    • Max size - defines the maximum size of tokens that can be used for model responses. Refers to the limit on the number of words, characters, or chunks of text that the language model can handle at once.
  4. Once options are configured - click the OK button to save them and apply new config.