Skip to content

HIPC Dashboard pipeline v1.5.0

Latest
Compare
Choose a tag to compare
@zhouji2013 zhouji2013 released this 24 Apr 20:47
· 1 commit to master since this release

Changes in version 1.5.0 (Data)

  • Process curations generated by the Bob Hancock group, for the COVID-19 Literature
    Annotation Project, representing SARS-CoV-2 infection signatures. The following
    32 PMIDs were processed:
    32217835, 32307550, 32320677, 32514047, 32632085, 32650275, 32717743, 32788292,
    32794454, 32816392, 32826343, 32829467, 32941618, 32958614, 33013872, 33028979,
    33033248, 33060840, 33119547, 33187979, 33195414, 33208459, 33239683, 33391280,
    33425248, 33478949, 33546489, 33625796, 33785765, 33846275, 33937096, 34099652
  • Update the curation templates under docs\ with the addition of new values for
    some of the pick lists.

Changes in version 1.5.0 (Pipeline)

  • A large scope of code is cleaned up and re-organized.
    -- Especially, the main script is separated into four. This make the processing and output
    fully reproducible. Previously, various control parameters are changed manually
    to create all output.
    -- Many small errors and corner cases are fixed. Some of the issues showed up only with new data.
    -- Text encoding handling is corrected.
  • Move PMID lookup handling to separate routine to support different scenarios.
  • No longer accept space as a valid separator character for gene symbols.
  • Allow to complete cell-type runs even if some cell-types not mapped.
  • Modify the observation summary template for cell-type frequency signatures, to improve
    readability.