Skip to content

Files

Latest commit

eaf3bea · Aug 4, 2019

History

History

data-engineering

W266 Summer 2019 Final Group Project: Financial Domain Specific Word Embedding

  • Vinicio De Sola
  • Pri Nonis
  • Kevin Hanna

Documents:

  • Codes - List of publicly traded corporations (stored in data directory)
  • Sec10kFilings - Provides functionality to fetch annual 10-K's as-is extracted from EDGAR fetched from project's Cloud Storage.
  • Sec10k - Parsed 10-K's, broken in to documents along with meta data for the filing. Stored and Fetched from both project's BigQuery and Cloud Storage.