🎉 wizard: anomalist (v2) #3388

Merged 36 commits on Oct 21, 2024

Commits on Oct 9, 2024

  1. 1835b2a
  2. show anomalist

    lucasrodes committed Oct 9, 2024
    b382d6c

Commits on Oct 10, 2024

  1. b3338d8
  2. 🎉 anomalist: CLI for creating anomalies (#3381)

    * 🎉 Add CLI for running anomaly detectors
    
    * merge with wizard-anomalist, pass ci/cd
    
    ---------
    
    Co-authored-by: lucasrodes <[email protected]>
    Marigold and lucasrodes authored Oct 10, 2024
    9a4447b
  3. ✨ indicator-upgrader: store variable mapping (#3389)

    * Start a new staging server for branch 'variable-mapping'
    
    * add to_sql
    
    * define sqlite db name in variable
    
    * new methods to store variable mapping
    
    * force int if possible
    
    * fix infinite loop
    
    * save variable mapping
    
    * minor ui tweak
    
    * add undo capabilities
    
    * store var mapping
    lucasrodes authored Oct 10, 2024
    d09cdbb
  4. ✨ wizard: anomalist ui (#3394)

    * ✨ wizard: anomalist ui
    
    * rename file
    
    * rename + tweak UI
    
    * function to get variable uris from indicator list
    
    * tweak config
    
    * minor fixes
    
    * demo
    
    * org: folder for app
    
    * ci/cd
    lucasrodes authored Oct 10, 2024
    925936e

Commits on Oct 11, 2024

  1. use large blob for dfScore

    Marigold committed Oct 11, 2024
    4e6aaf5
  2. 🎉 anomalist: Detect new datasets automatically (#3396)

    * 🎉 anomalist: Detect new datasets automatically
    
    * Add temporary duplicates of the energy and electricity mix datasets for testing purposes
    
    * Add another temporary step
    
    * Move common function to detect new datasets to utils cached
    
    * Fix wrong mapping of dataset ids in indicator upgrader
    
    * Edit dag and energy steps to be able to play around with mappings and anomalies
    
    * Improve map_datasets
    
    * Let anomalist detect new datasets and list them
    
    * Cache inputs
    
    * remove redundant code
    
    ---------
    
    Co-authored-by: lucasrodes <[email protected]>
    pabloarosado and lucasrodes authored Oct 11, 2024
    6333bf4
  3. c224fb7
  4. 🎉 anomalist: Detect abrupt changes in consecutive versions of an indicator (#3368)
    
    * ✨ wizard: anomalies
    
    * wip
    
    * bump streamlit
    
    * wip
    
    * wip: chart
    
    * wip
    
    * todo
    
    * plot indicator
    
    * re-structure
    
    * wip: loading indicators
    
    * fix API grapher_chart
    
    * deprecate chart_html
    
    * chart_html -> grapher_chart
    
    * clean
    
    * feature: Detect abrupt changes in consecutive versions of an indicator
    
    * Improve compare_tables
    
    * Add new BARD score and improve compare_tables
    
    * ci/cd
    
    * wip
    
    * wip
    
    * changed module name
    
    * custom components module
    
    * add methods to get uris
    
    * get dataset uris
    
    * update import
    
    * update gpt pricing
    
    * update import
    
    * wip
    
    * provide entity-context for anomaly
    
    * wip: anomalist v2
    
    * Implement detection of different kinds of anomaly types
    
    * Rename script
    
    * Rename script
    
    * Rename script
    
    * Create a class AnomalyDetector, simplify code
    
    * Improve scores dataframe
    
    * Rename score column
    
    * wip
    
    * wip
    
    * Improve detection of abrupt changes in time series
    
    * Add population score
    
    * Create function to get views for a list of variables
    
    * Add analytics score
    
    * Improve anomaly aggregation
    
    * Align with master
    
    * Align with master
    
    * Fix minor bug
    
    * minor cleaning
    
    * map entities only if explicitly asked
    
    * reduce re-implemented functions
    
    * avoid usage of get_connection
    
    * Ignore formatting issues
    
    ---------
    
    Co-authored-by: lucasrodes <[email protected]>
    pabloarosado and lucasrodes authored Oct 11, 2024
    4ed14b3
  5. 5f3e9b6
  6. a8d77ea
  7. 🎉 anomalist: Improve anomalist CLI (#3399)

    * 🎉 anomalist: Improve anomalist CLI
    
    * Allow for multiple anomalies, datasets and variable ids
    
    * Fix small issues and let data loading use maximum number of workers
    pabloarosado authored Oct 11, 2024
    b50c1a1

Commits on Oct 13, 2024

  1. f9f5668

Commits on Oct 14, 2024

  1. a552776
  2. ✨ anomalist: ui flow (#3402)

    * ✨ anomalist: ui flow
    
    * wip
    
    * wip
    
    * enable multiple indicator plot
    
    * allow full entity mapping load
    
    * bugfix
    
    * polish demo
    
    * ci/cd
    lucasrodes authored Oct 14, 2024
    991d295

Commits on Oct 15, 2024

  1. 🎉 anomalist: Improve Anomalist backend (#3405)

    * 🎉 anomalist: Improve Anomalist backend
    
    * Improve types of anomaly_detection and cli
    
    * Minor refactor and removing useless todo
    
    * Move anomaly detection to a separate module
    
    * Prevent Anomaly from failing if table already exists
    
    * Big refactor to be able to add version change anomalies
    
    * Rename anomalies
    
    * Move detectors to a separate module
    
    * Use entity_name instead of entity_id
    
    * Convert to long format afterwards
    
    * Pass data explicitly to generate scores df
    pabloarosado authored Oct 15, 2024
    47ac33e
  2. ✨ wizard: improve app flow (#3407)

    * ✨ wizard: improve app flow
    
    * add option to drop table when creating
    
    * adapt to new api
    
    * new function to create tables in anomalist
    
    * improve comments
    
    * checkfirst flag when creating table
    
    * re-order code
    
    * bug fixes in app flow
    
    * improve pagination ui
    
    * tweak internal grapher_chart flow
    
    * entity selection
    
    * module for chart configs
    
    * adjust for indicator upgrades
    
    * enable re-scan
    
    * help text, anomaly types, upgrade anomalies
    lucasrodes authored Oct 15, 2024
    8a6569b
  3. 09184fc

Commits on Oct 16, 2024

  1. ✨ Integrate GP outlier detector (#3411)

    * ✨ Add GP outlier detector
    
    * drop anomalies with zero values
    Marigold authored Oct 16, 2024
    42bab09
  2. ✨ anomalist: stop using mock data (#3410)

    * ✨ anomalist: stop using mock
    
    * style
    
    * ✨ anomalist: stop using mock data
    
    * re-order mock data
    
    * replace mock data with real data
    
    * discard df if all-zero
    lucasrodes authored Oct 16, 2024
    ad5aa7e
  3. 🐛 anomalist: Fix unknown variable ids (#3413)

    * 🐛 anomalist: Fix unknown variable ids
    
    * Fix missing variable ids when detecting anomalies in multiple datasets
    
    * Update misleading comment
    pabloarosado authored Oct 16, 2024
    2d98d3f
  4. a163c35
  5. fix type

    lucasrodes committed Oct 16, 2024
    786f81a
  6. aab0dc2
  7. ✨ anomalist: GP support, refactor functions, add dfReduced (#3416)

    * ✨ anomalist: nits
    
    * abstract df parsing logic
    
    * add GP outlier
    
    * add dfReduced to table
    
    * reset index
    
    * incorporate GP
    
    * re-arrange functions, add link to indicator
    
    * stop reducing dfScore
    lucasrodes authored Oct 16, 2024
    fe3027b
  8. 🎉 anomalist: Add population and analytics scores (#3412)

    * ✨ anomalist: stop using mock
    
    * style
    
    * ✨ anomalist: stop using mock data
    
    * re-order mock data
    
    * replace mock data with real data
    
    * 🎉 anomalist: Add population and analytics scores
    
    * Store scores with all years and combine them on app
    
    * Add anomaly and population score, as well as weighted score
    
    * Move get_scores to utils
    
    ---------
    
    Co-authored-by: lucasrodes <[email protected]>
    pabloarosado and lucasrodes authored Oct 16, 2024
    000d0c5
  9. ✨ anomalist: test llms for summary (#3417)

    * ✨ anomalist: nits
    
    * abstract df parsing logic
    
    * add GP outlier
    
    * add dfReduced to table
    
    * reset index
    
    * incorporate GP
    
    * re-arrange functions, add link to indicator
    
    * ✨ anomalist: test llms for summary
    
    * stop reducing dfScore
    
    * wip
    
    * wip
    
    * llm summary button
    
    * add function to get variables from DB
    
    * tag: icon is optional
    
    * AI summary
    lucasrodes authored Oct 16, 2024
    f5b58a9
  10. ✨ Add max_time and n_jobs to GP (#3422)

    * ✨ Add max_time and n_jobs to gp_outlier
    Marigold authored Oct 16, 2024
    885a6b5

Commits on Oct 18, 2024

  1. 🐛 Fix anomalist bugs (#3427)

    * 🐛 Fix anomalist bugs
    Marigold authored Oct 18, 2024
    9e1caac
  2. 🎉 anomalist: Experiment with different anomaly detection methods (#3420)

    * 🎉 anomalist: Experiment with different anomaly detection methods
    
    * Improve script to visualize anomalies
    
    * Improve visualization of anomalies, and try different methods
    
    * Improve cli
    
    * Some refactoring
    
    * Add useful comment
    
    * ✨ anomalist: Improve automatic detection of new datasets (#3429)
    
    * ✨ anomalist: Improve automatic detection of new datasets
    
    * Create new functions to detect new datasets, and speed up anomalist
    
    * Infer variable mapping
    
    * Use inferred variable mapping in Anomalist
    
    * Move function to get datasets info
    pabloarosado authored Oct 18, 2024
    45ed79a

Commits on Oct 21, 2024

  1. ✨ Add anomalist to owidbot (#3431)

    * ✨ Add anomalist to owidbot
    Marigold authored Oct 21, 2024
    0476593
  2. 🐛 anomalist: Fix bug with unknown indicators and long loading time (#3434)
    
    * 🐛 anomalist: Fix bug with unknown indicators and long loading time
    
    * Stop storing dfScore, which takes a long time to load
    
    * Fix GP detecting anomalies on old variables (which is unnecessary)
    pabloarosado authored Oct 21, 2024
    8caeb49
  3. ✨ anomalist: Small improvement in Anomalist filters (#3437)

    * 🐛 anomalist: Fix bug with unknown indicators and long loading time
    
    * Stop storing dfScore, which takes a long time to load
    
    * Fix GP detecting anomalies on old variables (which is unnecessary)
    
    * ✨ anomalist: Small improvement in Anomalist filters
    
    * Show instead of hide detectors in filter
    pabloarosado authored Oct 21, 2024
    92adc85
  4. ✨ Persist filter values in the URL for Anomalist (#3441)

    * ✨ persist filter values in URL
    Marigold authored Oct 21, 2024
    f0c131c
  5. b00bf99