All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Patched micromatch, path-to-regexp vulnerability
- Patched axios vulnerability
- Patched fast-xml-parser vulnerability
- Fixed security vulnerabilities
- Updated Python Lambda function runtime to Python 3.11
- Updated Node Lambda function runtime to NodeJS 20
- Patch security vulnerabilities
- Updated outdated libraries and dependencies
- Replaced deprecated url.parse with URL constructor
- Updated AppRegistry Resource logical Id creation logic to generate consistent Ids
- Updated QuickSight template to remove Twitter references
- Updated QuickSight template to fix issue with platform menu on Topic Analysis page
- Fixed connection timeout issue when uploading images on S3
- Fixed security vulnerabilities
- Migrated to AWS CDK v2
- Migrated to AWS SDK V3
- Updated Node Lambda runtime to Node 18
- Implemented NewsCatcher Locally instead of using deprecated library
- Reddit comments Ingestion - Migrated from npm snoowrap package to Python praw library. Subreddit comment ingestion lambda is now using Python runtime
- Security patches for npm packages
- Updated outdated libraries
- Operational metrics to include additional deployment attributes for Reddit ingestion and attributes indicating if particular ingestion type is enabled
- Youtube Search query when OR (|) expression is used in query parameter
- Reddit comment ingestion issue for highly active subreddits
- UrlLib issue in RssNewsFeed Ingestion Lambda
- Python NewCatcher library
- npm snoowrap package
- AWS SDK Node and Python libraries
s
- Update AWS-SDK V2 that addresses the security issue identified in xml2js
- Updated the bucket policy on the logging bucket to grant access to the logging service principal (logging.s3.amazonaws.com) for access log delivery.
- Update AWS CDK, Nodejs and Python library updates
- GitHub issue #75. This fix prevents AWS Service Catalog AppRegistry Application Name and Attribute Group Name from using a string that begins with
AWS
, since strings begining withAWS
are considered as reserved words by the AWS Service.
- Service Catalog AppRegistry resource to register the CloudFormation template and underlying resources as an application in both Service Catalog AppRegistry and AWS Systems Manager Application Manager
- Github issue #69. The fix required the ingestion infrastructure code in CDK to be pass the AWS CloudFormation parameter to be passed to the Reddit nested stack.
- CDK version 1.177.0
- AWS SDK verson 2.1223.0
- The capability to ingest Subreddit comments using the Reddit API
- CDK version 1.152.0
- AWS SDK version 2.1127.0
- Amazon QuickSight dashboard to support NLP analysis on Subreddit comments
- node and python module version dependencies
- The capability to ingest custom data by uploading files as JSON, XLSX, or CSV files
- Use Amazon Kinesis Data Firehose partition projection to store and partition the data by source date (instead of system processing date)
- Use Amazon Athena dynamic partitioning features to run SQL queries on data stored in S3 bucket
- AWS CDK version 1.137.0
- AWS SDK version 2.1067.0
- Creating of AWS Glue partitions (replaced with Amazon Athena dynamic partitions)
- GitHub issue #42. To fix the issue, RSS feed ingestion lambda function and SQLs related to the Amazon QuickSight dashboard were updated.
- AWS CDK version to 1.125.0
- AWS SDK version to 2.1008.0
- Capability to ingest YouTube comments
- AWS CDK version to 1.121.0
- AWS SDK version to 2.991.0
- Updated Amazon QuickSight analysis and dashboard to reflect the new ingestion source
- Ingest RSS feeds from over ~3000+ news websites across the world
- AWS CDK version to 1.110.1
- AWS SDK version to 2.945.0
- Updated Nodejs Lambda runtimes to use Nodejs 14.x
- Updated Amazon QuickSight analysis and dashboard to reflect the new ingestion source
- Updated AWS StepFunction workflows to handle parallel ingestion (tweets from Twitter and RSS feeds from News websites)
- Truncated tweets through merging GitHub pull request #26
- Capability to use geo coordinates when invoking the Twitter API to filter tweets returned by its Search API
- New visuals and sheets (tabs) on Amazon QuickSight to perform analysis using geo coordinates (when available with tweets)
- Additional remediation to handle throttling conditions from Twitter v1.1 API calls and push additional information to Amazon CloudWatch Logs that can be used to create alarms or notifications using CloudWatch Metric Filters
- Switched to AWS Managed KMS keys for AWS Glue Security Configuration
- AWS CDK version to 1.83.0
- AWS SDK version to 2.828.0
- Implementation to refactor and to reuse the following architecture patterns from AWS Solutions Constructs
- aws-kinesisfirehose-s3
- aws-kinesisstreams-lambda
- aws-lambda-step-function
- The join condition for Topic Modeling in Amazon QuickSight dataset to provide accurate topic identification for a specific run
- ID and name generation for Amazon QuickSigh resource to use dynamic value based on the stack name
- AWS CDK version to 1.73.0
- AWS SDK version to 2.790.0
- New and simplified interactive Amazon QuickSight dashboard that is now automatically generated through an AWS CloudFormation deployment and that customers can extend to suit their business case
- Updated to AWS CDK v1.69.0
- Consolidate Amazon S3 access Log bucket across the solution. All access log files have a prefix that corresponds to the bucket for which they are generated
- S3 storage for inference outputs to use Apache Parquet
- Add partitioning to AWS Glue tables
- Update to AWS CDK v1.63.0
- Update to AWS SDK v2.755.0
- Initial release