Skip to content

v0.1.8

Compare
Choose a tag to compare
@CharlieJCJ CharlieJCJ released this 15 Nov 01:08
· 403 commits to main since this release
c843106

🎯 Highlights of v0.1.8

Version 0.1.8 marks the initial release of Curator, introducing core functionalities for managing and processing LLM completions for synthetic data generation. This release establishes a foundation with two main components: a completions module for efficient batch processing with OpenAI models, and a dataset viewer for visualizing and managing completion results. Key features include batch processing support, configurable model parameters, streaming capabilities, and metadata management through SQLite integration. The release also prioritizes developer experience with Colab compatibility and robust documentation.

⚡ Completions Module

  • Reorganized prompting logic (#2, #3) and improved OpenAI integration (#4, #28)
  • Added configurable temperature and top-p parameters (#77)
  • Implemented batch size configuration (#70)
  • Added fallback token counting with tiktoken (#59)
  • Improved dataset management with List objects (#9)
  • Added configurable working directory support (#53)
  • Fixed Colab compatibility issues (#69, #72)
  • Enhanced request/response handling (#65)

🎨 Curator Viewer

  • Reorganized dataset viewer architecture (#8)
  • Added streaming dataset UI functionality (#14)
  • Implemented file streaming for batch mode (#79)
  • Added metadata SQLite database integration (#10)
  • Fixed compilation errors (#23)

📚 Documentation & Packaging

  • Added Apache 2.0 license (#12)
  • Improved documentation and README (#1, #26)
  • Properly packaged as bespokelabs-curator (#11)
  • Added repository logo (#63)
  • Updated API key documentation (#32)

What's Changed

New Contributors

Full Changelog: https://github.com/bespokelabsai/curator/commits/v0.1.8