v0.1.10
What's Changed
- Add the simplest dataset.map() by @nsthorat in #769
- Add sampling to our ParquetSource by @dsmilkov in #773
- Add intelligent sampling in ParquetSource by @dsmilkov in #778
- Add include_labels and exclude_labels when exporting data via dataset.to_* by @dsmilkov in #768
- Improve the Export Dataset modal dialog by @dsmilkov in #775
- Allow searching by pre-computed concepts in the searchbox. by @nsthorat in #783
- Add spacy embedding-clustering splitter by @brilee in #784
Other Changes
- Add notebook exploring chunking algorithms by @brilee in #771
- Remove TextSplitterSignal, fixing related tests. by @brilee in #774
- Fix the concept labeler with the new preview concept key. by @nsthorat in #780
- Improve export preview text. by @nsthorat in #782
- Upgrade ruff version and fix associated new linter errors by @brilee in #777
New Contributors
Full Changelog: v0.1.9...v0.1.10