Releases: Aleph-Alpha/intelligence-layer-sdk

v1.2.0

15 May 07:26
843f36a

We did a major revamp of the ArgillaEvaluator to separate an AsyncEvaluator from the normal evaluation scenario.
This comes with easier-to-understand interfaces, more information in the EvaluationOverview and a simplified aggregation step for Argilla that no longer depends on specific Argilla types.
See the how-to for detailed information.

Breaking Changes

  • rename: AggregatedInstructComparison to AggregatedComparison
  • rename: InstructComparisonArgillaAggregationLogic to ComparisonAggregationLogic
  • remove: ArgillaAggregator - the regular aggregator now does the job
  • remove: ArgillaEvaluationRepository - ArgillaEvaluator now uses AsyncRepository, which extends the existing EvaluationRepository for the human-feedback use-case
  • ArgillaEvaluationLogic now uses to_record and from_record instead of do_evaluate. The signature of to_record stays the same. Field and Question are now defined in the logic instead of being passed to the ArgillaRepository
  • ArgillaEvaluator now takes the ArgillaClient as well as the workspace_id. It inherits from the abstract AsyncEvaluator and no longer has evaluate_runs and evaluate. Instead, it has submit and retrieve.
  • EvaluationOverview gets attributes end_date, successful_evaluation_count and failed_evaluation_count
    • rename: start is now called start_date and no longer optional
  • We refactored the internals of Evaluator. This is only relevant if you subclass it. Most of the typing and data handling has moved to EvaluatorBase

New Features

  • Add ComparisonEvaluation for the elo evaluation to abstract from the Argilla record
  • Add AsyncEvaluator for human-feedback evaluation. ArgillaEvaluator inherits from this
    • .submit pushes all evaluations to Argilla to label them
    • Add PartialEvaluationOverview to store the submission details.
    • .retrieve then collects all labelled records from Argilla and stores them in an AsyncRepository.
    • Add AsyncEvaluationRepository to store and retrieve PartialEvaluationOverview. Also added AsyncFileEvaluationRepository and AsyncInMemoryEvaluationRepository
  • Add EvaluatorBase and EvaluationLogicBase as base classes for both async and synchronous evaluation.
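
The submit/retrieve split described above can be illustrated with a small, self-contained sketch. It does not use the SDK: ToyAsyncEvaluator, FakeLabellingBackend, PartialOverview and all field names are hypothetical stand-ins, chosen only to show why submission details need to be stored between the two steps.

```python
from dataclasses import dataclass, field
from typing import Dict, List


# All classes below are hypothetical stand-ins, not the SDK's own types.
@dataclass
class PartialOverview:
    """Stores what was submitted so that a later retrieve can refer to it."""
    evaluation_id: str
    submitted_count: int


@dataclass
class FakeLabellingBackend:
    """Stands in for an external labelling tool such as Argilla."""
    records: Dict[str, List[dict]] = field(default_factory=dict)

    def push(self, evaluation_id: str, record: dict) -> None:
        self.records.setdefault(evaluation_id, []).append(record)

    def labelled(self, evaluation_id: str) -> List[dict]:
        # Only records that already carry a human label are returned.
        return [r for r in self.records.get(evaluation_id, []) if "label" in r]


class ToyAsyncEvaluator:
    def __init__(self, backend: FakeLabellingBackend) -> None:
        self._backend = backend
        self._partial_overviews: Dict[str, PartialOverview] = {}

    def submit(self, evaluation_id: str, outputs: List[str]) -> PartialOverview:
        # Step 1: push everything to the labelling tool and remember what was sent.
        for text in outputs:
            self._backend.push(evaluation_id, {"text": text})
        overview = PartialOverview(evaluation_id, len(outputs))
        self._partial_overviews[evaluation_id] = overview
        return overview

    def retrieve(self, evaluation_id: str) -> dict:
        # Step 2: collect whatever has been labelled so far.
        partial = self._partial_overviews[evaluation_id]
        labelled = self._backend.labelled(evaluation_id)
        return {
            "evaluation_id": evaluation_id,
            "labelled_count": len(labelled),
            "pending_count": partial.submitted_count - len(labelled),
        }


backend = FakeLabellingBackend()
evaluator = ToyAsyncEvaluator(backend)
partial = evaluator.submit("eval-1", ["answer A", "answer B"])
backend.records["eval-1"][0]["label"] = "good"  # simulate one human label
result = evaluator.retrieve("eval-1")
```

In this sketch, retrieve can be called at any later point, which is the reason the two steps are separate: human labelling may take arbitrarily long.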

Fixes

  • Improve description of using artifactory tokens for installation of IL
  • Change confusion_matrix in SingleLabelClassifyAggregationLogic such that it can be persisted in a file repository

Full Changelog: v1.1.0...v1.2.0

v1.1.0

06 May 09:06
11738b8

New Features

  • AlephAlphaModel now supports a context_size-property
  • Add new IncrementalEvaluator for easier addition of runs to existing evaluations without repeated evaluation.
    • Add IncrementalEvaluationLogic for use in IncrementalEvaluator
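
The idea of adding runs to an existing evaluation without repeating work can be sketched as follows. This is a minimal illustration only: ToyIncrementalEvaluator and its methods are hypothetical and do not reflect the actual IncrementalEvaluator interface.

```python
from typing import Callable, Dict


class ToyIncrementalEvaluator:
    """Hypothetical sketch: remembers results per run and skips known runs."""

    def __init__(self, evaluate_fn: Callable[[str], float]) -> None:
        self._evaluate_fn = evaluate_fn
        self._results: Dict[str, float] = {}
        self.evaluation_calls = 0  # counts how often evaluate_fn really ran

    def evaluate_runs(self, runs: Dict[str, str]) -> Dict[str, float]:
        for run_id, output in runs.items():
            if run_id in self._results:
                continue  # already evaluated earlier, do not repeat
            self._results[run_id] = self._evaluate_fn(output)
            self.evaluation_calls += 1
        return dict(self._results)


# Toy metric: the length of the output.
evaluator = ToyIncrementalEvaluator(lambda output: float(len(output)))
first = evaluator.evaluate_runs({"run-1": "abc"})
second = evaluator.evaluate_runs({"run-1": "abc", "run-2": "hello"})
```

The second call evaluates only run-2, while the returned result still covers both runs.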

Full Changelog: v1.0.0...v1.1.0

Addendum to the initial release:

  • The use-cases folder was renamed to examples.

Initial Release

30 Apr 13:56

Please see the readme for further details.

Thanks to all contributors and feedback!

v0.11.0

29 Apr 11:37
e5b3a8f

Breaking Changes

  • breaking_change: HuggingFaceDatasetRepository now has a parameter caching, which caches the examples of a dataset once loaded. This is True by default and drastically reduces network traffic. To keep the previous behavior, set it to False.
  • breaking_change: MultipleChunkRetrieverQa no longer takes an insert_chunk_size parameter; it takes an ExpandChunks task instead
  • breaking_change: the issue_classification_user_journey notebook moved to its own repository
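
The effect of the caching parameter mentioned above can be sketched with a toy repository. CachingDatasetRepository and fetch_examples are hypothetical names, and the sketch only assumes that examples are fetched once per dataset when caching is enabled; it is not the HuggingFaceDatasetRepository implementation.

```python
from typing import Callable, Dict, List


class CachingDatasetRepository:
    """Hypothetical sketch of a repository with an optional example cache."""

    def __init__(
        self, fetch_examples: Callable[[str], List[str]], caching: bool = True
    ) -> None:
        self._fetch_examples = fetch_examples
        self._caching = caching
        self._cache: Dict[str, List[str]] = {}

    def examples(self, dataset_id: str) -> List[str]:
        if not self._caching:
            return self._fetch_examples(dataset_id)  # always hits the remote
        if dataset_id not in self._cache:
            self._cache[dataset_id] = self._fetch_examples(dataset_id)
        return self._cache[dataset_id]


fetch_count = {"n": 0}


def fake_remote_fetch(dataset_id: str) -> List[str]:
    # Stands in for a network call; counts how often it is invoked.
    fetch_count["n"] += 1
    return [f"{dataset_id}-example-1", f"{dataset_id}-example-2"]


cached_repo = CachingDatasetRepository(fake_remote_fetch, caching=True)
cached_repo.examples("ds")
cached_repo.examples("ds")
fetches_with_cache = fetch_count["n"]

fetch_count["n"] = 0
uncached_repo = CachingDatasetRepository(fake_remote_fetch, caching=False)
uncached_repo.examples("ds")
uncached_repo.examples("ds")
fetches_without_cache = fetch_count["n"]
```

A cache like this can return stale data if the remote dataset changes after the first load, which is presumably why the change is listed as breaking.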

New Features

  • feature: Llama2InstructModel to support llama-2 models in Aleph Alpha API
  • feature: Llama3InstructModel to support llama-3 models in Aleph Alpha API
  • feature: ExpandChunks-task caches chunked documents by ID
  • feature: DocumentIndexClient now supports
    - create_index
    - index_configuration
    - assign_index_to_collection
    - delete_index_from_collection
    - list_assigned_index_names
  • feature: DocumentIndexRetriever now supports index_name
  • feature: Runner.run_dataset now has a configurable number of workers via max_workers and defaults to the previous value, which is 10.
  • feature: If a BusyError is raised during a complete, the LimitedConcurrencyClient will retry until max_retry_time is reached.
  • feature: FileTracer now accepts both a str and a Path as log_file_path
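
The retry behavior on BusyError can be sketched roughly as below. This is an illustration under stated assumptions: BusyError is redefined locally as a stand-in, retry_while_busy is a hypothetical helper, and the exponential backoff is an assumption, since the notes above only say that retries continue until max_retry_time is reached.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class BusyError(Exception):
    """Local stand-in for a 'service is busy' error."""


def retry_while_busy(
    call: Callable[[], T], max_retry_time: float, initial_delay: float = 0.01
) -> T:
    """Retry `call` on BusyError until `max_retry_time` seconds have elapsed."""
    deadline = time.monotonic() + max_retry_time
    delay = initial_delay
    while True:
        try:
            return call()
        except BusyError:
            # Give up if the next wait would exceed the retry budget.
            if time.monotonic() + delay > deadline:
                raise
            time.sleep(delay)
            delay *= 2  # assumed backoff strategy, not taken from the SDK


attempts = {"count": 0}


def flaky_complete() -> str:
    # Fails twice with BusyError, then succeeds.
    attempts["count"] += 1
    if attempts["count"] < 3:
        raise BusyError("service busy")
    return "done"


result = retry_while_busy(flaky_complete, max_retry_time=1.0)
```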

Fixes

  • refactor: rename index parameter in DocumentIndex.search() to index_name
  • fix: HuggingFaceRepository no longer is a dataset repository. This also means that HuggingFaceAggregationRepository no longer is a dataset repository.

Full Changelog: v0.10.0...v0.11.0

v0.10.0

22 Apr 15:24

Breaking Changes

  • breaking change: ExpandChunksOutput now returns ChunkWithStartEndIndices instead of TextChunk
  • breaking change: MultipleChunkRetrieverQa's AnswerSource now contains EnrichedChunk instead of just the TextChunk

Fixes

  • fix: ChunkWithIndices now additionally returns end_index
  • fix: DocumentPath and CollectionPath are now immutable

v0.9.1

18 Apr 12:46
c619a40

Breaking Changes

  • breaking change: MultipleChunkRetrieverQaOutput now returns sources and search_results

New Features

  • feature: ExpandChunks task takes a retriever and some search results to expand the chunks to the desired length

Fixes

  • fix: ExpectedSearchOutput has only relevant fields and supports generic document-ID rather than just str
  • fix: SearchEvaluationLogic explicitly compares documents by ids
  • fix: In RecursiveSummarize.do_run, num_generated_tokens is no longer uninitialized. See Issue 743.
  • fix: Reverted pydantic to 2.6.* because of FastAPI incompatibility.

Full Changelog: v0.9.0...v0.9.1

v0.9.0

16 Apr 07:32
8b6078a

Breaking Changes

  • breaking change: Renamed the field chunk of AnswerSource to search_result for multi chunk retriever qa.
  • breaking change: The implementation of the HuggingFace repository creation and deletion got moved to HuggingFaceRepository

New Features

  • feature: HuggingFaceDataset- & AggregationRepositories now have an explicit create_repository function.
  • feature: Add MultipleChunkRetrieverBasedQa, a task that performs better and faster on retriever-QA, especially with longer context models

Full Changelog: v0.8.2...v0.9.0

v0.8.2

11 Apr 12:05
b829383

New Features

  • feature: Add SearchEvaluationLogic and SearchAggregationLogic to evaluate Search-use-cases
  • feature: Trace viewer and IL python package are now deployed to artifactory

Fixes

  • Documentation
    • fix: Add missing link to issue_classification_user_journey notebook to the tutorials section of README.
    • fix: Confusion matrix in issue_classification_user_journey now has rounded numbers.

Full Changelog: v0.8.1...v0.8.2

v0.8.1

08 Apr 12:45

What's Changed

Fixes:

  • fix: Linting for release version

Full Changelog: v0.8.0...v0.8.1

v0.8.0

08 Apr 11:55

What's Changed

New Features

  • feature: Expose start and end index in DocumentChunk

  • feature: Add sorted_scores property to SingleLabelClassifyOutput.

  • feature: Error information is printed to the console on failed runs and evaluations.

  • feature: The stack trace of a failed run/evaluation is included in the FailedExampleRun/FailedExampleEvaluation object

  • feature: The Runner.run_dataset(..) and Evaluator.evaluate_run(..) have an optional flag abort_on_error to stop running/evaluating when an error occurs.

  • feature: Added Runner.failed_runs(..) and Evaluator.failed_evaluations(..) to retrieve all failed run / evaluation lineages

  • feature: Added .successful_example_outputs(..) and .failed_example_outputs(..) to RunRepository to match the evaluation repository

  • feature: Added optional argument to set an id when creating a Dataset via DatasetRepository.create_dataset(..)

  • feature: Traces now log exceptions using the ErrorValue type.

  • Documentation:

    • feature: Add info on how to run tests in VSCode
    • feature: Add issue_classification_user_journey notebook.
    • feature: Add documentation of newly added data retrieval methods (how_to_retrieve_data_for_analysis)
    • feature: Add documentation of release workflow
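
The error-handling features above (collecting failed examples with their stack trace, and optionally aborting on the first error) can be sketched in a self-contained way. run_all and FailedExample are hypothetical names for illustration; they are not Runner.run_dataset or FailedExampleRun.

```python
import traceback
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class FailedExample:
    """Hypothetical record of a failed example, including its stack trace."""
    example_id: str
    error_message: str
    stack_trace: str


def run_all(
    task: Callable[[int], int],
    examples: Iterable[Tuple[str, int]],
    abort_on_error: bool = False,
) -> Tuple[List[Tuple[str, int]], List[FailedExample]]:
    successes: List[Tuple[str, int]] = []
    failures: List[FailedExample] = []
    for example_id, value in examples:
        try:
            successes.append((example_id, task(value)))
        except Exception as error:
            if abort_on_error:
                raise  # stop at the first failing example
            # Otherwise keep going and record the failure for later inspection.
            failures.append(
                FailedExample(example_id, str(error), traceback.format_exc())
            )
    return successes, failures


examples = [("a", 2), ("b", 0), ("c", 5)]
successes, failures = run_all(lambda x: 10 // x, examples)
```

With abort_on_error=True, the same call raises the ZeroDivisionError from example "b" and example "c" is never processed.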

Fixes

  • fix: Fix version number in pyproject.toml in IL
  • fix: Fix instructions for installing IL via pip.

Full Changelog: v0.7.0...v0.8.0