- Boundary types are now automatically identified for each comparison (fixes a bug in multiple-boundary segmentation comparison where, by default, only boundary type 1 was compared)
- Fixed a bug where, during agreement computation, an extra boundary was being counted and only the boundary-mass format was properly supported (oops)
- If a hypothesis or reference argument in mass format is given as a single number (e.g., (20) instead of the proper (20, )), the argument is automatically converted into a tuple
- Fixed a bug where long mass sequences would not work for window metrics
- Removed duplicate unit test
- Fixed a bug where agreement could not be computed on boundary-set-format segmentations and segmentations containing multiple boundary types
- Added Python 2.6, 3.2, and 3.3 support
- Added support for converting NLTK-style segmentations into segmentation masses
- Added convert_nltk_to_masses
- Added
- Fixed minor comment/documentation typos
- Fixed an issue with importing the Dataset object that also made it impossible to build the docs
- Fixed an issue with subpackages not being included in the distribution
- Fixed a README typo
- Updated this history
- Fixed pep8 and flake errors
- Increased branch code coverage to 100%
- Added coveralls support for builds
- Corrected documentation and added examples
- Happy Canada Day!
- Fixed a bug with the 'minus_one' keyword argument
- Improved code coverage
- Re-created to make APIs easier to use
- Implemented boundary similarity
- Inter-coder coefficient values are now calculated only over items coded by all coders (i.e., fully coded); where coders do not code all items, the items are divided into groups that have each been fully coded
- Micro and macro averages are available, with macro averages indicating the standard error and number of items averaged
- Added support for an authoritative reference coder (see the Segmentation Representation Specification Version 1.1 PDF) and support for S-based precision, recall, and F-beta measure
- Modified the input JSON files to allow for an entire dataset to be contained within a single file (see the Segmentation Representation Specification Version 1.1 PDF)
- Added additional unit tests
- Fixed a distribution issue
- Added CLI and prepared for presentation at NAACL-HLT 2012
- Updated implementation and tests in preparation for camera-ready submission to NAACL
- Updated implementation and tests in response to feedback from discussions at uOttawa
- Birth of a NAACL paper!
- Curiosity
- Inception
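
The single-number mass conversion mentioned in the history above exists because, in Python, (20) is just the integer 20 in parentheses, while (20, ) is a one-element tuple. The sketch below illustrates that pitfall and one way such a coercion could be written. The helper name as_mass_tuple is hypothetical and is not part of the package's API; the package's own conversion may differ.

```python
def as_mass_tuple(masses):
    """Hypothetical helper (an assumption, not the package's API).

    Coerce a bare integer into a one-element tuple so that a caller who
    writes (20) instead of (20, ) still supplies a valid mass sequence.
    """
    if isinstance(masses, int):
        # A lone number is treated as a single segment of that mass.
        return (masses,)
    # Any other iterable of masses is normalised to a tuple.
    return tuple(masses)


# Parentheses alone do not make a tuple; the trailing comma does.
print(type((20)).__name__)       # int
print(type((20, )).__name__)     # tuple

print(as_mass_tuple((20)))       # (20,)
print(as_mass_tuple((2, 3, 5)))  # (2, 3, 5)
```

Normalising at the boundary of the API keeps the rest of the code free to assume that masses are always a sequence.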