Skip to content

Release v0.7.2

Latest

Choose a tag to compare

@github-actions github-actions released this 13 Dec 03:37
· 2 commits to main since this release

Release Notes - v0.7.2

New Features

ValidationRunner Integration Fix

  • Added loc property to RyanDataAddressError - Validation errors now include Pydantic-compatible field location information, fixing the "unknown" field name issue in ValidationRunner reports. Errors for ZIP code and state validation now correctly display StateName or ZipCode instead of unknown.

Enhanced Cleaning Metrics Tracking

  • New OperationType constants - Standard constants for categorizing transformation operations:

    • NORMALIZATION - Format standardization (abbreviations, ZIP formats)
    • FORMATTING - Whitespace, punctuation, case changes
    • EXPANSION - Abbreviation expansion (via libpostal)
    • CLEANING - Removal of invalid data
    • PARSING - Component extraction from raw input
  • New tracking methods in TransformationTracker:

    • track_case_normalization() - Detects case changes in street names and cities
    • track_street_type_changes() - Detects street type abbreviations (Street→St, Avenue→Ave)
    • track_direction_changes() - Detects directional abbreviations (North→N, Southeast→SE)
    • track_unit_type_changes() - Detects unit type abbreviations (Apartment→Apt, Suite→Ste)
    • track_punctuation_removal() - Detects period and punctuation removal
    • track_component_parsing() - Records what components were extracted from raw input
  • New constant mappings for detecting transformations:

    • STREET_TYPE_TO_ABBREV - 40+ street type mappings
    • DIRECTION_TO_ABBREV - 8 directional mappings
    • UNIT_TYPE_TO_ABBREV - 18 unit type mappings

Bug Fixes

  • Fixed validation report displaying "unknown" as the field name for ZIP and state validation errors when using ValidationRunner from abstract_validation_base

Tests

  • Added 16 new tests for enhanced tracking functionality

Full Changelog: v0.7.1...v0.7.2