Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Op#3648 create streaming track parser #554

Open
wants to merge 43 commits into
base: main
Choose a base branch
from

Commits on Dec 12, 2023

  1. first attempts for streaming track parser

    add lazy version of track dataset and detection parser
    KueblerJelle committed Dec 12, 2023
    Configuration menu
    Copy the full SHA
    ed0c9f0 View commit details
    Browse the repository at this point in the history

Commits on Jan 4, 2024

  1. Configuration menu
    Copy the full SHA
    6e3c81f View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    5c2d677 View commit details
    Browse the repository at this point in the history

Commits on Jan 23, 2024

  1. Added functionality to parse detections as stream for multiple files

    Detection parser can accept a stream of file contents to lazily load files only when they are needed
    KueblerJelle committed Jan 23, 2024
    Configuration menu
    Copy the full SHA
    b3944ed View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    66eb2d1 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    723f4c8 View commit details
    Browse the repository at this point in the history
  4. Reduced code duplication by inheriting OttrkParser, adapted additions…

    … made to TrackDataset in subclass
    KueblerJelle committed Jan 23, 2024
    Configuration menu
    Copy the full SHA
    ed30083 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    2efca9d View commit details
    Browse the repository at this point in the history

Commits on Feb 3, 2024

  1. Configuration menu
    Copy the full SHA
    8939dd2 View commit details
    Browse the repository at this point in the history

Commits on Feb 13, 2024

  1. Configuration menu
    Copy the full SHA
    9ca3fde View commit details
    Browse the repository at this point in the history

Commits on Feb 16, 2024

  1. Configuration menu
    Copy the full SHA
    e65ec68 View commit details
    Browse the repository at this point in the history

Commits on Mar 1, 2024

  1. refactored stream detection / track parser -> requires reduction of c…

    …ode duplication with python track parser
    
    make some functionality reusable through classmethod
    use bzip 2 json stream reader to sort files by start date
    KueblerJelle committed Mar 1, 2024
    Configuration menu
    Copy the full SHA
    5c5bfc9 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b1e627e View commit details
    Browse the repository at this point in the history
  3. cleaned up otvision_parser

    renamed BulkPythonDetectionParser back to PythonDetectionParser
    extracted methods for creating PythonDetections and PythonTracks as static methods on module level
    KueblerJelle committed Mar 1, 2024
    Configuration menu
    Copy the full SHA
    f01ed28 View commit details
    Browse the repository at this point in the history
  4. added metadata format fixer for recorded start date format

    i have some old ottrk files for testing that could no longer be parsed
    a new metadata format fixer fixes the old date string format by transforming it to timestamp format
    TODO: review versions of fixer
    KueblerJelle committed Mar 1, 2024
    Configuration menu
    Copy the full SHA
    328b291 View commit details
    Browse the repository at this point in the history
  5. move stateless reusable methods from PythonTrackDataset to module level

    clean up streaming parser and add more doc strings
    KueblerJelle committed Mar 1, 2024
    Configuration menu
    Copy the full SHA
    26950b5 View commit details
    Browse the repository at this point in the history

Commits on Mar 12, 2024

  1. Configuration menu
    Copy the full SHA
    ec9c022 View commit details
    Browse the repository at this point in the history
  2. fixed bugs in ottrk_parser and streaming_parser

    found potential memory leak: delete det_list in StreamOttrkParser.parser()
    OttrkVideoParser crashed in a manual test because the metadata format did not match -> apply format fixer
    KueblerJelle committed Mar 12, 2024
    Configuration menu
    Copy the full SHA
    2b7d87a View commit details
    Browse the repository at this point in the history

Commits on Mar 19, 2024

  1. Configuration menu
    Copy the full SHA
    cec9362 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4ab4171 View commit details
    Browse the repository at this point in the history

Commits on Mar 22, 2024

  1. Configuration menu
    Copy the full SHA
    7f23cd8 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c5fd43a View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    09a3fd9 View commit details
    Browse the repository at this point in the history
  4. Add ijson as dependency

    briemla committed Mar 22, 2024
    Configuration menu
    Copy the full SHA
    d283752 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    a6c1216 View commit details
    Browse the repository at this point in the history

Commits on Mar 28, 2024

  1. Added cli-mode to cli args to start either bulk or stream processing

    also reduced code duplication in cli.py and main_application.py
    
    also fixed/adapted tests
    KueblerJelle committed Mar 28, 2024
    Configuration menu
    Copy the full SHA
    bd19151 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3c545de View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    21b6436 View commit details
    Browse the repository at this point in the history
  4. fixed MetadataFixer for recorded start date -> catch TypeError

    fixed test_cli patches
    KueblerJelle committed Mar 28, 2024
    Configuration menu
    Copy the full SHA
    c940583 View commit details
    Browse the repository at this point in the history

Commits on Apr 6, 2024

  1. Configuration menu
    Copy the full SHA
    ed26a52 View commit details
    Browse the repository at this point in the history

Commits on May 27, 2024

  1. Configuration menu
    Copy the full SHA
    b1310d2 View commit details
    Browse the repository at this point in the history
  2. fixed minor merge bugs, implemented missing max_confidence_of method …

    …in SingletonTrackDataset
    ar0305 committed May 27, 2024
    Configuration menu
    Copy the full SHA
    acdfbf6 View commit details
    Browse the repository at this point in the history

Commits on Jun 27, 2024

  1. Configuration menu
    Copy the full SHA
    3fa15dc View commit details
    Browse the repository at this point in the history

Commits on Jul 16, 2024

  1. refactored StreamingOttrkParser to allow for track chunks

    now a chunk size can be specified, tracks will be loaded from the track files until n tracks are collected
    the collected tracks are transformed into a TrackDataset, the caller of StreamingOttrkParser can specify a TrackDataset factory
    ar0305 committed Jul 16, 2024
    Configuration menu
    Copy the full SHA
    b1df536 View commit details
    Browse the repository at this point in the history

Commits on Jul 31, 2024

  1. Introduce ExportMode in result exporters, refactored some exporters t…

    …o be stateful to allow for incremental result exports
    
    Added ExportMode to various export specification data classes:
    ExportMode Overwrite for non incremental result export
    ExportModes INITIAL_MERGE, MERGE to append data
    ExportMode FLUSH to write out results aggregated in stateful exporters
    All exporters now consider this export mode
    
    Introduced missing EventExportSpecification to be more similar to other exporters
    
    If first event export has no events the resulting csv/excel have no header, as the data frame is empty, therefore specified all columns as header list
    
    Fixed bug in EventRepository:
    clear previously only checked for the existence of (section)events, if none existed nothing was cleared, even if non_section_events were present
    
    stateful exporters for counts require to be cached in a newly added CachedExporterFactory
    
    Added merge_into_dict functions to TracksMetadata and VideosMetadata to allow for incrementally updating a dict with metadata updates
    ar0305 committed Jul 31, 2024
    Configuration menu
    Copy the full SHA
    4f1ee9c View commit details
    Browse the repository at this point in the history

Commits on Sep 3, 2024

  1. updated test cases in test_cli to test both BULK and STREAMING cli mode

    fixed bugs:
    no longer use Counter in CsvExport of COunts, instead manually increment counted tags using defaultdict
    
    Streaming parser now has a set of VideosMetadata and TracksMetadata. instead of a list, to avoid duplications
    
    write event list data frame even if empty
    
    OTAnalyticsStreamCli and OTAnalyticsBulkCli both use same attribute name _track_parser
    ar0305 committed Sep 3, 2024
    Configuration menu
    Copy the full SHA
    ea4776a View commit details
    Browse the repository at this point in the history

Commits on Sep 12, 2024

  1. started parameterizing regression test with Stream and Bulk CLI Mode

    Added error messages to assertions when comparing file content
    ar0305 committed Sep 12, 2024
    Configuration menu
    Copy the full SHA
    52b649e View commit details
    Browse the repository at this point in the history
  2. fixed assertion code after debugging cleanup

    ar0305 committed Sep 12, 2024
    Configuration menu
    Copy the full SHA
    abdc0f7 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    bf07c2a View commit details
    Browse the repository at this point in the history

Commits on Sep 18, 2024

  1. Configuration menu
    Copy the full SHA
    b11fafa View commit details
    Browse the repository at this point in the history
  2. reset cli mode in benchmark to BULK

    fixed type annotations for write mode
    ar0305 committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    fb2c45f View commit details
    Browse the repository at this point in the history
  3. fixed todo review comments -> moved to todo comments in pull request …

    …comments
    
    removed unused class SingletonTrackDataset
    ar0305 committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    9cc8ce6 View commit details
    Browse the repository at this point in the history
  4. removed todo

    ar0305 committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    84abad0 View commit details
    Browse the repository at this point in the history