Enhance DataCollector to validate model_reporters functions #2605

peter-kinger · 2025-01-10T07:47:25Z

Summary

This PR enhances the DataCollector class by adding a comprehensive validation system for model reporters. The main goal is to provide clearer feedback when reporters are misconfigured, while maintaining backward compatibility.

Motive

The original DataCollector implementation had several limitations:

No validation for different types of model reporters
Silent failures or unclear error messages when reporters were misconfigured
Inconsistent handling of method calls and function parameters
No early warning system for potential issues

These issues made debugging difficult and led to confusion when setting up model reporters.

Implementation

New Validation Method:

def _validate_model_reporter(self, name, reporter, model):
    """Validates four types of model reporters:
    1. Lambda functions
    2. Method references (as strings)
    3. Attribute strings
    4. Function lists with parameters
    """

Modified Collection Logic:

def collect(self, model):
    if self.model_reporters:
        for var, reporter in self.model_reporters.items():
            # Add validation before collection
            self._validate_model_reporter(var, reporter, model)
            # Existing collection logic continues...

Warning System:

Replaced hard errors with warning messages
Added specific warnings for each reporter type
Included examples in warning messages

Usage Examples

Lambda Functions:

# Valid usage
model_reporters = {
    "Agent Count": lambda m: len(m.agents)
}

# Invalid usage (will show warning)
model_reporters = {
    "Bad Lambda": lambda m: m.nonexistent_attr
}

Method References:

# Valid usage
model_reporters = {
    "Grid Size": "get_grid_size"  # As string
}

# Invalid usage (will show warning)
model_reporters = {
    "Grid Size": self.get_grid_size  # Direct reference
}

Attribute Strings:

# Valid usage
model_reporters = {
    "Total Wealth": "total_wealth"
}

# Invalid usage (will show warning)
model_reporters = {
    "Status": "nonexistent_attribute"
}

Function Lists:

# Valid usage
model_reporters = {
    "Custom": [calculate_metric, [model, param]]
}

# Invalid usage (will show warning)
model_reporters = {
    "Bad Function": ["not_callable", [1, 2]]
}
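
To make the four valid forms above concrete, here is a minimal sketch of how each one could be turned into a collected value. `ToyModel` and `resolve_reporter` are hypothetical names used only for this example, and the sketch assumes that a string naming a method is called without arguments; it is not a copy of Mesa's collection logic.

```python
class ToyModel:
    """Hypothetical stand-in for a Mesa model (not part of this PR)."""

    def __init__(self):
        self.agents = [object(), object(), object()]
        self.total_wealth = 42

    def get_grid_size(self):
        return (10, 10)


def resolve_reporter(reporter, model):
    """Hypothetical dispatch over the four reporter forms."""
    if isinstance(reporter, str):
        value = getattr(model, reporter)
        # Assumption: a string naming a method is called without arguments.
        return value() if callable(value) else value
    if isinstance(reporter, list):
        # Function list: [function, [arg1, arg2, ...]].
        func, args = reporter
        return func(*args)
    # Lambda or other callable that takes the model.
    return reporter(model)


model = ToyModel()
model_reporters = {
    "Agent Count": lambda m: len(m.agents),
    "Grid Size": "get_grid_size",
    "Total Wealth": "total_wealth",
    "Custom": [pow, [2, 5]],
}
collected = {
    name: resolve_reporter(reporter, model)
    for name, reporter in model_reporters.items()
}
```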

Additional Notes

Test Coverage:

Added comprehensive test suite in test_model_reporters.py
Tests cover both valid and invalid configurations
Includes edge cases and error conditions

Backward Compatibility:

All existing code continues to work
Warnings can be suppressed if desired
No breaking changes to the API
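
As a sketch of what suppressing (or escalating) these warnings could look like with the standard `warnings` module: `noisy_reporter_check` below is a hypothetical stand-in for any call that emits a `UserWarning`, and the message text is invented for the example.

```python
import warnings


def noisy_reporter_check():
    """Hypothetical stand-in for a validation call that emits a UserWarning."""
    warnings.warn("Lambda reporter 'Bad Lambda' failed", UserWarning, stacklevel=2)
    return "checked"


# Suppress the warning for one block of code only.
with warnings.catch_warnings():
    warnings.simplefilter("ignore", UserWarning)
    suppressed_result = noisy_reporter_check()

# Alternatively, turn the warning into an exception, e.g. in a test suite.
with warnings.catch_warnings():
    warnings.simplefilter("error", UserWarning)
    try:
        noisy_reporter_check()
        escalated = False
    except UserWarning:
        escalated = True
```

Using `warnings.catch_warnings()` keeps the filter change local, so the rest of the program still sees warnings as usual.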

Documentation:

Updated docstrings with clear examples
Added warning messages with helpful suggestions
Included migration guide for existing code

Files Modified:

datacollection.py: the only Python file changed in the official Mesa repository.

Dependencies:

No new dependencies added
Uses standard Python warnings module

datacollection_new.zip

EwoutH · 2025-01-10T10:05:22Z

Thanks for the PR, sounds interesting.

What's the performance overhead?

quaquel · 2025-01-12T18:47:45Z

mesa/datacollection.py

+                    f"Warning: Lambda reporter '{name}' failed: {e!s}\n"
+                    f"Example of valid lambda: lambda m: len(m.agents)",
+                    UserWarning,
+                    stacklevel=2,
+                )


Why issue a warning instead of raising an exception?

quaquel · 2025-01-12T18:48:53Z

mesa/datacollection.py

+                # Add validation
+                self._validate_model_reporter(var, reporter, model)
+


Does this imply that the validation is done every single time you try to collect the data? That seems inefficient and overkill.
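
One possible alternative, sketched under assumptions, is to validate each reporter only the first time it is collected. `OnceValidatingCollector` is a hypothetical class, not Mesa's `DataCollector`; it handles only callable reporters and raises `TypeError` instead of warning, purely to keep the example short.

```python
class OnceValidatingCollector:
    """Hypothetical collector that validates each reporter only on first use."""

    def __init__(self, model_reporters):
        self.model_reporters = model_reporters
        self.model_vars = {name: [] for name in model_reporters}
        self._validated = set()
        self.validation_calls = 0  # counter kept only to make the effect visible

    def _validate(self, name, reporter):
        self.validation_calls += 1
        if not callable(reporter):
            raise TypeError(f"Reporter '{name}' is not callable")

    def collect(self, model):
        for name, reporter in self.model_reporters.items():
            # Validate on the first collection only, then remember the result.
            if name not in self._validated:
                self._validate(name, reporter)
                self._validated.add(name)
            self.model_vars[name].append(reporter(model))


# A plain dict stands in for the model here.
collector = OnceValidatingCollector({"Step": lambda m: m["step"]})
for step in range(3):
    collector.collect({"step": step})
```

With this shape the validation cost is paid once per reporter rather than once per `collect()` call.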

Enhance DataCollector to validate model_reporters functions

426c62a

peter-kinger mentioned this pull request Jan 10, 2025

Enhance DataCollector to Validate model_reporters Functions #2606

Open
