Interval variables - adding two new exceptions #8

rodricios · 2015-08-25T03:56:58Z

Added WrongVariableTypeError, MultipleMostCommonValuesError.

WrongVariableTypeError addresses the fact that one cannot find the 'expectation' (mean) of a distribution of categorical (nominal) random variables (for example, a distribution of words is equivalent to a categorical variable).

In other words, it makes no sense to find the average word.

From Foundations of Statistical Natural Language Processing - Manning and Schutze:

The expection is the mean or average of a random variable...

Where they define a random variable as being a...

... function X: R - OB” (commonly with n = I), where iw is the set of real numbers

The above quotes are taken from section 2.1.4 on Random Variables.

Unfortunately, the motivation behind MultipleMostCommonValuesError is not based off textbook definitions. Instead, it is based off the fact that we named our function best_pair in the singular.

Oh, and test objects were simplified a bit.

…al values; the stats fns are now executed against 'elemends' and no 'values'

…d only return when a single value is most common

…ting

eugene-eeo · 2015-08-25T04:33:47Z

WrongVariableTypeError should be a subclass of TypeError. In fact, from a philosophical point of view, TypeError is a child of ValueError. With that being said, it is now not necessary to have our own "wrapper" around TypeError, unless you want to provide more semantic information in the form of class names. However for most purposes, a simple TypeError is enough.

rodricios · 2015-08-25T05:34:58Z

Would the statistics info I provided count as a good motivating factor for subclassing TypeError?

I personally see it similar to how stats.py subclasses on line 17:

class StatisticsError(ValueError):
    pass

eugene-eeo · 2015-08-25T05:49:47Z

But the name of WrongVariableTypeError already implies that it should be an error regarding the type of the values, which is more specific the value itself. But then again having your own "wrapper exceptions" are more of an issue of how 'heavy' you want the library to be.

rodricios · 2015-08-25T06:03:46Z

Ah, I think I get what you're suggesting. So drop the extra subclass, but override the exception message?

eugene-eeo · 2015-08-25T06:05:31Z

Yup. See this SO question. Basically,

try:
    num()
except TypeError as err:
    err.message = 'custom_message'
    raise err

… the elements of the referencing class; used for checking if dist is 'discrete random variable'

rodricios added 5 commits July 21, 2015 23:28

added new error for cases when attempting to find mean of non-numeric…

e84e18f

…al values; the stats fns are now executed against 'elemends' and no 'values'

upped vs. number

7ad446d

adding errors for median fn's for when dealing with non-numeric types

30bbd90

adding MultipleMostCommonValuesError; max, argmax and best_pair shoul…

5dd81b2

…d only return when a single value is most common

formatted test functions to match conventions regarding exception tes…

e4b1d8c

…ting

added key_types_distribution, which creates a prob. dist. of types of…

7b9bc61

… the elements of the referencing class; used for checking if dist is 'discrete random variable'

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interval variables - adding two new exceptions #8

Interval variables - adding two new exceptions #8

rodricios commented Aug 25, 2015

eugene-eeo commented Aug 25, 2015

rodricios commented Aug 25, 2015

eugene-eeo commented Aug 25, 2015

rodricios commented Aug 25, 2015

eugene-eeo commented Aug 25, 2015

Interval variables - adding two new exceptions #8

Are you sure you want to change the base?

Interval variables - adding two new exceptions #8

Conversation

rodricios commented Aug 25, 2015

eugene-eeo commented Aug 25, 2015

rodricios commented Aug 25, 2015

eugene-eeo commented Aug 25, 2015

rodricios commented Aug 25, 2015

eugene-eeo commented Aug 25, 2015