Quantifying non-monotonicity of functions and the lack of positivity in signed measures

In various research areas related to decision making, problems and their solutions frequently rely on certain functions being monotonic. In the case of non-monotonic functions, one would then wish to quantify their lack of monotonicity. In this paper we develop a method designed specifically for this task, including quantification of the lack of positivity, negativity, or sign-constancy in signed measures. We note relevant applications in Insurance, Finance, and Economics, and discuss some of them in detail.


Introduction
In various research areas such as Economics, Finance, Insurance, and Statistics, problems and their solutions frequently rely on certain functions being monotonic, as well as on determining the degree of their monotonicity, or lack of it. For example, the notion of profit seekers in Behavioural Economics and Finance is based on increas-ing utility functions, which can have varying shapes and thus characterize subclasses of profit seekers (e.g., [9]). In Reliability Engineering and Risk Assessment (e.g., [13]), a number of notions such as hazard-rate and likelihood-ratio orderings rely on monotonicity of the ratios of certain functions. The presence of insurance deductibles and policy limits often change the pattern of monotonicity of insurance losses (e.g., [3]). In the literature on Statistical Inference, the so-called monotone-likelihood-ratio family plays an important role.
Due to these and a myriad of other reasons, researchers quite often restrict themselves to function classes with pre-specified monotonicity properties. But one may not be comfortable with this element of subjectivity and would therefore prefer to rely on data-implied shapes when making decisions. To illustrate the point, we recall, for example, the work of Bebbington et al. [1] who specifically set out to determine whether mortality continues to increase, or starts to decelerate, after a certain species-related late-life age. This is known in the gerontological literature as the late-life mortality deceleration phenomenon. Naturally, we refrain from elaborating on this complex topic and refer for details and further references to the aforementioned paper.
Monotonicity may indeed be necessary for certain results or properties to hold, but there are also many instances when monotonicity is just a sufficient condition. In such cases, a natural question arises: can we depart from monotonicity and still have valid results? Furthermore, in some cases, monotonicity may not even be expected to hold, though perhaps be desirable, and so developing techniques for quantifying the lack of monotonicity becomes of interest. Several results in this direction have recently been proposed in the literature (e.g., [5,21,15,16]; and references therein). All in all, these are some of the problems, solutions to which ask for indices that could be used to assess monotonicity, or lack of it. In the following sections we introduce and discuss several such indices, each designed to reveal different monotonicity aspects.
We have organized the rest of the paper as follows. In Section 2 we present a specific example, driven by insurance and econometric considerations, which not only motivates present research but also highlights the main idea adopted in this paper. In Section 3 we introduce and discuss indices of lack of increase, decrease, and monotonicity. We also suggest a convenient numerical procedure for a quick calculation of the indices at any desirable precision. In Section 4 we introduce orderings of functions according to the values of their indices and argue in favour of using normalized versions of the indices. In Section 5 we introduce a stricter notion of ordering, and in Section 6 we develop a theory for quantifying the lack of positivity (or negativity) in signed measures. Section 7 concludes the paper.

Motivating example
Insurance losses are non-negative random variables X ≥ 0, and the expected value E[X] is called the net premium. All practically-sound premium calculation principles (pcp's), which are functionals π assigning non-negative finite or infinite values to X's, are such that π[X] ≥ E[X]. The latter property is called non-negative loading of π.
As an example, consider the following 'dual' version of the weighted pcp ( [7,8]; and references therein): where U is the random variable uniform on [0, 1], F −1 is the quantile function of X defined by the equation F −1 (p) = inf{x : F (x) ≥ p}, and w : [0, 1] → [0, ∞] is an appropriately chosen weight function, whose illustrative examples will be provided in a moment. We of course assume that the two expectations in the definition of π w are well-defined and finite, and the denominator is not zero. When dealing with insurance losses, researchers usually choose non-decreasing weight functions in order to have non-negative loading of π w . For example, w(t) = 1{t > p} for any parameter p ∈ (0, 1) is non-decreasing, and it turns π w into the average value at risk, also known as tail conditional expectation. Another example is w(t) = ν(1 − t) ν−1 with parameter ν > 0. If ν ∈ (0, 1], then w is non-decreasing, and π w becomes the proportional hazards pcp [18,19]. If ν ≥ 1, then w is nonincreasing, and π w becomes the (absolute) S-Gini index of economic equality (e.g., [23]; and references therein).
Note that the pcp π w can be rewritten as the weighted integral with the normalized weight function w * (t) = w(t)/ 1 0 w(u)du, which is a probability density function, because w(t) ≥ 0 for all t ∈ [0, 1] and 1 0 w(u)du ∈ (0, ∞). This representation of π w connects our present research with the dual utility theory ( [20,17]; and references therein) that has arisen as a prominent counterpart to the classical utility theory of von Neuman and Morgenstern [14]. It is also important to mention that in Insurance, integral (2) plays a very prominent role and is known as the distortion risk measure (e.g., [6]; and references therein).
In general, the function w and its normalized version w * may not be monotonic, as it depends on the shape of the probability density function of the random variable W ∈ [0, 1] in the following reformulation of π w : In the econometric language, E[F −1 (W )] means the average income (assuming that X stands for 'income') possessed by individuals whose positions on the society's income-percentile scale are modelled by W , which is, naturally, a random variable. Hence, we are interested when E[F −1 (W )] is at least E[W ] (insurance perspective) or at most E[W ] (income inequality perspective). In view of equations (1) and (2), our task reduces to verifying whether or not If the function w happens to be non-decreasing (as in insurance), then bound (4) holds (e.g., [11]). For example, with parameter λ > 0, the weight function w(s) = s λ leads to the size-biased pcp, w(s) = e λs to the Esscher pcp, w(s) = 1 − e −λs to the Kamps pcp, and the already noted function w(s) = 1{s > λ} leads to the average value at risk. As already noted, broader contexts than that of classical insurance suggest various shapes of probability distortions and thus lead to functions w that are not necessarily monotonic. Indeed, in view of representation (3), the average income of individuals depends on the distribution of W , whose shape is governed by societal opportunities that the individuals are exposed to. A natural question arises: What shapes of w could ensure property (4)?
To answer this question in an illuminating way, let w be absolutely continuous and such that w(0) = 0, which are sound assumptions from the practical point of view. Denote the density of w by w ′ , and let 1 {u>t} be the indicator. We have [11]), and so is also the integral θ = 1 0 v(t)dt, which makes v(t)/θ a probability density function. Denote the corresponding distribution function by V . We have the equations where λ denotes the Lebesgue measure. Hence, property (4) is equivalent to the bound 1 0 gdλ ≥ 0 with g = w ′ • V −1 , and it can of course be rewritten as the inequality where g + = max{g, 0} and g − = max{−g, 0}. Bound (5) says that, in average with respect to the Lebesgue measure, the positive part g + must be larger than the negative part g − . Those familiar with asset pricing will immediately see how inequality (5), especially when reformulated as is connected to the gain-loss ratio [2], as well as to the Omega ratio [10]. Furthermore, we can reformulate bound (6) as with the ratio on the left-hand side being equal to 0 when the function g is nonpositive, and equal to 1 when g is non-negative. Hence, bound (7) says that the ratio must be at least 1/2, which means that, in average, the function g must be more positive than negative. The above interpretations have shaped our considerations in the present paper, and have led toward the construction of monotonicity indices that we introduce and discuss next.

Assessing lack of monotonicity in functions
We are interested in assessing monotonicity of a function g 0 on an interval [a, b]. Since shifting the function up or down, left or right, does not distort its monotonicity, we therefore 'standardize' g 0 into defined on the interval [0, y] with y = b − a. Note that g satisfies the boundary condition g(0) = 0. We assume that g 0 and thus g are absolutely continuous. Let F y denote the set of all absolutely continuous functions f on the interval the set of all f ∈ F y that are non-decreasing. For any g ∈ F y , we define its index of lack of increase (LOI) as the distance between g and the set F + y , that is, Obviously, if g is non-decreasing, then LOI y (g) = 0, and the larger the value of LOI y (g), the farther the function g is from being non-decreasing on the interval [0, y].
For the function g 0 , which is where g originates from, the LOI of g 0 on the interval (Throughout the paper we occasionally use ':=,' when a need arises to emphasize that certain equations are by definition.) Determining LOI y (g) using definition (9) is not straightforward. To facilitate the task, we next give an integral representation of the index.
When g originates from g 0 via equation (8), we have Theorem 1, though easy to prove directly, follows immediately from a more general result, Theorem 2 of Section 6, and we thus do not provide any more details. The index of lack of decrease (LOD) is defined analogously. Namely, in the computationally convenient form, it is given by the equation When g originates from g 0 via equation (8), we have In turn, the index of lack of monotonicity (LOM) of g is given by the equation and when g originates from g 0 via equation (8), we have The reason for doubling the minimum on the right-hand sides of definitions (12) and (13) will become clear from properties below.
A1) The index LOI y is translation invariant, that is, LOI y (g + α) = LOI y (g) for every constant α ∈ R. The index LOD y is also translation invariant.
A2) The index LOI y is positively homogeneous, that is, LOI y (βg) = βLOI y (g) for every constant β ≥ 0. The index LOD y is also positively homogeneous.

A4
) LOI y (g) + LOD y (g) = g y . Consequently, we have the following two observations: (a) LOI y (g) and LOD y (g) do not exceed g y , with both indices achieving the upper bound. Namely, LOI y (g) = g y whenever (g ′ ) + ≡ 0, and LOD y (g) = g y whenever (g ′ ) − ≡ 0.
An illustrative example follows.
Hence, for example, sine is at the distance 1 from the set of all non-decreasing functions on the noted interval, whereas cosine is at the distance 2 from the same set. Note also that the total variations 3π/2 0 |g ′ |dλ and 3π/2 0 |h ′ |dλ of the two functions are the same, equal to 3. This concludes Example 1. (14) and (15) are easy to calculate, but functions arising in applications are frequently quite unwieldy. For this, we need a numerical procedure for calculating the integral A convenient way is as follows: The use of the integral y 0 H(f ′ )dλ in Note 1 hints at the possibility of distances other than L 1 -norms when defining indices. One may indeed wish to de-emphasize small values of f ′ and to emphasize its large values when defining indices. In a simple way, this can be achieved by taking the p-th power of (f ′ ) − , (f ′ ) + and f ′ for any p ≥ 1. This argument leads us to the L p -type counterpart of minimization problem (9) which can be solved along the same path as that used in the proof of Theorem 1. It is remarkable that the minimizing function does not depend on the choice of the metric and remains equal to (f ′ ) + . Consequently, for example, the following L pindex of lack of increase arises:

Note 1. Example 1 is based on functions for which the four integrals in equations
Other than L p -norms can also be successfully explored, and this is important in applications, where no p-th power may adequately (de-)emphasize parts of f ′ . The phenomenon has prominently manifested, for example, in Econometrics (e.g., [4,22]; and references therein). In such cases, more complexly shaped functions are typically used, including those H : [0, ∞) → [0, ∞) with H(0) = 0 that give rise to the class of Birnbaum-Orlicz (BO) spaces.

Monotonicity comparisons and normalized indices
The index values in Example 1 suggest that on the noted interval, sine is more increasing than cosine, because sine is closer to the set F + y than cosine is. In general, given two functions g 1 and h 1 on the interval [0, y], we can say that the function g 1 is more non-decreasing than h 1 on the interval whenever LOI y (g 1 ) ≤ LOI y (h 1 ). Likewise, we can say that the function g 2 is more non-increasing than h 2 on the interval [0, y] whenever LOD y (g 2 ) ≤ LOD y (h 2 ). But an issue arises with these definitions because for a given pair of functions g and h, the property LOI y (g) ≤ LOI y (h) may not be equivalent to LOD y (g) ≥ LOD y (h). Though it may look strange at first sight, this non-reflexivity is natural because the total variations of the functions g and h on the interval [0, y] may not be equal, and in such cases, comparing non-monotonicities of g and h is not meaningful.
If, however, the total variations of g and h are equal on the interval [0, y], then LOI y (g) ≤ LOI y (h) if and only if LOD y (g) ≥ LOD y (h). This suggests that in order to achieve this 'if and only if' property in general, we need to normalize the indices, which gives rise to the following definitions of the normalized indices of lack of increase and decrease, respectively. Obviously, and thus LOI * y (g) ≤ LOI * y (h) if and only if LOD * y (g) ≥ LOD * y (h). Furthermore, the normalized index of lack of monotonicity is LOM * y (g) := 2 min{LOI * y (g), LOD * y (g)}, which we rewrite as using the identity min{u, v} = (u + v − |u − v|)/2 that holds for all real numbers u and v. In terms of the original function g 0 on the interval [a, b], we have Obviously, LOI * [a,b] (g 0 ) = LOI * y (g), LOD * [a,b] (g 0 ) = LOD * y (g), and LOM * [a,b] (g 0 ) = LOM * y (g). To illustrate the normalized indices, we continue Example 1.
Next are properties of the normalized indices.
We next use the above indices to introduce three new orderings: C1) The function g is more non-decreasing than h on the interval [0, y], denoted by g ≥ I,y h, if and only if LOI * y (g) ≤ LOI * y (h).
C2) The function g is more non-increasing than h on the interval [0, y], denoted by g ≥ D,y h, if and only if LOD * y (g) ≤ LOD * y (h).
C3) The function g is more monotonic than h on the interval [0, y], denoted by g ≥ M,y h, if and only if LOM * y (g) ≤ LOM * y (h).
We see that on the interval [0, y], the function g is more non-decreasing than h if and only if the function g is less non-increasing than h. In other words, g ≥ I,y h is equivalent to g ≤ D,y h, which we have achieved by introducing the normalized indices.

Stricter notion of comparison: a note
One of the fundamental notions of ordering random variables is that of first-order stochastic dominance (e.g., [6,12,13]). Similarly to this notation, our earlier introduced orderings can be strengthened by first noting that the integral counts the 'time' that the function (g ′ ) − spends above the threshold z during the 'time' period [0, y]. Likewise, we define the 'plus' version S + y (z | g). This lead us to the following definitions of ordering functions according to their monotonicity. D1) g is more strictly non-decreasing than h on the interval [0, y], denoted by g ≥ SI,y h, whenever D2) g is more strictly non-increasing than h on the interval [0, y], denoted by g ≥ SD,y h, whenever Obviously, if g ≥ SI ,y h, then g ≥ I,y h, and if g ≥ SD,y h, then g ≥ D,y h.

Assessing lack of positivity in signed measures
We now take a path in the direction of general Measure Theory. Namely, let Ω be a set equipped with a sigma-algebra, and let M denote the set of all (signed) measures ν defined on the sigma-algebra. Furthermore, let M + ⊂ M be the subset of all positive measures. Given a signed-measure ν ∈ M, we define its index of lack of positivity (LOP) by the equation where · denotes the total variation. Specifically, with (Ω − , Ω + ) denoting a Hahn decomposition of Ω, let (ν − , ν + ) be the Jordan decomposition of ν. Note that ν − and ν + are elements of M + . The variation of ν is |ν| = ν − + ν + , and its total variation is ν = |ν|(Ω). The following theorem provides an actionable, and crucial for our considerations, reformulation of the index LOP(ν).
Similarly to the LOP index, the index of lack of negativity (LON) of ν ∈ M is given by the equation and the corresponding index of lack of sign (LOS) is The reason for doubling the minimum on the right-hand side of equation (23) is the same as in the more specialized cases discussed earlier. Namely, due to the equation ν − + ν + = ν , we have that min{ ν − , ν + } does not exceed ν /2. Hence, for LOS to be always between 0 and ν , just like LOP and LON are, we need to double the minimum. Finally, we introduce the normalized indices and LOS * (ν) = 2 min LOP * (ν), LON * (ν) , whose values are always in the unit interval [0, 1].

Conclusion
In this paper, we have introduced indices that, in a natural way, quantify the lack of increase, decrease, and monotonicity of functions, as well as the lack of positivity, negativity, and sign-constancy in signed measures. In addition to being of theoretical interest, this research topic also has practical implications, and for the latter reason, we have also introduced a simple and convenient numerical procedure for calculating the indices without resorting to frequently unwieldy closed-form expressions. The indices satisfy a number of natural properties, and they also facilitate the ranking of functions according to their lack of monotonicity. Relevant applications in Insurance, Finance, and Economics have been pointed out, and some of them discussed in greater detail.