Tag Archives: mathematical definition

Insights into mathematical definitions

My general practice with abstractmath.org has been to write about the problems students have at the point where they first start studying abstract math, with some emphasis on the languages of math. I have used my own observations of students, lexicographical work I did in the early 2000’s, and papers written by workers in math ed at the college level.

A few months ago, I finished revising and updating abstractmath.org. This took rather more than a year because among other things I had to reconstitute the files so that the html could be edited directly. During that time I just about quit reading the math ed literature. In the last few weeks I have found several articles that have changed my thinking about some things I wrote in abmath, so now I need to go back and revise some more!

In this post I will make some points about definitions that I learned from the paper by Edwards and Ward and the paper by Selden and Selden

I hope math ed people will read the final remarks.

Peculiarities of math definitions

When I use a word, it means just what I choose it to mean–neither more nor less.” — Humpty Dumpty

A mathematical definition is fundamentally different from other sorts of definitions in two different ways. These differences are not widely appreciated by students or even by mathematicians. The differences cause students a lot of trouble.

List of properties

One of the ways in which a math definition is different from other kinds is that the definition of a math object is given by accumulation of attributes, that is, by listing properties that the object is required to have. Any object defined by the definition must have all those properties, and conversely any object with all the properties must be an example of the type of object being defined. Furthermore, there is no other criterion than the list of attributes.

Definitions in many fields, including some sciences, don’t follow this rule. Those definitions may list some properties the objects defined may have, but exceptions may be allowed. They also sometimes give prototypical examples. Dictionary definitions are generally based on observation of usage in writing and speech.

Imposed by decree

One thing that Edwards and Ward pointed out is that, unlike definitions in most other areas of knowledge, a math definition is stipulated. That means that meaning of (the name of) a math object is imposed on the reader by decree, rather than being determined by studying the way the word is used, as a lexicographer would do. Mathematicians have the liberty of defining (or redefining) a math object in any way they want, provided it is expressed as a compulsory list of attributes. (When I read the paper by Edwards and Ward, I realized that the abstractmath.org article on math definitions did not spell that out, although it was implicit. I have recently revised it to say something about this, but it needs further work.)

An example is the fact that in the nineteenth century some mathe­maticians allowed $1$ to be a prime. Eventually they restricted the definition to exclude $1$ because including it made the statement of the Fundamental Theorem of Arithmetic complicated to state.

Another example is that it has become common to stipulate codomains as well as domains for functions.

Student difficulties

Giving the math definition low priority

Some beginning abstract math students don’t give the math definition the absolute dictatorial power that it has. They may depend on their understanding of some examples they have studied and actively avoid referring to the definition. Examples of this are given by Edwards and Ward.

Arbitrary bothers them

Students are bothered by definitions that seem arbitrary. This includes the fact that the definition of “prime” excludes $1$. There is of course no rule that says definitions must not seem arbitrary, but the students still need an explanation (when we can give it) about why definitions are specified in the way they are.

What do you DO with a definition?

Some students don’t realize that a definition gives a magic formula — all you have to do is say it out loud.
More generally, the definition of a kind of math object, and also each theorem about it, gives you one or more methods to deal with the type of object.

For example, $n$ is a prime by definition if $n\gt 1$ and the only positive integers that divide $n$ are $1$ and $n$. Now if you know that $p$ is a prime bigger than $10$ then you can say that $p$ is not divisible by $3$ because the definition of prime says so. (In Hogwarts you have to say it in Latin, but that is no longer true in math!) Likewise, if $n\gt10$ and $3$ divides $n$ then you can say that $n$ is not a prime by definition of prime.

The paper by Bills and Tall calls this sort of thing an operable definition.

The paper by Selden and Selden gives a more substantial example using the definition of inverse image. If $f:S\to T$ and $T’\subseteq T$, then by definition, the inverse image $f^{-1}T’$ is the set $\{s\in S\,|\,f(s)\in T’\}$. You now have a magic spell — just say it and it makes something true:

  • If you know $x\in f^{-1}T’$ then can state that $f(x)\in T’$, and all you need to justify that statement is to say “by definition of inverse image”.
  • If you know $f(x)\in T’$ then you can state that $x\in f^{-1}T’$, using the same magic spell.

Theorems can be operable, too. Wiles’ Theorem wipes out the possibility that there is an integer $n$ for which $n^{42}=365^{42}+666^{42}$. You just quote Wiles’ Theorem — you don’t have to calculate anything. It’s a spell that reveals impossibilities.

What the operability of definitions and theorems means is:

A definition or theorem is not just a static statement,it is a weapon for deducing truth.

Some students do not realize this. The students need to be told what is going on. They do not have to be discarded to become history majors just because they may not have the capability of becoming another Andrew Wiles.

Final remarks

I have a wish that more math ed people would write blog posts or informal articles (like the one by Edwards and Ward) about what that have learned about students learning math at the college level. Math ed people do write scholarly articles, but most of the articles are behind paywalls. We need accessible articles and blog posts aimed at students and others aimed at math teachers.

And feel free to steal other math ed people’s ideas (and credit them in a footnote). That’s what I have been doing in abstractmath.org and in this blog for the last ten years.

References


  • Bills, L., & Tall, D. (1998). Operable definitions in advanced mathematics: The case of the least upper bound. In A. Olivier & K. Newstead (Eds.), Proceedings of the 22nd Conference of the International Group for the Psychology of Mathematics Education, Vol. 2 (pp. 104-111). Stellenbosch, South Africa: University of Stellenbosch.
  • B. S. Edwards, and M. B. Ward, Surprises from mathematics education research: Student (mis) use of mathematical definitions (2004). American Mathematical Monthly, 111, 411-424.
  • G. Lakoff, Women, Fire and Dangerous
    Things
    . University of Chicago Press, 1990. See his discussion of concepts and prototypes.
  • J. Selden and A. Selden, Proof Construction Perspectives: Structure, Sequences of Actions, and Local Memory, Extended Abstract for KHDM Conference, Hanover, Germany, December 1-4, 2015. This paper may be downloaded from Academia.edu.
  • A Handbook of mathematical discourse, by Charles Wells. See concept, definition, and prototype.
  • Definitions, article in abstractmath.org. (Some of the ideas in this post have now been included in this article, but it is due for another revision.)
  • Definitions in logic and mathematics in Wikipedia.
  • Creative Commons License

    This work is licensed under a Creative Commons Attribution-ShareAlike 2.5 License.

    Send to Kindle

    The Mathematical Definition of Function

    Introduction

    This post is a completely rewritten version of the abstractmath article on the definition of function. Like every part of abstractmath, the chapter on functions is designed to get you started thinking about functions. It is no way complete. Wikipedia has much more complete coverage of mathematical functions, but be aware that the coverage is scattered over many articles.

    The concept of function in mathematics is as important as any mathematical idea. The mathematician’s concept of function includes the kinds of functions you studied in calculus but is much more abstract and general. If you are new to abstract math you need to know:

    • The precise meaning of the word “function” and other concepts associated with functions. That’s what this section is about.
    • Notation and terminology for functions. (That will be a separate section of abstractmath.org which I will post soon.)
    • The many different kinds of functions there are. (See Examples of Functions in abmath).
    • The many ways mathematicians think about functions. The abmath article Images and Metaphors for Functions is a stub for this.

    I will use two running examples throughout this discussion:

    • {F} is the function defined on the set {\left\{1,\,2,3,6 \right\}} as follows: {F(1)=3,\,\,\,F(2)=3,\,\,\,F(3)=2,\,\,\,F(6)=1}. This is a function defined on a finite set by explicitly naming each value.
    • {G} is the real-valued function defined by the formula {G(x)={{x}^{2}}+2x+5}.

    Specification of function

    We start by giving a specification of “function”. (See the abstractmath article on specification.) After that, we get into the technicalities of the definitions of the general concept of function.

    Specification: A function {f} is a mathematical object which determines and is completely determined bythe following data:

    • {f} has a domain, which is a set. The domain may be denoted by {\text{dom }f}.
    • {f} has a codomain, which is also a set and may be denoted by {\text{cod }f}.
    • For each element {a} of the domain of {f}, {f} has a value at {a}, denoted by {f(a)}.
    • The value of {f} at {a} is completely determined by {a} and {f} .
    • The value of {f} at {a} must be an element of the codomain of {f}.

    The operation of finding {f(a)} given {f} and {a} is called evaluation.

    Examples

    • The definition above of the finite function {F} specifies that the domain is the set {\left\{1,\,2,\,3,\,6 \right\}}. The value of {F} at each element of the domain is given explicitly. The value at 3, for example, is 2, because the definition says that {F(2) = 3}. The codomain of {F} is not specified, but must include the set {\{1,2,3\}}.
    • The definition of {G} above gives the value at each element of the domain by a formula. The value at 3, for example, is {G(3)=3^2+2\cdot3+5=20}. The definition does not specify the domain or the codomain. The convention in the case of functions defined on the real numbers by a formula is to take the domain to be all real numbers at which the formula is defined. In this case, that is every real number, so the domain is {{\mathbb R}}. The codomain must include all real numbers greater than or equal to 4. (Why?)

    Comment: The formula above that defines the function G in fact defines a function of complex numbers (even quaternions).

    Definition of function

    In the nineteenth century, mathematicians realized that it was necessary for some purposes (particularly harmonic analysis) to give a mathematical definition of the concept of function. A stricter version of this definition turned out to be necessary in algebraic topology and other fields, and that is the one I give here.

    To state this definition we need a preliminary idea.

    The functional property

    A set R of ordered pairs has the functional property if two pairs in R with the same first coordinate have to have the same second coordinate (which means they are the same pair).

    Examples

    • The set {\{(1,2), (2,4), (3,2), (5,8)\}} has the functional property, since no two different pairs have the same first coordinate. It is true that two of them have the same second coordinate, but that is irrelevant.
    • The set {\{(1,2), (2,4), (3,2), (2,8)\}} does not have the functional property. There are two different pairs with first coordinate 2.
    • The graphs of functions in beginning calculus have the functional property.
    • The empty set {\emptyset} has the functional property .

    Example: Graph of a function defined by a formula

    The graph of the function {G} given above has the functional property. The graph is the set

    \displaystyle \left\{ (x,{{x}^{2}}+2x+5)\,\mathsf{|}\,x\in {\mathbb R} \right\}.

    If you repeatedly plug in one real number over and over, you get out the same real number every time. Example:

    • if {x = 0}, then {{{x}^{2}}+2x+5=5}.  You get 5 every time you plug in 0.
    • if {x = 1}, then {{{x}^{2}}+2x+5=8}.
    • if {x =-2}, then {{{x}^{2}}+2x+5=5}.

    This set has the functional property because if {x} is any real number, the formula {{{x}^{2}}+2x+5} defines a specific real number. (This description of the graph implicitly assumes that {\text{dom } G={\mathbb R}}.)  No other pair whose first coordinate is {-2} is in the graph of {G}, only {(-2, 5)}. That is because when you plug {-2} into the formula {{{x}^{2}}+2x+5}, you get {5} every time. Of course, {(0, 5)} is in the graph, but that does not contradict the functional property. {(0, 5)} and {(-2, 5)} have the same second coordinate, but that is OK.

    How to think about the functional property

    The point of the functional property is that for any pair in the set of ordered pairs, the first coordinate determines what the second one is. That’s why you can write “{G(x)}” for any {x } in the domain of {G} and not be ambiguous.

    Mathematical definition of function

    A function{f} is a mathematical structure consisting of the following objects:

    • A set called the domain of {f}, denoted by {\text{dom } f}.
    • A set called the codomain of {f}, denoted by {\text{cod } f}.
    • A set of ordered pairs called the graph of { f}, with the following properties:
      • {\text{dom } f} is the set of all first coordinates of pairs in the graph of {f}.
      • Every second coordinate of a pair in the graph of {f} is in {\text{cod } f} (but {\text{cod } f} may contain other elements).
      • The graph of {f} has the functional property. Using arrow notation, this implies that {f:A\rightarrow B}.

    Examples

    • Let {F} have graph {\{(1,2), (2,4), (3,2), (5,8)\}} and define {A = \{1, 2, 3, 5\}} and {B = \{2, 4, 8\}}. Then {F:A\rightarrow B} is a function.
    • Let {G} have graph {\{(1,2), (2,4), (3,2), (5,8)\}} (same as above), and define {A = \{1, 2, 3, 5\}} and {C = \{2, 4, 8, 9, 11, \pi, 3/2\}}. Then {G:A\rightarrow C} is a (admittedly ridiculous) function. Note that all the second coordinates of the graph are in {C}, along with a bunch of miscellaneous suspicious characters that are not second coordinates of pairs in the graph.
    • Let {H} have graph {\{(1,2), (2,4), (3,2), (5,8)\}}. Then {H:A\rightarrow {\mathbb R}} is a function.

    According to the definition of function, {F}, {G} and {H} are three different functions.

    Identity and inclusion

    Suppose we have two sets A and B with {A\subseteq B}.

    • The identity function on A is the function {{{\text{id}}_{A}}:A\rightarrow A} defined by {{{\text{id}}_{A}}(x)=x} for all{x\in A}. (Many authors call it {{{1}_{A}}}).
    • The inclusion function from A to B is the function {i:A\rightarrow B} defined by {i(x)=x} for all {x\in A}. Note that there is a different function for each pair of sets A and B for which {A\subseteq B}. Some authors call it {{{i}_{A,\,B}}} or {\text{in}{{\text{c}}_{A,\,B}}}.

    Remark The identity function and an inclusion function for the same set A have exactly the same graph, namely {\left\{ (a,a)|a\in A \right\}}.

    Graphs and functions

    • If {f} is a function, the domain of {f} is the set of first coordinates of all the pairs in {f}.
    • If {x\in \text{dom } f}, then {f(x)} is the second coordinate of the only ordered pair in {f} whose first coordinate is {x}.

    Examples

    The set {\{(1,2), (2,4), (3,2), (5,8)\}} has the functional property, so it is the graph of a function. Call the function {H}. Then its domain is {\{1,2,3,5\}} and {H(1) = 2} and {H(2) = 4}. {H(4)} is not defined because there is no ordered pair in H beginning with {4} (hence {4} is not in {\text{dom } H}.)

    I showed above that the graph of the function {G}, ordinarily described as “the function {G(x)={{x}^{2}}+2x+5}”, has the functional property. The specification of function requires that we say what the domain is and what the value is at each point. These two facts are determined by the graph.

    Other definitions of function

    Because of the examples above, many authors define a function as a graph with the functional property. Now, the graph of a function {G} may be denoted by {\Gamma(G)}.  This is an older, less strict definition of function that doesn’t work correctly with the concepts of algebraic topology, category theory, and some other branches of mathematics.

    For this less strict definition of function, {G=\Gamma(G)}, which causes a clash of our mental images of “graph” and “function”. In every important way except the less-strict definition, they ARE different!

    A definition is a device for making the meaning of math technical terms precise. When a mathematician think of “function” they think of many aspects of functions, such as a map of one shape into another, a graph in the real plane, a computational process, a renaming, and so on. One of the ways of thinking of a function is to think about its graph. That happens to be the best way to define the concept of function.  (It is the less strict definition and it is a necessary concept in the modern definition given here.)

    The occurrence of the graph in either definition doesn’t make thinking of a function in terms of its graph the most important way of visualizing  it. I don’t think it is even in the top three.

    Send to Kindle