Tag Archives: math object

A new kind of introduction to category theory

About this article

  • This post is an alpha version of the first part of the intended article.
  • People who are beginners in learning abstract math concepts have many misunderstandings about the definitions and early theorems of category theory.
  • This article introduces a few basic concepts of category theory. It goes into detail in Purple Prose about the misunderstandings that can arise with each of the concepts. The article is not at all a complete introduction to categories.
  • My blog post Introducing abstract topics describes some of the strategies needed in teaching a new abstract math concept.
  • This article also introduces a few examples of categories that are primarily chosen to cause the reader to come up against some of those misunderstandings. The first example is completely abstract.
  • Math students usually see categories after considerable exposure to abstract math, but students in computing science and other fields may see it without having much background in abstraction. I hope teachers in such courses will include explanations of the sort of misunderstandings mentioned in this article.
  • Like all posts in Gyre&Gimble and all posts in abstractmath.org, this article is licensed under a Creative Commons Attribution-ShareAlike 2.5 License. If you are teaching a class involving category theory, feel free to hand it out, and to modify it (in which case you should include a link to this post).
  • You could also use the article as a source of remarks you make in the class about the topics.

About categories

To be written.

Definition of category

A category is a type of Mathematical structure consisting of two types of data, whose relationships are entirely determined by some axioms. After the definition is complete, I will introduce several categories with a detailed discussion of each one, explaining how they fit the definition of category.

Axiom 1: Data

  1. A category consists of two types of data: objects and arrows.
  2. No object can be an arrow and no arrow can be an object.

Notes for Axiom 1

  • An object of a category can be any kind of mathematical object. It does not have to be a set and it does not have to have elements.
  • Arrows of a category are also called morphisms. You may be familiar with “homomorphisms”, “homeomorphisms” or “isomorphisms”, all of which are functions. This does not mean that a “morphism” in an arbitrary category is a function.

Axiom 2: Domain and codomain

  1. Each arrow has a domain and a codomain, each of which is an object of the category.
  2. The domain and the codomain of an arrow may or may not be the same object.
  3. Each arrow has only one domain and only one codomain.

Notes for Axiom 2

  • If $f$ is an arrow with domain $A$ and codomain $B$, that fact is typically shown either by the notation “$f:A\to B$” or by a diagram like this:
  • The notation “$f:A\to B$” is like that used for functions. This notation may be used in any category, but it does not imply that $f$ is a function or that $A$ and $B$ have elements.
  • For such an arrow, the notation “$\text{dom}(f)$” refers to $A$ and “$\text{cod}(f)$” refers to $B$.
  • For a given category $\mathsf{C}$, the collection of all the arrows with domain $A$ and codomain $B$ may be denoted by
    • “$\text{Hom}(A,B)$” or
    • “$\text{Hom}_\mathsf{C}(A,B)$” or
    • “$\mathsf{C}(A,B)$”.
  • Some newer books and articles in category theory use the name source for domain and target for codomain. This usage has the advantage that a newcomer to category theory will be less likely to think of an arrow as a function.

Axiom 3: Composition

  1. If $f$ and $g$ are arrows in a category for which $\text{cod}(f)=\text{dom}(g)$, as in this diagram:

    then there is a unique arrow with domain $A$ and codomain $C$ called the composite of $f$ and $g$.

Notes for Axiom 3

    diagra

  • An important metaphor for composition is: Every path of length 2 has exactly one composite.
  • The unique arrow required by Axiom 3 may be denoted by “$g\circ f$” or “$gf$”. “$g\circ f$” is more explicit, but “$gf$” is much more commonly used by category theorists.
  • Many constructions in categories may be shown by diagrams, like the one used just above.
  • The diagram

    is said to commute if $h=g\circ f$. The idea is that going along $f$ and then $g$ is the same as going along $h$.

  • It is customary in some texts in category theory to indicate that a diagram commutes by putting a gyre in the middle:
  • The concept of category is an abstraction of the idea of function, and the composition of arrows is an abstraction of the composition of functions. It uses the same notation, “$g\circ f$”. If $f$ and $g$ are set functions, then for an element $x$ in the domain of $f$, \[(g\circ f)(x)=g(f(x))\]
  • But in arbitrary category, it may make no sense to evaluate an arrow $f$ at some element $x$; indeed, the domain of $f$ may not have elements at all, and then the statement “$(g\circ f)(x)=g(f(x))$” is meaningless.

Axiom 4: Identity arrows

  1. For each object $A$ of a category, there is a unique arrow denoted by $\textsf{id}_A$.
  2. $\textsf{dom}(\textsf{id}_A)=A$ and $\textsf{cod}(\textsf{id}_A)=A$.
  3. For any object $B$ and any arrow $f:B\to A$, the diagram

    commutes.

  4. For any object $C$ and any arrow $g:A\to C$, the diagram

    commutes.

Notes for Axiom 4

  • The fact stated in Axiom 4(b) could be shown diagrammatically either as

    or as

  • Facts (c) and (d) can be written in algebraic notation: For any arrow $f$ going to $A$,\[\textsf{id}_A\circ f=f\]and for any arrow $g$ coming from $A$,\[g\circ \textsf{id}_A=g\]

Axiom 5: Associativity

  1. If $f$, $g$ and $h$ are arrows in a category for which $\text{cod}(f)=\text{dom}(g)$ and $\text{cod}(g)=\text{dom}(h)$, as in this diagram:

    then there is a unique arrow $k$ with domain $A$ and codomain $C$ called the composite of $f$, $g$ and $h$.

  2. In the diagram below, the two triangles containing $k$ must both commute.

Notes for Axiom 5

  • Axiom 5b requires that \[h\circ(g\circ f)=(h\circ g)\circ f\](which both equal $k$), which is the usual formula for associativity.
  • Note that the top two triangles commute by Axiom 3.
  • The associativity axiom means that we can get rid of parentheses and write \[k=h\circ f\circ g\]just as we do for addition and multiplication of numbers.
  • In my opinion the notation using categorical diagrams communicates information much more clearly than algebraic notation does. In particular, you don’t have to remember the domains and codomains of the functions — they appear in the picture. I admit that diagrams take up much more space, but now that we read math stuff on a computer screen instead of on paper, space is free.

Examples of categories

For the first three examples, I will give a detailed explanation about how they fit the definition of category.

Example 1: MyFin

This first example is a small, finite category which I have named $\mathsf{MyFin}$ (my very own finite category). It is not at all an important category, but it has advantages as a first example.

  • It’s small enough that you can see all the objects and arrows on the screen at once, but big enough not to be trivial.
  • The objects and arrows have no properties other than being objects and arrows. (The other examples involve familiar math objects.)
  • So in order to check that $\mathsf{MyFin}$ really obeys the axioms for a category, you can use only the skeletal information given here. As a result, you must really understand the axioms!

A correct proof will be based on axioms and theorems. The proof can be suggested by your intuitions, but intuitions are not enough. When working with $\mathsf{MyFin}$ you won’t have any intuitions!

A diagram for $\mathsf{MyFin}$

This diagram gives a partial description of $\mathsf{MyFin}$.

Now let’s see how to make the diagram above into a category.

Axiom 1

  • The objects of $\mathsf{MyFin}$ are $A$, $B$, $C$ and $D$.
  • The arrows are $f$, $g$, $h$, $j$, $k$, $r$, $s$, $u$, $v$, $w$ and $x$.
  • You can regard the letters just listed as names of the objects and arrows. The point is that at this stage all you know about the objects and arrows are their names.
  • If you prefer, you can think of the arrows as the actual arrows shown in the $\mathsf{MyFin}$ diagram.
  • Our definition of $\mathsf{MyFin}$ is an abstract definition. You may have seen multiplication tables of groups given in terms of undefined letters. (If you haven’t, don’t worry.) Those are also abstract definitions.
  • Most of our other definitions of categories involve math objects you actually know something about. They are like the definition of division, for example, where the math objects are integers.

Axiom 2

  • The domains and codomains of the arrows are shown by the diagram above.
  • For example, $\text{dom}(r)=A$ and $\text{cod}(r)=C$, and $\text{dom}(v)=\text{cod}(v)=B$.

Axiom 3

Showing the $\mathsf{MyFin}$ diagram does not completely define $\mathsf{MyFin}$. We must say what the composites of all the paths of length 2 are.

  • In fact, most of them are forced, but two of them are not.
  • We must have $g\circ f=r$ because $r$ is the only arrow possible for the composite, and Axiom 3 requires that every path of length 2 must have a composite.
  • For the same reason, $h\circ g=s$.
  • All the paths involving $u$, $v$, $w$ and $x$ are forced:

  • (p1) $u\circ u=u$, $v\circ v=v$, $w\circ w=w$ and $x\circ x=x$.
  • (p2) $f\circ u=f$, $r\circ u=r$, $j\circ u=j$ and $k\circ u=k$. You can see that, for example, $f\circ u=f$ by opening up the loop on $f$ like this:

    There is only one arrow going from $A$ to $B$, namely$f$, so $f$ has to be the composite $f\circ u$.

  • (p3) $v\circ f=f$, $g\circ v=g$ and $s\circ v=s$.
  • (p4) $w\circ g=g$, $w\circ r=r$ and $h\circ w=h$.
  • (p5) $x\circ h=h$, $x\circ s=s$, $x\circ j=j$ and $x\circ k=k$.

  • For $s\circ f$ and $h\circ r$, we have to choose between $j$ and $k$ as composites. Since $s\circ f=(h\circ g)\circ f$ and $h\circ r=h\circ (g\circ f)$, Axiom 3 requires that we must chose one of $j$ and $k$ to be both composites.

    Definition: $s\circ f=h\circ r=j$.

    If we had defined $s\circ f=h\circ r=k$ we would have a different category, although one that is “isomorphic” to $\mathsf{MyFin}$ (you have to define “isomorphic” or look it up.)

  • Axiom 4

    • It is clear from the $\mathsf{MyFin}$ diagram that for each object there is just one arrow that has that object both as domain and as codomain, as required by Axiom 4a.
    • The requirements in Axiom 4b and 4c are satisfied by statements (p1) through (p5).

    Axiom 5

    • Since we have already required both $(h\circ g)\circ f$ and $h\circ(g\circ f)$ to be $k$, composition is associative.

    Example 2: Set

    To be written.

    This will be a very different example, because it involves known mathematical objects — sets and functions. But there are still issues, for example the fact that the inclusion of $\{1,2\}$ into $\{1,2,3\}$ and the identity map on $\{1,2\}$ are two different arows in the category of sets.

    Example 3: IntegerDiv

    To be written.

    The objects are all the positive integers and there is an arrow from $m$ to $n$ if and only if $m$ divides $n$. So this example involves familiar objects and predicates, but the arrows are nevertheless not functions that take elements to elements. Integers don’t have elements. I would expect to show how the GCD of two integers is a limit.

    References

      Creative Commons License        

    This work is licensed under a Creative Commons Attribution-ShareAlike 2.5 License.

    Send to Kindle

    Improving abstractmath.org

    This post discusses some ideas I have for improving abstractmath.org.

    Handbook of mathematical discourse

    The Handbook was kind of a false start on abmath, and is the source of much of the material in abmath. It still contains some material not in abmath, parti­cularly the citations.

    By citations I mean lexicographical citations: examples of the usage from texts and scholarly articles.

    I published the Handbook of mathe­ma­tical discourse in 2003. The first link below takes you to an article that describes what the Handbook does in some detail. Briefly, the Handbook surveys the use of language in math (and some other things) with an emphasis on the problems it causes students. Its collection of citations of usage could some day could be the start of an academic survey of mathematical language. But don’t expect me to do it.

    Links

    The Handbook exists as a book and as two different web versions. I lost the TeX source of the Handbook a few years after I published the book, so none of the different web versions are perfect. Version 2 below is probably the most useful.

    1. Handbook of mathe­ma­tical discourse. Description.
    2. Handbook of mathe­ma­tical discourse. Hypertext version without pictures but with active internal links. Some links don’t work, but they won’t be repaired because I have lost the TeX input files.
    3. Handbook of mathe­ma­tical discourse. Paperback.
    4. Handbook of mathematical discourse. PDF version of the printed book, including illustrations and citations but without hyperlinks.
    5. Citations for the paperback version of the Handbook. (The hypertext version and the PDF version include the citations.)

    Abmath

    Soon after the Handbook was published, I started work on abstractmath.org, which I abbreviate as abmath. It is intended specifically for people beginning to study abstract math, which means roughly post-calculus. I hope their teachers will read it, too. I had noticed when I was teaching that many students hit a big bump when faced with abstraction, and many of them never recovered. They would typically move into another field, often far away from STEM stuff.

    Links

    These abmath articles give more detail about the purpose of this website and the thinking behind the way it is presented:

    Presentation of abmath

    Informal

    Abmath is written for students of abstract math and other beginners to tell them about the obstacles they may meet up with in learning abstract math. It is not a scholarly work and is not written in the style of a scholarly work. There is more detail about its style in my rant in Attitude.

    Scholarly works should not be written in the style of a scholarly work, either.

    Links

    To do:

    Every time I revise an article I find myself rewriting overly formal parts. Fifty years of writing research papers has taken its toll. I must say that I am not giving this informalization stuff very high priority, but I will continue doing it.

    No citations

    One major difference concerns the citations in the Handbook. I collected these in the late nineties by spending many hours at Jstor and looking through physical books. When I started abmath I decided that the website would be informal and aimed at students, and would contain few or no citations, simply because of the time it took to find them.

    Boxouts and small screens

    The Handbook had both sidebars on every page of the paper version containing a reference index to words on that page, and also on many pages boxouts with comments. It was written in TeX. I had great difficulty using TeX to control the placement of both the sidebars and especially the boxouts. Also, you couldn’t use TeX to let the text expand or contract as needed by the width of the user’s screen.

    Abmath uses boxouts but not sidebars. I wrote Abmath using HTML, which allows it to be presented on large or small screens and to have extensive hyperlinks.
    HTML also makes boxouts easy.

    The arrival of tablets and i-pods has made it desirable to allow an abmath page to be made quite narrow while still readable. This makes boxouts hard to deal with. Also I have gotten into the habit of posting revisions to articles on Gyre&Gimble, whose editor converts boxouts into inline boxes. That can probably be avoided.

    To do:

    I have to decide whether to turn all boxouts into inline small-print paragraphs the was you see them in this article. That would make the situation easier for people reading small screens. But in-line small-print paragraphs are harder to associate to the location you want them to refer, in contrast to boxouts.

    Abmath 2.0

    For the first few years, I used Microsoft Word with MathType, but was plagued with problems described in the link below. Then I switched to writing directly in HTML. The articles of abmath labeled “abstractmath.org 2.0” are written in this new way. This makes the articles much, much easier to update. Unfortunately, Word produces HTML that is extraordinarily complicated, so transforming them into abmath 2.0 form takes a lot of effort.

    Link

    Illustrations

    Abmath does not have enough illustrations and diagrams. Gyre&Gimble has many posts with static illustrations, some of them innovative. It also has some posts with interactive demos created with Mathematica. These demos require the reader to download the CDF Player, which is free. Unfortunately, it is available only for Windows, Mac and Linux, which precludes using them on many small devices.

    Links

    To do:

    • Create new illustrations where they might be useful, and mine Gyre&Gimble and other sources.
    • There are many animated GIFs out there in the math cloud. I expect many of them are licensed under Creative Commons so that I can use them.
    • I expect to experiment with converting some of the interactive CFD diagrams that are in Gyre&Gimble into animated GIFs or AVIs, which as far as I know will run on most machines. This will be a considerable improvement over static diagrams, but it is not as good as interactive diagrams, where you can have several sliders controlling different variables, move them back and forth, and so on. Look at Inverse image revisited. and “quintic with three parameters” in Demos for graph and cograph of calculus functions.

    Abmath content

    Language

    Abmath includes most of the ideas about language in the Handbook (rewritten and expanded) and adds a lot of new material.

    Links

    1. The languages of math. Article in abmath. Has links to the other articles about language.
    2. Syntactic and semantic thinkers. Gyre&Gimble post.
    3. Syntax trees in mathematicians’ brains. Gyre&Gimble post.
    4. A visualization of a computation in tree form.Gyre&Gimble post.
    5. Visible algebra I. Gyre&Gimble post.
    6. Algebra is a difficult foreign language. Gyre&Gimble post.
    7. Presenting binops as trees. Gyre&Gimble post.
    8. Moths to the flame of meaning. How linguistics students also have trouble with syntax.
    9. Varieties of mathematical prose, by Atish Bagchi and Charles Wells.

    To do:

    The language articles would greatly benefit from more illustrations. In parti­cular:

    • G&G contains several articles about using syntax trees (items 3, 4, 5 and 7 above) to understand algebraic expressions. A syntax tree makes the meaning of an algebraic expression much more transparent than the usual one-dimensional way of writing it.
    • Several items in the abmath article More about the language of math, for example the entries on parenthetic assertions and postconditions could benefit from a diagrammatic representation of the relation between phrases in a sentence and semantics (or how the phrases are spoken).
    • The articles on Names and Alphabets could benefit from providing spoken pronunciations of many words. But what am I going to do about my southern accent?
    • The boxed example of change in context as you read a proof in More about the language of math could be animated as you click through the proof. *Sigh* The prospect of animating that example makes me tired just thinking about it. That is not how grasshoppers read proofs anyway.

    Understanding and doing math

    Abmath discusses how we understand math and strategies for doing math in some detail. This part is based on my own observations during 35 years of teaching, as well as extensive reading of the math ed literature. The math ed literature is usually credited in footnotes.

    Links

    Math objects and math structures

    Understanding how to think about mathematical objects is, I believe, one of the most difficult hurdles newbies have to overcome in learning abstract math. This is one area that the math ed community has focused on in depth.

    The first two links below are take you to the two places in abmath that discuss this problem. The first article has links to some of the math ed literature.

    Links

    To do: Everything is a math object

    An important point about math objects that needs to be brought out more is that everything in math is a math object. Obviously math structures are math objects. But the symbol “$\leq$” in the statement “$35\leq45$” denotes a math object, too. And a proof is a math object: A proof written on a blackboard during a lecture does not look like it is an instance of a rigorously defined math object, but most mathe­maticians, including me, believe that in principle such proofs can be transformed into a proof in a formal logical system. Formal logics, such as first order logic, are certainly math objects with precise mathematical definitions. Definitions, math expressions and theorems are math objects, too. This will be spelled out in a later post.

    To do: Bring in modern ideas about math structure

    Classically, math structures have been presented as sets with structure, with the structure being described in terms of subsets and functions. My chapter on math structures only waves a hand at this. This is a decidedly out-of-date way of doing it, now that we have category theory and type theory. I expect to post about this in order to clarify my thinking about how to introduce categorical and type-theoretical ideas without writing a whole book about it.

    Particular math structures

    Abmath includes discussions
    of the problems students have with certain parti­cular types of structures. These sections talk mostly about how to think about these structure and some parti­cular misunder­standings students have at the most basic levels.

    These articles are certainly not proper intro­ductions to the structures. Abmath in general is like that: It tells students about some aspects of math that are known to cause them trouble when they begin studying abstract math. And that is all it does.

    Links

    To do:

    • I expect to write similar articles about groups, spaces and categories.
    • The idea about groups is to mention a few things that cause trouble at the very beginning, such as cosets, quotients and homomorphisms (which are all obviously related to each other), and perhaps other stumbling blocks.
    • With categories the idea is to stomp on misconceptions such as that the arrows have to be functions and to emphasize the role of categories in allowing us to define math structures in terms of their relations with other objects instead of in terms with sets.
    • I am going to have more trouble with spaces. Perhaps I will show how you can look at the $\epsilon$-$\delta$ definition of continuous functions on the reals and “discover” that they imply that inverse images of open sets are open, thus paving the way for the family-of-subsets definition of a topoogy.
    • I am not ruling out other particular structures.

    Proofs

    This chapter covers several aspects of proofs that cause trouble for students, the logical aspects and also the way proofs are written.

    It specifically does not make use of any particular symbolic language for logic and proofs. Some math students are not good at learning languages, and I didn’t see any point in introducing a specific language just to do rudimentary discussions about proofs and logic. The second link below discusses linguistic ability in connection with algebra.

    I taught logic notation as part of various courses to computer engineering students and was surprised to discover how difficult some students found using (for example) $p+q$ in one course and $p\vee q$ in another. Other students breezed through different notations with total insouciance.

    Links

    To do:

    Much of the chapter on proofs is badly written. When I get around to upgrading it to abmath 2.0 I intend to do a thorough rewrite, which I hope will inspire ideas about how to conceptually improve it.

    Creative Commons License

    This work is licensed under a Creative Commons Attribution-ShareAlike 2.5 License.


    Send to Kindle

    Conceptual blending

    This post uses MathJax.  If you see formulas in unrendered TeX, try refreshing the screen.

    A conceptual blend is a structure in your brain that connects two concepts by associating part of one with part of another.  Conceptual blending is a major tool used by our brain to understand the world.

    The concept of conceptual blend includes special cases, such as representations, images and conceptual metaphors, that math educators have used for years to understand how mathematics is communicated and how it is learned.  The Wikipedia article is a good starting place for understanding conceptual blending. 

    In this post I will illustrate some of the ways conceptual blending is used to understand a function of the sort you meet with in freshman calculus.  I omit the connections with programs, which I will discuss in a separate post.

    A particular function

    Consider the function $h(t)=4-(t-2)^2$. You may think of this function in many ways.

    FORMULA:

    $h(t)$ is defined by the formula $4-(t-2)^2$.

    • The formula encapsulates a particular computation of the value of $h$ at a given value $t$.
    • The formula defines the function, which is a stronger statement than saying it represents the function.
    • The formula is in standard algebraic notation. (See Note 1)
    • To use the formula requires one of these:
      • Understand and use the rules of algebra
      • Use a calculator
      • Use an algebraic programming language. 
    • Other formulas could be used, for example $4t-t^2$.
      • That formula encapsulates a different computation of the value of $h$.

    TREE: 

    $h(t)$ is also defined by this tree (right).
    • The tree makes explicit the computation needed to evaluate the function.
    • The form of the tree is based on a convention, almost universal in computing science, that the last operation performed (the root) is placed at the top and that evaluation is done from bottom to top.
    • Both formula and tree require knowledge of conventions.
    • The blending of formula and tree matches some of the symbols in the formula with nodes in the tree, but the parentheses do not appear in the tree because they are not necessary by the bottom-up convention.
    • Other formulas correspond to other trees.  In other words, conceptually, each tree captures not only everything about the function, but everything about a particular computation of the function.
    • More about trees in these posts:

    GRAPH:

    $h(t)$ is represented by its graph (right). (See note 2.)

    • This is the graph as visual image, not the graph as a set of ordered pairs.
    • The blending of graph and formula associates each point on the (blue) graph with the value of the formula at the number on the x-axis directly underneath the point.
    • In contrast to the formula, the graph does not define the function because it is a physical picture that is only approximate.
    • But the formula does represent the function.  (This is "represents" in the sense of cognitive psychology, but not in the mathematical sense.)
    • The blending requires familiarity with the conventions concerning graphs of functions. 
    • It sets into operation the vision machinery of your brain, which is remarkably elaborate and powerful.
      • Your visual machinery allows you to see instantly that the maximum of the curve occurs at about $t=2$. 
    • The blending leaves out many things.
      • For one, the graph does not show the whole function.  (That's another reason why the graph does not define the function.)
      • Nor does it make it obvious that the rest of the graph goes off to negative infinity in both directions, whereas that formula does make that obvious (if you understand algebraic notation).  

    GEOMETRIC

    The graph of $h(t)$ is the parabola with vertex $(2,4)$, directrix $x=2$, and focus $(2,\frac{3}{4})$. 

    • The blending with the graph makes the parabola identical with the graph.
    • This tells you immediately (if you know enough about parabolas!) that the maximum is at $(2,4)$ (because the directrix is vertical).
    • Knowing where the focus and directrix are enables you to mechanically construct a drawing of the parabola using a pins, string, T-square and pencil.  (In the age of computers, do you care?)

    HEIGHT:

    $h(t)$ gives the height of a certain projectile going straight up and down over time.

    • The blending of height and graph lets you see instantly (using your visual machinery) how high the projectile goes. 
    • The blending of formula and height allows you to determing the projectile's velocity at any point by taking the derivative of the function.
    • A student may easily be confused into thinking that the path of the projectile is a parabola like the graph shown.  Such a student has misunderstood the blending.

    KINETIC:

    You may understand $h(t)$ kinetically in various ways.

    • You can visualize moving along the graph from left to right, going, reaching the maximum, then starting down.
      • This calls on your experience of going over a hill. 
      • You are feeling this with the help of mirror neurons.
    • As you imagine traversing the graph, you feel it getting less and less steep until it is briefly level at the maximum, then it gets steeper and steeper going down.
      • This gives you a physical understanding of how the derivative represents the slope.
      • You may have seen teachers swooping with their hand up one side and down the other to illustrate this.
    • You can kinetically blend the movement of the projectile (see height above) with the graph of the function.
      • As it goes up (with $t$ increasing) the projectile starts fast but begins to slow down.
      • Then it is briefly stationery at $t=2$ and then starts to go down.
      • You can associate these feelings with riding in an elevator.
        • Yes, the elevator is not a projectile, so this blending is inaccurate in detail.
      • This gives you a kinetic understanding of how the derivative gives the velocity and the second derivative gives the acceleration.

    OBJECT:

    The function $h(t)$ is a mathematical object.

    • Usually the mental picture of function-as-object consists of thinking of the function as a set of ordered pairs $\Gamma(h):=\{(t,4-(t-2)^2)|t\in\mathbb{R}\}$. 
    • Sometimes you have to specify domain and codomain, but not usually in calculus problems, where conventions tell you they are both the set of real numbers.
    • The blend object and graph identifies each point on the graph with an element of $\Gamma(h)$.
    • When you give a formal proof, you usually revert to a dry-bones mode and think of math objects as inert and timeless, so that the proof does not mention change or causation.
      • The mathematical object $h(t)$ is a particular set of ordered pairs. 
      • It just sits there.
      • When reasoning about something like this, implication statements work like they are supposed to in math: no causation, just picking apart a bunch of dead things. (See Note 3).
      • I did not say that math objects are inert and timeless, I said you think of them that way.  This post is not about Platonism or formalism. What math objects "really are" is irrelevant to understanding understanding math [sic].

    DEFINITION

    definition of the concept of function provides a way of thinking about the function.

    • One definition is simply to specify a mathematical object corresponding to a function: A set of ordered pairs satisfying the property that no two distinct ordered pairs have the same second coordinate, along with a specification of the codomain if that is necessary.
    • A concept can have many different definitions.
      • A group is usually defined as a set with a binary operation, an inverse operation, and an identity with specific properties.  But it can be defined as a set with a ternary operation, as well.
      • A partition of a set is a set of subsets of a set with certain properties. An equivalence relation is a relation on a set with certain properties.  But a partition is an equivalence relation and an equivalence relation is a partition.  You have just picked different primitives to spell out the definition. 
      • If you are a beginner at doing proofs, you may focus on the particular primitive objects in the definition to the exclusion of other objects and properties that may be more important for your current purposes.
        • For example, the definition of $h(t)$ does not mention continuity, differentiability, parabola, and other such things.
        • The definition of group doesn't mention that it has linear representations.

    SPECIFICATION

    A function can be given as a specification, such as this:

    If $t$ is a real number, then $h(t)$ is a real number, whose value is obtained by subtracting $2$ from $t$, squaring the result, and then subtracting that result from $4$.

    • This tells you everything you need to know to use the function $h$.
    • It does not tell you what it is as a mathematical object: It is only a description of how to use the notation $h(t)$.

    Notes

    1. Formulas can be give in other notations, in particular Polish and Reverse Polish notation. Some forms of these notations don't need parentheses.

    2. There are various ways to give a pictorial image of the function.  The usual way to do this is presenting the graph as shown above.  But you can also show its cograph and its endograph, which are other ways of representing a function pictorially.  They  are particularly useful for finite and discrete functions. You can find lots of detail in these posts and Mathematica notebooks:

    3. See How to understand conditionals in the abstractmath article on conditionals.

    References

    1. Conceptual blending (Wikipedia)
    2. Conceptual metaphors (Wikipedia)
    3. Definitions (abstractmath)
    4. Embodied cognition (Wikipedia)
    5. Handbook of mathematical discourse (see articles on conceptual blendmental representationrepresentation, and metaphor)
    6. Images and Metaphors (article in abstractmath)
    7. Links to G&G posts on representations
    8. Metaphors in Computing Science (previous post)
    9. Mirror neurons (Wikipedia)
    10. Representations and models (article in abstractmath)
    11. Representations II: dry bones (article in abstractmath)
    12. The transition to formal thinking in mathematics, David Tall, 2010
    13. What is the object of the encapsulation of a process? Tall et al., 2000.

     

    Send to Kindle

    Stances

    Philosophy

    With the help of some colleagues, I am beginning to understand why I am bothered by most discussions of the philosophy of math.  Philosophers have a stance. Examples:

    • "Math objects are real but not physical."
    • "Mathematics consists of statements" (deducible from axioms, for example).
    • "Mathematics consists of physical activity in the brain."

    And so on.  They defend their stances, and as a result of arguments occasionally refine them.  Or even change them radically.  The second part of this post talks about these three stances in a little more detail.

    I have a different stance:  I want to gain a scientific understanding of the craft of doing math.

    Given this stance, I don't understand how the example statements above help a scientific understanding.    Why would making a proclamation (taking a stance) whose meaning needs to be endlessly dissected help you know what math really is?

    In fact if you think about (and argue with others about) any of the three, you can (and people have) come up with lots of subtle observations.  Now, some of those observations may in fact give you a starting point towards a scientific investigation, so taking stances may have some useful results.  But why not start with the specific observations?

    Observe yourself and others doing math, noticing

    • specific behaviors that give you forward progress,
    • specific confusions that inhibit progress,
    • unwritten rules (good and bad) that you follow without noticing them,
    • intricate interactions beneath the surface of discourse about math,

    and so on.  This may enable you to come up with scientifically testable claims about what happens when doing math.  A lot of work of this sort has already been done, and it is difficult work since much of doing math goes on in our brains and in our interactions with other mathematicians (among other things) without anyone being aware of it.   But it is well worth doing.

    But you may object:  "I don't want to take your stance! I want to know what math really is."  Well, can we reliably find out anything about math in any way other than through scientific investigation?   [The preceding statement is not a stance, it is a rhetorical question.]

    Analysis of three straw men

    The three stances at the beginning of the post are not the only possible ones, so you may object that I have come up with some straw men that are easy to ridicule.  OK, come up with another stance and I will analyze it as well!

    "I think math objects are real but not physical."  There are lots of ways of defining "real", but you have to define it in order to investigate the question scientifically.  My favorite is "they have consistent and repeated behavior" like physical objects, and this behavior causes specific modules in the brain that deal with physical objects to deal with math objects in an efficient way.  If you write two or three paragraphs about consistent and repeated behavior that make testable claims then you have a start towards scientifically understanding something about math.   But why talk about "real"?  Isn't "consistent and repeated behavior" more explicit?  (Making it more explicit it makes it easier to find fault with it and modify it or throw it out.  That's science.)

    "Mathematics consists of statements".  Same kind of remark:  Define "statement".  (A recursively defined string of symbols?  An assertion with specific properties?)  Philosophers have thought about this a bunch.  So have logicians and computer scientists.  The concept of statement has really deep issues.  You can't approach the question of whether math "is" a bunch of statements until you get into those issues.  Of course, when you do you may come up with specific testable claims that are worth looking into.   But it seems to me that this sort of thinking has mostly resulted in people thinking philosophy of math is merely a matter of logic and set theory.  That point of view has been ruinous to the practice of math.

    "Mathematics consists of physical patterns in the brain."   Well, physical events in the brain are certainly associated with doing math, and they are worth finding out about.  (Some progress has already been made.)  But what good is the proclamation: "Math consists of activity in the brain".   What does that mean?  Math "is" math texts and mathematical conversations as well as activity in the brain.   If you want to claim that the brain activity is somehow primary, that may be defendable, but you have to say how it is primary and what its relations are with written and oral discourse.  If you succeed in doing that, the statement "Math consists of activity in the brain" becomes superfluous.

    Send to Kindle

    Constraints on the Philosophy of Mathematics

    In a recent blog post I described a specific way in which neuroscience should constrain the philosophy of math. For example, many mathematicians who produce a new kind of mathematical object feel they have discovered something new, so they may believe that mathematical objects are created rather than eternally existing. But identifying something as newly created is presumably the result of a physical process in the brain. So the feeling that an object is new is only indirectly evidence that the object is new.  (Our pattern recognition devices work pretty well with respect to physical objects so that feeling is indeed indirect evidence.)

    This constraint on philosophy is not based on any discovery that there really is a process in the brain devoted to recognizing new things. (Déjà vu is probably the result of the opposite process.) It’s just that neuroscience has uncovered very strong evidence that mental events like that are based on physical processes in the brain. Because of that work on other processes, if someone claims that recognizing newness is not based on a physical process in the brain, the burden of proof is on them.  In particular, they have to provide evidence that recognizing that a mathematical object is newly discovered says something about math other than what happened in your brain.

    Of course, it will be worthwhile to investigate how the feeling of finding something new arises in the brain in connection with mathematical objects. Understanding the physical basis for how the brain does math has the potential of improving math education, although that may be years down the road.

    Send to Kindle

    Math and the Modules of the Mind

    I have written (references below) about the way we seem to think about math objects using our mind’s mechanisms for thinking about physical objects. What I want to do in this post is to establish a vocabulary for talking about these ideas that is carefully enough defined that what I say presupposes as little as possible about how our mind behaves. (But it does presuppose some things.) This is roughly like Gregor Mendel’s formulation of the laws of inheritance, which gave precise descriptions of how characteristics were inherited while saying nothing at all about the mechanism.

    I will use module as a name for the systems in the mind that perform various tasks.

    Examples of modules

    a) We have an “I’ve seen this before module” that I talked about here.

    b) When we see a table, our mind has a module that recognizes it as a table, a module that notes that it is nearby, and in particular a module that notes that it is a physical object. The physical-object module is connected to many other modules, including for example expectations of what we would feel if we touched it, and in particular connections to our language-producing module that has us talk about it in a certain way (a table, the table, my table, and so on.)

    c) We also have a module for abstract objects. Abstract objects are discussed in detail in the math objects chapter of abstractmath.org. A schedule is an abstract object, and so is the month of November. They are not mathematical objects because they affect people and change over time. (More about this here.) For example, the statement “it is now November” is true sometimes and false sometimes. Abstract objects are also not abstractions, like “beauty” and “love” which are not thought of as objects.

    d) We talk about numbers in some ways like we talk about physical objects. We say “3 is a number”. We say “I am thinking of the only even prime”. But if we point and say, “Look, there is a 3”, we know that we have shifted ground and are talking about, not the number 3, but about a physical representation of the number 3. That’s because numbers trigger our abstract object module and our math object module, but not our physical object module. (Back and fill time: if you are not a mathematician, your mind may not have a math object module. People are not all the same.)

    More about modules

    My first choice for a name for these systems would have been object, as in object-oriented programming, but this discussion has too many things called objects already. Now let’s clear up some possible misconceptions:

    e) I am talking about a module of the mind. My best guess would be that the mind is a function of the brain and its relationship with the world, but I am not presuppposing that. Whatever the mind is, it obviously has a system for recognizing that something is a physical object or a color or a thought or whatever. (Not all the modules are recognizers; some of them initiate actions or feelings.)

    f) It seems likely that each module is a neuron together with its connections to other neurons, with some connections stronger than others (our concepts are fuzzy, not Boolean). But maybe a module is many neurons working together. Or maybe it is like a module in a computer program, that is instantiated anew each time it is called, so that a module does not have a fixed place in the brain. But it doesn’t matter. A module is whatever it is that carries out a particular function. Something has to carry out such functions.

    Math objects

    The modules in a mathematician’s mind that deal with math objects use some of the same machinery that the mind uses for physical objects.

    g) You can do things to them. You can add two numbers. You can evaluate a function at an input. You can take the derivative of some functions.

    h) You can discover properties of some kinds of math objects. (Every differentiable function is continuous.)

    i) Names of some math objects are treated as proper nouns (such as “42”) and others as common nouns (such as “a prime”.)

    I maintain that these phenomena are evidence that the systems in your mind for thinking about physical objects are sometimes useful for thinking about math objects.

    Different ways of thinking about math objects.

    j) You can construct a mathematical object that is new to you. You may feel that you invented it, that it didn’t exist before you created it. That’s your I just created this module acting. If you feel this way, you may think math is constantly evolving.

    k) Many mathematicians feel that math objects are all already there. That’s a module that recognizes that math objects don't come into or go out of existence.

    l) When you are trying to understand math objects you use all sorts of physical representations (graphs, diagrams) and mental representations (metaphors, images). You say things like, “This cubic curve goes up to positive infinity in the negative direction” and “This function vanishes at 2” and “Think of a Möbius strip as the unit square with two parallel sides identified in the reverse direction.”

    m) When you are trying to prove something about math objects mathematicians generally think of math objects as eternal and inert (not affecting anything else). For example, you replace “the slope of the secant gets closer and closer to the slope of the tangent” by an epsilon-delta argument in which everything you talk about is treated as if it is unchanging and permanent. (See my discussion of the rigorous view.)

    Consequences

    When you have a feeling of déjà vu, it is because something has triggered your “I have seen this before” module (see (a)). It does not mean you have seen it before.

    When you say “the number 3” is odd, that is a convenient way of talking about it (see (d) above), but it doesn’t mean that there is really only one number three.

    If you say the function x^2 takes 3 to 9 it doesn’t have physical consequences like “Take me to the bank” might have. You are using your transport module but in a pretend way (you are using the pretend module!).

    When you think you have constructed a new math object (see (j)), your mental modules leave you feeling that the object didn’t exist before. When you think you have discovered a new math object (see (k)), your modules leave you feeling that it did exist before. Neither of those feelings say anything about reality, and you can even have both feelings at the same time.

    When you think about math objects as eternal and inert (see (m)) you are using your eternal and inert modules in a pretend way. This does not constitute an assertion that they are eternal and inert.

    Is this philosophy?

    My descriptions of how we think about math are testable claims about the behavior of our mind, expressed in terms of modules whose behavior I (partially) specify but whose nature I don’t specify. Just as Mendel’s Laws turned out to be explained by the real behavior of chromosomes under meiosis, the phenomena I describe may someday turn out to be explained by whatever instantiation the modules actually have – except for those phenomena that I have described wrongly, of course – that is what “testable” means!

    So what I am doing is science, not philosophy, right?

    Now my metaphor-producing module presents the familiar picture of philosophy and science as being adjacent countries, with science intermittently taking over pieces of philosophy’s territory…

    Links to my other articles in this thread

    Math objects in abstractmath.org
    Mathematical objects are “out there”?
    Neurons and math
    A scientific view of mathematics (has many references to what other people have said about math objects)
    Constructivism and Platonism

    Send to Kindle