Tag Archives: metaphor

A tiny step towards killing string-based math

I discussed endographs of real functions in my post  Endographs and cographs of real functions.  Endographs of finite functions also provide another way of thinking about functions, and I show some examples here.  This is not a new idea; endographs have appeared from time to time in textbooks, but they are not used much, and they have the advantage of revealing some properties of a function instantly that cannot be seen so easily in a traditional graph or cograph.

In contrast to endographs of functions on the real line, an endograph of a finite function from a set to itself contains all the information about the function.  For real functions, only some of the arrows can be shown; you are dependent on continuity to interpolate where the infinite number of intermediate arrows would be, and of course, it is easy to produce a function, with, say, small-scale periodicity, that the arrows would miss, so to speak.  But with an endograph of a finite function, WYSIATI (what you see is all there is).

Here is the endograph of a function.  It is one function.  The graph has four connected components.

You can see immediately that it is a permutation  of the set \{1,2,3,4,5,6\}, and that it is involution (a permutation f for which f f=\text{id}).  In cycle notation, it is the permutation (1 2)(5 6), and the connected components of the endograph correspond to the cycle structure.

Here is another permutation:

You can see that to get f^n=\text{id} you would have to have n=6, since you have to apply the 3-cycle 3 times and the transposition twice to get the identity.   The cycle structure (1 2 4)(0 3) tells you this, but you have to visualize it acting to see that.  The endograph gives the newbie a jumpstart on the visualization.  “The power to understand and predict the quantities of the world should not be restricted to those with a freakish knack for manipulating abstract symbols” (Brett Victor).   This is an argument for insisting that this permutation is the endograph, and the abstract string of symbols (1 2 4)(0 3) is a representation of secondary importance.  [See Note 1.]

Here is the cograph of the same function.  It requires a bit of visualization or tracing arrows around to see its cycle structure.

If I had rearranged the nodes like this

the cycle structure would be easier to see.  This does not indicate as much superiority of the endograph metaphor over the cograph metaphor as you might think:  My endograph code [Note 2] uses Mathematica’s graph-displaying algorithm, which automatically shows cycles clearly.   The cograph code that I wrote specifies the placement of the nodes explicitly, so I rearranged them to obtain the second cograph above using my knowledge of the cycle structure.

The following endographs of functions that are not permutations exhibit the general fact that the graph of a finite function consists of cycles with trees attached.   This structure is obvious from the endographs, and it is easy to come up with a proof of this property of finite functions by tracing your finger around the endographs.

This is the endograph of the polynomial 2 n^9+5 n^8+n^7+4 n^6+9 n^5+1 over the finite field of 11 elements.

Here is another endograph:

I constructed this explicitly by writing a list of rules, and then used Mathematica’s interpolating polynomial to determine that it is given by the polynomial

6 x^{16}+13 x^{15}+x^{14}+3 x^{13}+10 x^{12}+5  x^{11}\\ +14 x^{10}+4 x^9+9 x^8+x^7+14 x^6\\ +15  x^5+16 x^4+14 x^3+4 x^2+15 x+11

in GF[17].

Quite a bit is known about polynomials over finite fields that give permutations.  For example there is an easy proof using interpolating polynomials that a polynomial that gives a transposition must have degree q-2.  The best reference for this stuff is Lidl and Niederreiter, Introduction to Finite Fields and their Applications

The endographs above raise questions such as what can you say about the degree or coefficients of a polynomial that gives a digraph like the function f below that is idempotent (f f=f).  Students find idempotence vs. involution difficult to distinguish between.  Digraphs show you almost immediately what is going on.  Stare at the digraph below for a bit and you will see that if you follow f to a node and then follow  it again you stay where you are (the function is the identity on its image).  That’s another example of the insights you can get from a new metaphor for a mathematical object.

The following function is not idempotent even though it has only trivial loops.  But the digraph does tell you easily that it satisfies f^4=f^3.

Notes

[1] Atish Bagchi and I have contributed to this goal in Graph Based Logic and Sketches, which gives a bare glimpse of the possibility of considering that the real objects of logic are diagrams and their limits and morphisms between them, rather than hard-to-parse strings of letters and logical symbols.  Implementing this (and implementing Brett Victor’s ideas) will require sophisticated computer support.  But that support is coming into existence.  We won’t have to live with string-based math forever.

[2] The Mathematica notebook used to produce these pictures is here.  It has lots of other examples.

Send to Kindle

Endograph and cograph of real functions

This post is covered by the Creative Commons Attribution – ShareAlike 3.0 License, which means you may use, adapt and distribute the work provided you follow the requirements of the license.

Introduction

In the article Functions: Images and Metaphors in abstractmath I list a bunch of different images or metaphors for thinking about functions. Some of these metaphors have realizations in pictures, such as a graph or a surface shown by level curves. Others have typographical representations, as formulas, algorithms or flowcharts (which are also pictorial). There are kinetic metaphors — the graph of {y=x^2} swoops up to the right.

Many of these same metaphors have realizations in actual mathematical representations.

Two images (mentioned only briefly in the abstractmath article) are the cograph and the endograph of a real function of one variable. Both of these are visualizations that correspond to mathematical representations. These representations have been used occasionally in texts, but are not used as much as the usual graph of a continuous function. I think they would be useful in teaching and perhaps even sometimes in research.

A rough and unfinished Mathematica notebook is available that contains code that generate graphs and cographs of real-valued functions. I used it to generate most of the examples in this post, and it contains many other examples. (Note [1].)

The endograph of a function

In principle, the endograph (Note [2]) of a function {f} has a dot for each element of the domain and of the codomain, and an arrow from {x} to {f(x)} for each {x} in the domain. For example, this is the endograph of the function {n\mapsto n^2+1 \pmod 11} from the set {\{0,1,\ldots,10\}} to itself:


“In principle” means that the entire endograph can be shown only for small finite functions. This is analogous to the way calculus books refer to a graph as “the graph of the squaring function” when in fact the infinite tails are cut off.

Real endographs

I expect to discuss finite endographs in another post. Here I will concentrate on endographs of continuous functions with domain and codomain that are connected subsets of the real numbers. I believe that they could be used to good effect in teaching math at the college level.

Here is the endograph of the function {y=x^2} on the reals:

I have displayed this endograph with the real line drawn in the usual way, with tick marks showing the location of the points on the part shown.

The distance function on the reals gives us a way of interpreting the spacing and location of the arrowheads. This means that information can be gleaned from the graph even though only a finite number of arrows are shown. For example you see immediately that the function has only nonnegative values and that its increase grows with {x}.(See note [3]).

I think it would be useful to show students endographs such as this and ask them specific questions about why the arrows do what they do.

For the one shown, you could ask these questions, probably for class discussion rather that on homework.

  • Explain why most of the arrows go to the right. (They go left only between 0 and 1 — and this graph has such a coarse point selection that it shows only two arrows doing that!)
  • Why do the arrows cross over each other? (Tricky question — they wouldn’t cross over if you drew the arrows with negative input below the line instead of above.)
  • What does it say about the function that every arrowhead except two has two curves going into it?

Real Cographs

The cograph (Note [4] of a real function has an arrow from input to output just as the endograph does, but the graph represents the domain and codomain as their disjoint union. In this post the domain is a horizontal representation of the real line and the codomain is another such representation below the domain. You may also represent them in other configurations (Note [5]).

Here is the cograph representation of the function {y=x^2}. Compare it with the endograph representation above.

Besides the question of most arrows going to the right, you could also ask what is the envelope curve on the left.

More examples

Absolute value function

Arctangent function

Notes

[1] This website contains other notebooks you might find useful. They are in Mathematica .nb, .nbp, or .cdf formats, and can be read, evaluated and modified if you have Mathematica 8.0. They can also be made to appear in your browser with Wolfram CDF Player, downloadable free from Wolfram site. The CDF player allows you to operate any interactive demos contained in the file, but you can’t evaluate or modify the file without Mathematica.

The notebooks are mostly raw code with few comments. They are covered by the Creative Commons Attribution – ShareAlike 3.0 License, which means you may use, adapt and distribute the code following the requirements of the license. I am making the files available because I doubt that I will refine them into respectable CDF files any time soon.

[2] I call them “endographs” to avoid confusion with the usual graphs of functions — — drawings of (some of) the set of ordered pairs {x,f(x)} of the function.

[3] This is in contrast to a function defined on a discrete set, where the elements of the domain and codomain can be arranged in any old way. Then the significance of the resulting arrangement of the arrows lies entirely in which two dots they connect. Even then, some things can be seen immediately: Whether the function is a cycle, permutation, an involution, idempotent, and so on.

Of course, the placement of the arrows may tell you more if the finite sets are ordered in a natural way, as for example a function on the integers modulo some integer.

[4] The text [1] uses the cograph representation extensively. The word “cograph” is being used with its standard meaning in category theory. It is used by graph theorists with an entirely different meaning.

[5] It would also be possible to show the domain codomain in the usual {x-y} plane arrangement, with the domain the {x} axis and the codomain the {y} axis. I have not written the code for this yet.

References

[1] Sets for Mathematics, by F. William Lawvere and Robert Rosebrugh. Cambridge University Press, 2003.

[2] Martin Flashman’s website contains many exampls of cographs of functions, which he calls mapping diagrams.

Send to Kindle

Mental, Physical and Mathematical Representations

For a given mathematical object, a mathematician may have:

  • A mental representation of the object. This can be a metaphor, a mental image, or a kinesthetic understanding of the object.
  • A physical representation of the object. This may be a (physical) picture or drawing or three-dimensional model of the object.
  • A mathematical definition and one or more mathematical representations for the object. Such a representation is itself a mathematical object.

The boldface things in this list are related to each other in lots of ways, and they are fuzzy and overlap and don’t include every phenomenon connected with a math object.

I have written about these things ([1], [2], [3], [4]). So have lots of other people. In this post I summarize these ideas. I expect to write about particular examples later on and will use this as a reference.

Two Examples

The following examples point out a few of the relationships between the ideas in boldface above. There is much more to understand.

Function as black box

The idea that a function is a black box or machine with input and output is a metaphor for a function.

A is a metaphor for B means that A and B are cognitively pasted together in such a way that the behavior of A is in many ways like the behavior of B. Such a thing is both useful and dangerous, dangerous because there will be ways in which A behaves that suggest inappropriate ideas about B.

The function as machine is a good metaphor: for example functional composition involves connecting the output of one machine to the input of another, and the inverse function is like running the machine backward.

The function as machine is a bad metaphor: For example, it is wrong to think you could build a machine to calculate any given function exactly. But you can still imagine such a machine, given by a specification (it outputs the value of the function at a given input) and then, in your imagination, connecting the input of one to the output of another must perforce calculate the composite of the corresponding functions.

Like any metaphor, this is a mental representation. That means the metaphor has a physical instantiation in your brain. So a metaphor has a physical representation.

Different people won’t have quite the same concept of a particular metaphor. So a metaphor will have lots of slightly different physical representations, but mathematicians form a community, and communication between mathematicians fine-tunes the different physical instantiations so that they correspond more closely to each other. This is the sense in which mathematical objects have a shared existence in a community as Reuben Hersh has suggested.

A function is a mathematical object, which can be rigorously specified as a set of ordered pairs together with a domain and a codomain. There is a cognitive relationship between the concepts of function as math object and function as black box with input and output.

Triangle

A triangle can be drawn, or created on a computer and a physical image printed out. You may also have a mental image of the triangle.

The physical and the mental images are not the same thing, but they are definitely related. The relationship is mediated by the neuronal circuitry behind your retinas, which performs a highly sophisticated transformation of the pixels on your retina into an organized physical structure in your brain, connected to various other neurons.

This circuitry exists because it helps us get a useful understanding of the world through our eyes. So a picture of a triangle takes advantage of pre-existing neuron structure to generate a useful mental representation that helps us understand and prove things about triangles.

This mental representation also lives in a community of mathematician. Like any community, it has subgroups with “dialects” — varying understanding of representation.

For example, a mathematician who looks at the triangle below sees a triangle that looks like a right triangle. A student sees a triangle that is a right triangle.

This is “sees” in the sense of what their brain reports after all that processing. The mathematician’s brain connects the “triangle I am seeing” module (in their brain) to the “looks like a right triangle” module, but does not connect it to the “is a right triangle” module because they don’t see any statement in the surrounding text that it is a right triangle. The student, on the other hand, fallaciously makes the connection to “is a right triangle” directly.

In some sense, a student who does not make that connection directly is already a mathematician.

A triangle also exists as a mathematical object in your and my brain. It is described by a formal mathematical definition. The pictures of triangles you see above do not fit this definition. For one thing, the line segments in the pictures have thickness. But the pictures trigger a reaction in your neurons that causes your brain to cognitively paste together the line segments in the drawing to the segments required by the formal definition. This is a kind of metaphor of concrete-to-abstract that connects drawings to math objects that mathematicians use all the time.

Note that this “concrete-to-abstract metaphor” itself has a physical existence in your brain.  It drops, for example, the property of thickness that the line segments in the drawing have when matching them (in the metaphor) with the line segments in the corresponding abstract triangle.  On the other hand, it preserves the sense the all three angles in the triangle are acute.  The abstract mathematical concept of triangle (the generic triangle) has no requirement on the angles except that they add up to pi.

Summary

The discussions above describe a few of the complex and subtle relationships that exist between

  • Mental representations of math objects
  • Physical representations of math objects
  • Formally defined math objects and their formally defined representations.

I have purported to discuss how mathematics is understood (especially in connection with language) in several articles and a book but only a few of the relationships I just described are mentioned in any of those articles. Perhaps one or two things I said caused you to react: “Actually, that’s obviously true but I never thought of it before”. (Much the way I had mathematicians in the ’60’s tell me, “I see what you mean that addition is a function of two variables, but I never thought of it that way before”.) (I was a brash category theorist wannabe then.)

A lot of research has been done on understanding math, and some research has been done on mathematical discourse. But what has been done has merely exposed the fin of the shark.

References

[1] Images and metaphors (in abstractmath).

[2] Representations and Models (in abstractmath).

[3] Mathematical Concepts (previous blog).

[4] Mental Representations in Math (previous blog).

Send to Kindle

Operation as metaphor in math

Operation: Is it just a name or is there a metaphor behind it?

A function of the form {f:S\times S\rightarrow S} may be called a binary operation on {S}. The main point to notice is that it takes pairs of elements of {S} to the same set {S}.

A binary operation is a special case of n-ary operation for any natural number {n}, which is a function of the form {f:S^n\rightarrow S}. A {1}-ary (unary) operation on {S} is a function from a set to itself (such as the map that takes an element of a group to its inverse), and a {0}-ary (nullary) operation on {S} is a constant.

It is useful at times to consider multisorted algebra, where a binary operation can be a function {f:S_1\times S_2\rightarrow  S_3} where the {S_i} are possibly different sets. Then a unary operation is simply a function.

Calling a function a multisorted unary operation suggest a different way of thinking about it, but as far as I can tell the difference is only that the author is thinking of algebraic operations as examples. This does not seem to be a different metaphor the way “function as map” and “function as transformation” are different metaphors. Am I missing something?

In the 1960’s some mathematicians (not algebraists) were taken aback by the idea that addition of real numbers (for example) is a function. I observed this personally. I don’t think any mathematician would react this way today.

Send to Kindle

Knowledge is a (pre)sheaf

Mathematical structures as metaphors

People understand aspects of life that they don’t have good words for. Math could supply them with some names for these concepts. Just as music theory explains how Mozart’s music, blues, and klezmer music are different from each other (part of the explanation is: different scales).

It would be convenient if everyone understood comments such as “Race and ethnicity are not Boolean concepts”. Well, they don’t. In the case of race, I think many people over 70 years old or so are hung up on the idea that a person is either black or white. They ask questions about a mixed race person like “What is he?” Younger people seem to know better, but they don’t have a way of expressing the idea that the concepts of Boolean and fuzzy set would give them. In a similar way, ethnicity is a function of (at least) two independent variables: ancestry and culture. Many people understand this without having a decent way to say it. But who outside of mathematicians knows from independent variables?

The Theory of Everything is a sheaf of theories

Reading The Grand Design, by Stephen Hawking and Leonard Mlodinow, led me to the idea that knowledge, at least scientific knowledge, is like a sheaf. Astronomy, biology, chemistry and physics are different systems of knowledge. In some sense Newton discovered a map that interpreted astronomy in physics, Linus Pauling did something like that with chemistry and physics (calculating chemical reactions using quantum mechanics), and Crick and Watson got hold of a basic fact that interprets biology in chemistry.

Now physicists are worried because (in terms of the metaphors of sheaves) physics seems to consist of two theories, quantum mechanics and large-scale physics, that may be different open sets in a sheaf that doesn’t have a global element, and possibly even worse, the restriction maps to their intersection may not be compatible. In other words, it not only doesn’t have a global element but it may be only a presheaf!

Now that will not sit well with scientists. Ordinary people go through life having different theories about love, religion, politics, when you kick a table it hurts your foot, and so on, and don’t seem to worry a bit about whether the restriction maps are compatible. Many scientists seem to me to believe that all the restriction maps are compatible, but we don’t know the details yet. And many of them want to throw out whole theories (astrology, ESP, and lately religion) because they can’t think how the restriction maps could be compatible.

There is evidence that the scientists are right: more and more overlaps between different theories have been shown compatible over the years. All different experiences can be connected by one sheaf of theories. That feeling is base on historical experience, but also it is intrinsic to the scientific method to assume that you can reconcile different aspects of whatever you are studying. It isn’t a matter solely of faith that there is one Theory (sheaf) of Everything; it is a matter of methodology. That knowledge forms a sheaf, not just a presheaf is the claim that all knowledge is compatible. That there may not be a global element, one Theory of Everything, is a separate idea and one that Hawking & Mlodinow seem to hint at. It is certainly worth considering the possibility that there is no global element in the Universal Sheaf of Theories.

Send to Kindle

Technical meanings clash with everyday meanings

Recently (see note [a]) on MathOverflow, Colin Tan asked [1] “What does ‘kernel’ mean in ‘integral kernel’?”  He had noticed the different use of the word in referring to the kernels of morphisms.

I have long thought [2] that the clash between technical meanings and everyday meaning of technical terms (not just in math) causes trouble for learners.  I have recently returned to teaching (discrete math) and my feeling is reinforced — some students early in studying abstract math cannot rid themselves of thinking of a concept in terms of familiar meanings of the word.

One of the worst areas is logic, where “implies” causes well-known bafflement.   “How can ‘If P then Q’ be true if P is false??”  For a large minority of beginning college math students, it is useless to say, “Because the truth table says so!”.  I may write in large purple letters (see [3] for example) on the board and in class notes that The Definition of a Technical Math Concept Determines Everything That Is True About the Concept but it does not take.  Not nearly.

The problem seems to be worse in logic, which changes the meaning of words used in communicating math reasoning as well as those naming math concepts. But it is bad enough elsewhere in math.

Colin’s question about “kernel” is motivated by these feelings, although in this case it is the clash of two different technical meanings given to the same English word — he wondered what the original idea was that resulted in the two meanings.  (This is discussed by those who answered his question.)

Well, when I was a grad student I made a more fundamental mistake when I was faced with two meanings of the word “domain” (in fact there are at least four meanings in math).  I tried to prove that the domain of a continuous function had to be a connected open set.  It didn’t take me all that long to realize that calculus books talked about functions defined on closed intervals, so then I thought maybe it was the interior of the domain that was a, uh, domain, but I pretty soon decided the two meanings had no relation to each other.   If I am not mistaken Colin never thought the two meanings of “kernel” had a common mathematical definition.

It is not wrong to ask about the metaphor behind the use of a particular common word for a technical concept.  It is quite illuminating to get an expert in a subject to tell about metaphors and images they have about something.  Younger mathematicians know this.  Many of the questions on MathOverflow are asking just for that.  My recollection of the Bad Old Days of Abstraction and Only Abstraction (1940-1990?) is that such questions were then strongly discouraged.

Notes

[a] The recent stock market crash has been blamed [4] on the fact that computers make buy and sell decisions so rapidly that their actions cannot be communicated around the world fast enough because of the finiteness of the speed of light.  This has affected academic exposition, too.  At the time of writing, “recently” means yesterday.

References

[1] Colin Tan, “What does ‘kernel’ mean in ‘integral kernel’?

[2] Commonword names for technical concepts (previous blog).

[3] Definitions. (Abstractmath).

[4] John Baez, This weeks finds in mathematical physics, Week 297.

Send to Kindle

How "math is logic" ruined math for a generation

Mark Meckes responded to my statement

But it seems to me that this sort of thinking has mostly resulted in people thinking philosophy of math is merely a matter of logic and set theory.  That point of view has been ruinous to the practice of math.

with this comment:

I may be misreading your analysis of the second straw man, but you seem to imply that “people thinking philosophy of math is merely a matter of logic and set theory” has done great damage to mathematics. I think that’s quite an overstatement. It means that in practice, mathematicians find philosophy of mathematics to be irrelevant and useless. Perhaps philosophers of mathematics could in principle have something to say that mathematicians would find helpful but in practice they don’t; however, we’re getting along quite well without their help.

On the other hand, maybe you only meant that people who think “philosophy of math is merely a matter of logic and set theory” are handicapped in their own ability to do mathematics. Again, I think most mathematicians get along fine just not thinking about philosophy.

Mark is right that at least this aspect of philosophy of math is irrelevant and useless to mathematicians.  But my remark that the attitude that “philosophy of math is merely a matter of logic and set theory” is ruinous to math was sloppy, it was not what I should have said.    I was thinking of a related phenomenon which was ruinous to math communication and teaching.

By the 1950’s many mathematicians adopted the attitude that all math is is theorem and proof.  Images, metaphors and the like were regarded as misleading and resulting in incorrect proofs.  (I am not going to get into how this attitude came about).     Teachers and colloquium lecturers suppressed intuitive insights and motivations in their talks and just stated the theorem and went through the proof.

I believe both expository and research papers were affected by this as well, but I would not be able to defend that with citations.

I was a math student 1959 through 1965.  My undergraduate calculus (and advanced calculus) teacher was a very good teacher but he was affected by this tendency.  He knew he had to give us intuitive insights but he would say things like “close the door” and “don’t tell anyone I said this” before he did.  His attitude seemed to be that that was not real math and was slightly shameful to talk about.  Most of my other undergrad teachers simply did not give us insights.

In graduate school I had courses in Lie Algebra and Mathematical Logic from the same teacher.   He was excellent at giving us theorem-proof lectures, much better than most teachers, but he never gave us any geometric insights into Lie Algebra (I never heard him say anything about differential equations!) or any idea of the significance of mathematical logic.  We went through Killing’s classification theorem and Gödel’s incompleteness theorem in a very thorough way and I came out of his courses pleased with my understanding of the subject matter.  But I had no idea what either one of them had to do with any other part of math.

I had another teacher for several courses in algebra and various levels of number theory.   He was not much for insights, metaphors, etc, but he did do well in explaining how you come up with a proof.  My teacher in point set topology was absolutely awful and turned me off the Moore Method forever.   The Moore method seems to be based on: don’t give the student any insights whatever. I have to say that one of my fellow students thought the Moore method was the best thing since sliced bread and went on to get a degree from this teacher.

These dismal years in math teaching lasted through the seventies and perhaps into the eighties.  Apparently now younger professors are much more into insights, images and metaphors and to some extent into pointing out connections with the rest of math and science.  Since I have been retired since 1999 I don’t have much exposure to the newer generation and I am not sure how thoroughly things have changed.

One noticeable phenomenon was that category theorists (I got into category theory in the mid seventies) were very assiduous in lectures and to some extent in papers in giving motivation and insight.  It may be that attitudes varied a lot between different disciplines.

This Dark Ages of math teaching was one of the motivations for abstractmath.org.  My belief is that not only should we give the students insights, images and metaphors to think about objects, and so on, but that we should be upfront about it:   Tell them what we are doing (don’t just mutter the word “intuitive”) and point out that these insights are necessary for understanding but are dangerous when used in proofs.  Tell them these things with examples. In every class.

My other main motivation for abstractmath.org was the way math language causes difficulties.  But that is another story.

Send to Kindle

Mental Representations in Math

This post is part of the abstractmath article on images and metaphors.  I have had some new insights into the subject of mental representations and have incorporated them in this rewritten version (which omits some examples).  I would welcome comments.

Mathematicians who work with a particular kind of mathematical object have mental representations of that type of object that help them understand it.  These mental representations come in various forms:

  • Visual images,  for example of what a right triangle looks like.
  • Notation, for example visualizing the square root of 2 by the symbol\sqrt{2}“.  Of course, in a sense notation is also a physical representation of the number.  An important fact:  A mathematical object may be referred to by many different notations. There are examples here and here. (If you think deeply about the role notation plays in your head and on paper you can easily get a headache.)
  • Kinetic understanding, for example thinking of the value of a function as approaching a limit by imagining yourself moving along the graph of the function.
  • Metaphorical understanding, for example thinking of a function such as  as a machine that turns one number into another: for example, when you put in 3 out comes 9.   See also literalism and this post on Gyre&Gimble.

Example

Consider the function h(t)=25-(t-5)^2.   The chapter on images and metaphors for functions describes many ways to think about functions.  A few of them are considered here.

Visual images You can picture this function in terms of its graph, which is a parabola.   You can think of it more physically, as like the Gateway Arch.  The graph visualization suggests that the function has a single maximum point that appears to occur at t = 5.

I personally use visual placement to remember relationships between abstract objects, as well.  For example, if I think of three groups, two of which are isomorphic (for example C_2 and \text{Alt}_3.), I picture them as in different places with a connection between the two isomorphic ones.  I know of no research on this.

Notation You can think of the function as its formula .  The formula tells you that its graph will be a parabola (if you know that quadratics give parabolas) and it tells you instantly without calculus that its maximum will be at (see ratchet effect).

Another formula for the same function is -t^2+10t.   The formula is only a representation of the function.  It is not the same thing as the function.  The functions h(t) and k(t) defined on the real numbers  by h(t)=25-(t-5)^2 and k(t)=-t^2+10t are the same function; in other words, h = k.

Kinetic The function h(t)  could model the height over time of a physical object, perhaps a ball shot vertically upwards on a planet with no atmosphere. You could think of the ball starting at time t = 0 at elevation 0, reaching an elevation of (for example) 16 units at time t = 2, and landing at t = 10.  You are imagining a physical event continuing over time, not just as a picture but as a feeling of going up and down (see mirror neuron).  This feeling of the ball going up and down is attached in your brain to your understanding of the function h(t).

Although h(t) models the height of the ball, it is not the same thing as the height of the ball. A mathematical object may have a relationship in our mind to physical processes or situations but is distinct from them.

According to this report, kinetic understanding can also help with learning math that does not involve pictures.  I know that when I think of evaluating the function  at 3, I visualize 3 moving into the x slot and then the formula  transforming itself into 10.  I remember doing this even before I had ever heard of the Transformers.

Metaphor One metaphor for functions is that it is a machine that turns one number into another.  For example, the function h(t)  turns 0 into 0 (which is therefore a fixed point) and 5 into 25.  It also turns 10 into 0, so once you have the output you can’t necessarily tell what the input was (the function is not injective).

More examples

  • ¨ “Continuous functions don’t have gaps in the graph“. This is a visual image.
  • ¨ You may think of the real numbers as lying along a straight line (the real line) that extends infinitely far in both directions. This is both visual and a metaphor (a real number “is” a place on the real line).
  • ¨ You can think of the set containing 1, 3 and 5 and nothing else in terms of its list notation {1, 3, 5}. But remember that {5, 1,3} is the same set. In other words the list notation has irrelevant features – the order in which the elements are listed in this case.
  • The interior of a closed curve or a sphere is called that because it is like the interior in the everyday sense of a bucket or a house. Note The boundary of a real-life container such as a bucket has thickness, in contrast to the boundary of a closed curve or a sphere. This observation illustrates my description of a metaphor as identifying part of one situation with part of another. One aspect is emphasized; another aspect, where they may differ, is ignored.

Uses of mental representations

Integers and metaphors make up what is arguably the most important part of the mathematician’s understanding of the concept.

  • Mental representations of mathematical objects using metaphors and images are necessary for understanding and communicating about them (especially with types of objects that are new to us).
  • They are necessary for seeing how the theory can be applied. 
  • They are useful for coming up with proofs.

Many representations

Different mental representations of the same kind of object help you understand different aspects of the object.

Every important mathematical object has many representations and skilled mathematicians generally have several of them in mind at once.

New concepts and old ones

We especially depend on metaphors and images to understand a math concept that is new to us.  But if we work with it for awhile, finding lots of examples, and eventually proving theorems and providing counterexamples to conjectures, we begin to understand the concept in its own terms and the images and metaphors tend to fade away from our awareness…

Then, when someone asks us about this concept that we are now experts with, we trundle out our old images and metaphors – and are often surprised at how difficult and misleading our listener finds them!

Some mathematicians retreat from images and metaphors because of this and refuse to do more than state the definition and some theorems about the concept.   They are wrong to do this. That behavior encourages the attitude of many people that

  • mathematicians can’t explain things
  • math concepts are incomprehensible or bizarre
  • you have to have a mathematical mind to understand math

All three of these statements are half-truths. There is no doubt that a lot of abstract math is hard to understand, but understanding is certainly made easier with the use of images and metaphors

Send to Kindle

Proofs without dry bones

I have discussed images, metaphors and proofs in math in two ways:

(A) A mathematical proof

A monk starts at dawn at the bottom of a mountain and goes up a path to the top, arriving there at dusk. The next morning at dawn he begins to go down the path, arriving at dusk at the place he started from on the previous day. Prove that there is a time of day at which he is at the same place on the path on both days.

Proof: Envision both events occurring on the same day, with a monk starting at the top and another starting at the bottom at the same time and doing the same thing the monk did on different days. They are on the same path, so they must meet each other. The time at which they meet is the time required.

This example comes from Fauconnier, Mappings in Thought and Language, Cambridge Univ. Press, 1997. I discuss it in the Handbook, pages 46 and 153. See the Wikipedia article on conceptual blending.

(B) Rigor and rigor mortis

The following is quoted from a previous post here. See also the discussion in abstractmath.

When we are trying to understand or explain math, we may use various kinds of images and metaphors about the subject matter to construct a colorful and rich representation of the mathematical objects and processes involved. I described some of these briefly here. They can involve thinking of abstract things moving and changing and affecting each other.

When we set out to prove some math statement, we go into what I have called “rigorous mode”. We feel that we have to forget some of the color and excitement of the rich view. We must think of math objects as inert and static. They don’t move or change over time and they don’t interact with other objects or the real world. In other words, pretend that all math objects are dead.

We don’t always go all the way into this rigorous mode, but if we use an image or metaphor in a proof and someone challenges us about it, we may rewrite that part to get rid of the colorful representation and replace it by a calculation or line of reasoning that refers to the math objects as if they were inert and static – dead.

I didn’t contradict myself.
I want to clear up some tension between these two ideas.

The argument in (A) is a genuine mathematical proof, just as it is written. It contains hidden assumptions (enthymemes), but all math proofs contain hidden assumptions. My remarks in (B) do not mean that a proof is not a proof until everything goes dead, but that when challenged you have to abandon some of the colorful and kinetic reasoning to make sure you have it right. (This is a standard mathematical technique (note 1).)

One of the hidden assumptions in (A) is that two monks walking the opposite way on the path over the same interval of time will meet each other. This is based on our physical experience. If someone questions this we have several ways to get more rigorous. One many mathematicians might think of is to model the path as a curve in space and consider two different parametrizations by the unit interval that go in opposite directions. This model can then appeal to the intermediate value theorem to assert that there is a point where the two parametrizations give the same value.

I suppose that argument goes all the way to the dead. In the original argument the monk is moving. But the parametrized curve just sits there. The parametrizations are sets of ordered pairs in R x (R x R x R). Nothing is moving. All is dry bones. Ezekiel has not done his thing yet.

This technique works, I think, because it allows classical logic to be correct. It is not correct in everyday life when things are moving and changing and time is passing.

Avoid models; axiomatize directly
But it certainly is not necessary to rigorize this argument by using parametrizations involving the real numbers. You could instead look at the situation of the monk and make some axioms the events being described. For example, you could presumably make axioms on locations on the path that treat the locations as intervals rather than as points.

The idea is to make axioms that state properties that intervals have but doesn’t say they are intervals. For example that there is a relation “higher than” between locations that must be reflexive and transitive but not antisymmetric. I have not done this, but I would propose that you could do this without recreating the classical real numbers by the axioms. (You would presumably be creating the intuitionistic real numbers.)

Of course, we commonly fall into using the real numbers because methods of modeling using real numbers have been worked out in great detail. Why start from scratch?

About the heading on this section: There is a sense in which “axiomatizing directly” is a way of creating a model. Nevertheless there is a distinction between these two approaches, but I am to confused to say anything about this right now.

First order logic.
It is commonly held that if you rigorize a proof enough you could get it all the way down to a proof in first order logic. You could do this in the case of the proof in (A) but there is a genuine problem in doing this that people don’t pay enough attention to.

The point is you replace the path and the monks by mathematical models (a curve in space) and their actions by parametrizations. The resulting argument calls on well known theorems in real analysis and I have no doubt can be turned into a strict first order logic argument. But the resulting argument is no longer about the monk on the path.

The argument in (A) involves our understanding of a possibly real physical situation along with a metaphorical transference in time of the two walks (a transference that takes place in our brain using techniques (conceptual blending) the brain uses every minute of every day). Changing over to using a mathematical model might get something wrong. Even if the argument using parametrized curves doesn’t have any important flaws (and I don’t believe it does) it is still transferring the argument from one situation to another.

Conclusion:
Mathematical arguments are still mathematical arguments whether they refer to mathematical objects or not. A mathematical argument can be challenged and tested by uncovering hidden assumptions and making them explicit as well as by transferring the argument to a classical mathematical situation.

Note 1. Did you ever hear anyone talking about rigor requiring making images and metaphors dead? This is indeed a standard mathematical technique but it is almost always suppressed, or more likely unnoticed. But I am not claiming to be the first one to reveal it to the world. Some of the members of Bourbaki talked this way. (I have lost the reference to this.)

They certainly killed more metaphors than most mathematicians.

Note 2. This discussion about rigor and dead things is itself a metaphor, so it involves a metametaphor. Metaphors always have something misleading about them. Metametaphorical statements have the potential of being far worse. For example, the notion that mathematics contains some kind of absolute truth is the result of bad metametaphorical thinking.

Send to Kindle

Math and the Modules of the Mind

I have written (references below) about the way we seem to think about math objects using our mind’s mechanisms for thinking about physical objects. What I want to do in this post is to establish a vocabulary for talking about these ideas that is carefully enough defined that what I say presupposes as little as possible about how our mind behaves. (But it does presuppose some things.) This is roughly like Gregor Mendel’s formulation of the laws of inheritance, which gave precise descriptions of how characteristics were inherited while saying nothing at all about the mechanism.

I will use module as a name for the systems in the mind that perform various tasks.

Examples of modules

a) We have an “I’ve seen this before module” that I talked about here.

b) When we see a table, our mind has a module that recognizes it as a table, a module that notes that it is nearby, and in particular a module that notes that it is a physical object. The physical-object module is connected to many other modules, including for example expectations of what we would feel if we touched it, and in particular connections to our language-producing module that has us talk about it in a certain way (a table, the table, my table, and so on.)

c) We also have a module for abstract objects. Abstract objects are discussed in detail in the math objects chapter of abstractmath.org. A schedule is an abstract object, and so is the month of November. They are not mathematical objects because they affect people and change over time. (More about this here.) For example, the statement “it is now November” is true sometimes and false sometimes. Abstract objects are also not abstractions, like “beauty” and “love” which are not thought of as objects.

d) We talk about numbers in some ways like we talk about physical objects. We say “3 is a number”. We say “I am thinking of the only even prime”. But if we point and say, “Look, there is a 3”, we know that we have shifted ground and are talking about, not the number 3, but about a physical representation of the number 3. That’s because numbers trigger our abstract object module and our math object module, but not our physical object module. (Back and fill time: if you are not a mathematician, your mind may not have a math object module. People are not all the same.)

More about modules

My first choice for a name for these systems would have been object, as in object-oriented programming, but this discussion has too many things called objects already. Now let’s clear up some possible misconceptions:

e) I am talking about a module of the mind. My best guess would be that the mind is a function of the brain and its relationship with the world, but I am not presuppposing that. Whatever the mind is, it obviously has a system for recognizing that something is a physical object or a color or a thought or whatever. (Not all the modules are recognizers; some of them initiate actions or feelings.)

f) It seems likely that each module is a neuron together with its connections to other neurons, with some connections stronger than others (our concepts are fuzzy, not Boolean). But maybe a module is many neurons working together. Or maybe it is like a module in a computer program, that is instantiated anew each time it is called, so that a module does not have a fixed place in the brain. But it doesn’t matter. A module is whatever it is that carries out a particular function. Something has to carry out such functions.

Math objects

The modules in a mathematician’s mind that deal with math objects use some of the same machinery that the mind uses for physical objects.

g) You can do things to them. You can add two numbers. You can evaluate a function at an input. You can take the derivative of some functions.

h) You can discover properties of some kinds of math objects. (Every differentiable function is continuous.)

i) Names of some math objects are treated as proper nouns (such as “42”) and others as common nouns (such as “a prime”.)

I maintain that these phenomena are evidence that the systems in your mind for thinking about physical objects are sometimes useful for thinking about math objects.

Different ways of thinking about math objects.

j) You can construct a mathematical object that is new to you. You may feel that you invented it, that it didn’t exist before you created it. That’s your I just created this module acting. If you feel this way, you may think math is constantly evolving.

k) Many mathematicians feel that math objects are all already there. That’s a module that recognizes that math objects don't come into or go out of existence.

l) When you are trying to understand math objects you use all sorts of physical representations (graphs, diagrams) and mental representations (metaphors, images). You say things like, “This cubic curve goes up to positive infinity in the negative direction” and “This function vanishes at 2” and “Think of a Möbius strip as the unit square with two parallel sides identified in the reverse direction.”

m) When you are trying to prove something about math objects mathematicians generally think of math objects as eternal and inert (not affecting anything else). For example, you replace “the slope of the secant gets closer and closer to the slope of the tangent” by an epsilon-delta argument in which everything you talk about is treated as if it is unchanging and permanent. (See my discussion of the rigorous view.)

Consequences

When you have a feeling of déjà vu, it is because something has triggered your “I have seen this before” module (see (a)). It does not mean you have seen it before.

When you say “the number 3” is odd, that is a convenient way of talking about it (see (d) above), but it doesn’t mean that there is really only one number three.

If you say the function x^2 takes 3 to 9 it doesn’t have physical consequences like “Take me to the bank” might have. You are using your transport module but in a pretend way (you are using the pretend module!).

When you think you have constructed a new math object (see (j)), your mental modules leave you feeling that the object didn’t exist before. When you think you have discovered a new math object (see (k)), your modules leave you feeling that it did exist before. Neither of those feelings say anything about reality, and you can even have both feelings at the same time.

When you think about math objects as eternal and inert (see (m)) you are using your eternal and inert modules in a pretend way. This does not constitute an assertion that they are eternal and inert.

Is this philosophy?

My descriptions of how we think about math are testable claims about the behavior of our mind, expressed in terms of modules whose behavior I (partially) specify but whose nature I don’t specify. Just as Mendel’s Laws turned out to be explained by the real behavior of chromosomes under meiosis, the phenomena I describe may someday turn out to be explained by whatever instantiation the modules actually have – except for those phenomena that I have described wrongly, of course – that is what “testable” means!

So what I am doing is science, not philosophy, right?

Now my metaphor-producing module presents the familiar picture of philosophy and science as being adjacent countries, with science intermittently taking over pieces of philosophy’s territory…

Links to my other articles in this thread

Math objects in abstractmath.org
Mathematical objects are “out there”?
Neurons and math
A scientific view of mathematics (has many references to what other people have said about math objects)
Constructivism and Platonism

Send to Kindle