Shared mental objects

Notes on viewing

Shared mental objects

I propose the phrase "shared mental object" to name the sort of thing that includes mathematical objects, abstract objects, fictional objects and other concepts with the following properties: ​

  • They are not physical objects
  • We think of them as objects 
  • We share them with other people

It is the name "shared mental object" that is a new idea; the concept has been around in philosophy and math ed for awhile and has been called various things, especially "abstract object", which is the name I have used in abstractmath.

I will go into detail concerning some examples in order to make the concept clear.  If you examine this concept deeply you discover many fine points, nested ideas and circles of examples that go back on themselves.  I will not get very far into these fine points here, but I have written about some of them posts and in abmath (see references).  I am working on a post about some of the fine points and will publish it if I can control its tendency to expand into infinite proliferation and recursion.

Examples

 

Messages

There is a story about the early days of telegraphy:  A man comes into the newly-opened telegraph station and asks to send a telegram to his son who is working in another city. He writes out the message and gives it to the operator with his payment.  The operator puts the message on a spike and clicks the key in front of him for a while, then says, "I have sent your message.  Thanks for shopping at Postal Telegraph".  The man looks astonished and points at the message and says, "But it is still here!"

A message is a shared mental object.  

  • It may be represented by a physical object, such as a piece of paper with writing on it, and people commonly refer to the paper as the message.  
  • It may be a verbal message from you, perhaps delivered by another person to a third person by speech.  
  • The delivery process may introduce errors (so can sending a telegraph).  So the thoughts in the three brains (the sender, the deliverer and the recipient) can differ from each other, but they can still talk about "the message" as if it were one object.

Other examples that are similar in nature to messages are schedules and the month of September (see Math Objects in abmath, where they are called abstract objects.).  In English-speaking communities, September is a cultural default: you are expected to know what it is. You can know that September is a month and that right this minute it is not September (unless it is September). You may think that September has 31 days and most people would say you are wrong, but they would agree that you and they are talking about the same month.

The general concept of the month of September and facts concerning it have been in shared existence in English-speaking cultural groups for (maybe) a thousand years.  In contrast, a message is usually shared by only two or three people and it has a short life; a few years from now, it may be that none of the people involved with the message remember what it said or even that it existed.

Symbols

symbol, such as the letter "a" or the integral sign "$\int$", is a shared mental object.  Like the month of September, but unlike messages, letters are shared by large cultural entities, every language community that uses the Latin alphabet (and more) in the case of "a", and math and tech people in the case of "$\int$". 

The letter "a" is represented physically on paper, a blackboard or a screen, among other things.  If you are literate in English and recognize an occurrence as representing the letter, you probably do this using a process in the brain that is automatic and that operates outside your awareness

Literate readers of English also generally agree that a string of letters either does or does not represent the word "default" but there are borderline cases (as in those little boxes where you have to prove you are not a robot) where they may disagree or admit that they don't know.  Even so, the letter "a" and the word "default" are shared in the minds of many people and there is general (but not absolutely universal) agreement on when you are seeing representations of them.

Fictional objects

Fictional objects such as Sherlock Holmes and unicorns are shared mental objects.  I wrote briefly about them in Mathematical objects and will not go into them here.  

Mathematical objects 

The integer $111$, the integral $\int_0^1 x^2\,dx$ and the set of all real numbers are all mathematical objects.   They are all shared mental objects.  In most of the world, people with a little education will know that $111$ is a number and what it means to have $111$ beans in a jar (for example).  They know that it is one more that $110$ and a lot more than $42$.  

Mathematicians, scientists and STEM students will know something about what  $\int_0^1 x^2\,dx$ means and they will probably know how to calculate it.  Most  of them may be able to do it in their head.  I have taught calculus so many times that I know it "by heart", which means that it is associated in my brain with the number $1/3$ in such a way that when I see the integral the number automatically and without effort pops us (in the same way that I know September has 30 days).

Beginning calculus students may have a confused and incorrect understanding of the set of all real numbers in several ways, but practicing mathematicians (and many others) know that it is an uncountably infinite dense set and they think of it as an object.  A student very likely does not think of it as an object, but as a sprawling unimaginable space that you cannot possibly regard as a thing. Students may picture a real number as having another real number sitting right beside it — the next biggest one. Most practicing mathematicians think of the set of real numbers as a completed infinity – every real number is already there —  and they know that between any two of them there is another one.

As a consequence, when students and professors talk about real numbers the student finds that some times the professor says things that sound completely wrong and the professor hears the student say things that are bizarre and confused.  They firmly believe they are talking about the same thing, the real numbers, but the student is seen by the professor as wrong and the professor is seen by the students as talking meaningless nonsense.  Even so, they believe they are talking about the same thing.

Nomenclature

I tried various other names before I came to "shared mental objects".

  • I called them abstract objects in abstractmath.  The word "abstract" does not convey their actual character — they are mental and they are shared.
  • They are non-physical objects, a phrase widely used in philosophy, but naming something by a negation is always a bad idea.  
  • Co-mental objects is ugly and comental looks like a misspelling.
  • Intermental objects sounds like it has something to do with burial.  Maybe InterMental?
  • The word entity may avoid some confusion caused by the word "object", which suggests physical object.  But "object" is widely used in philosophy and in math ed in the way it is used here.
  • Meme?  Well, in some sense a shared mental object is a meme.  Memes have a connotation of forcing themselves into your brain that I don't want, but I want to consider the relationship further.

The major advantage of "shared mental object" is that it describes the important properties of the concept: It is a mental object and it is shared by people.  It has no philosophical implications concerning platonism, either. Mathematical objects do have special properties of verifiability that general shared mental objects do not, but my terminology does not suggest any existence of absolute truth or of an Ideal existing in another world.  I don't believe in such things, but some people do and I want to point out that "shared mental object" does not rule such things out — it merely gives a direct evidence-based description of a phenomenon that actually exists in the real world.

References  

Abstract objects in the Stanford Encyclopedia of Philosophy

Abstract object in Wikipedia

Mathematical objects in abstractmath

Mathematical objects in Wikipedia

What is Mathematics, Really?  R. Hersh, Oxford University Press, 1997

Previous posts

Representations of mathematical objects 

Representations III: Rigor and Rigor Mortis

Representations II: Dry Bones

Notes on Viewing  

This post uses MathJax. If you see mathematical expressions with dollar signs around them, or badly formatted formulas, try refreshing the screen. Sometimes you have to do it two or three times.

Representations of mathematical objects

This is a long post. Notes on viewing.

About this post

A mathematical object, or a type of math object, is represented in practice in a great variety of ways, including some that mathematicians rarely think of as "representations".  

In this post you will find examples and comments about many different types of representations as well as references to the literature. I am not aware that anyone has considered all these different ideas of representation in one place before. Reading through this post should raise your consciousness about what is going on when you do math.  

This is also an experiment in exposition.  The examples are discussed in a style similar to the way a Mathematica command is discussed in the Documentation Center, using mostly nonhierarchical bulleted lists. I find it easy to discover what I want to know when it is written in that way.  (What is hard is discovering the name of a command that will do what I want.)

Types of representations

Using language

  • Language can be used to define a type of object.
  • A definition is intended to be precise enough to determine all the properties that objects of that type all have.  (Pay attention to the two uses of the word "all" in that sentence; they are both significant, in very different ways.)
  • Language can be used to describe an object, exhibiting properties without determining all properties.
  • It can also provide metaphors, making use of one of the basic tools of our brain to understand the world. 
  • The language used is most commonly mathematical English, a special dialect of English.
  • The symbolic language of mathematics (distinct from mathematical English) is used widely in calculations. Phrases from the symbolic language are often embedded in a statement in math English. The symbolic language includes among others algebraic notation and logical notation. 
  • The language may also be a formal language, a language that is mathematically defined and is thus itself a mathematical object. Logic texts generally present the first order predicate calculus as a formal language. 
  • Neither mathematical English nor the symbolic language is a formal language. Both allow irregularities and ambiguities.

Mathematical objects

The representation itself may be a mathematical object, such as:

  • A linear representation of a group. Not only are the groups mathematical objects, so is the representation.
  • An embedding of a manifold into Euclidean space. A definition given in a formal language of the first order predicate calculus of the property of commutativity of binary operations. (Thus a property can be represented as a math object.)

Visual representations

A math object can be represented visually using a physical object such as a picture, graph (in several senses), or diagram.  

  • The visual processing of our brain is our major source of knowledge of the world and takes about a fifth of the brain's processing power.  We can learn many things using our vision that would take much longer to learn using verbal descriptions.  (Proofs are a different matter.)
  • When you look at a graph (for example) your brain creates a mental representation of the graph (see below).

Mental representations

If you are a mathematician, a math object such as "$42$", "the real numbers" or "continuity" has a mental representation in your brain.  

  • In the math ed literature, such a representation is called "mental image", "concept image", "procept", or "schema".   (The word "image" in these names is not thought of as necessarily visual.) 
  • The procept or schema describe all the things that come to mind when you think about a particular math object: The definition, important theorems, visual images, important examples, and various metaphors that help you understand it. 
  • The visual images occuring in a mental schema for an object may themselves be mental representations of physical objects. The examples and theorems may be mental representations of ideas you learned from language or pictures, and so on.  The relationships between different kinds of representations get quite convoluted.

Metaphors

Conceptual metaphors are a particular kind of mental representation of an object which involve mentally associating some aspects of the objects with some aspects of something else — a physical object, an image, an action or another abstract object.

  • A conceptual metaphor may give you new insight into the object.
  • It may also mislead you because you think of properties of the other object that the math object doesn't have.
  • A graph of a function is a conceptual metaphor.
  • When you say that a point on a graph "rises as it goes from left to right" your metaphor is an action. 
  • When you say that the cosets of a normal subgroup of a group "get along" with the group multiplication, your metaphor identifies a property they have with an aspect of human behavior.

Properties of representations

A representation of a math object may or may not

  • determine it completely
  • exhibit some of its properties
  • suggest easy proofs of some theorems
  • provide a useful way of thinking about it
  • mislead you about the object's properties
  • mislead you about what is significant about the object

Examples of representations

This list shows many of the possibilities of representation.  In each case I discuss the example in terms of the two bulleted lists above. Some of the examples are reused from my previous publications.

Functions

Example (F1) "Let $f(x)$ be the function defined by $f(x)=x^3-x$."

  • This is an expression in mathematical English that a fluent reader of mathematical English will recognize gives a definition of a specific function.
  • (F1) is therefore a representation of that function.  
  • The word "representation" is not usually used in this way in math.  My intention is that it should be recognized as the same kind of object as many other representations.
  • The expression contains the formula $x^3-x$.  This is an encapsulated computation in the symbolic language of math. It allows someone who knows basic algebra and calculus to perform calculations that find the roots, extrema and inflection points of the function $f$.  
  • The word "let" suggests to the fluent reader of mathematical English that (F1) is a definition which is probably going to hold for the next chunk of text, but probably not for the whole article or book.
  • Statements in mathematical English are generally subject to conventions.  In a calculus text (F1) would automatically mean that the function had the real numbers as domain and codomain.
  • The last two remarks show that a beginner has to learn to read mathematical English. 
  • Another convention is discussed in the following diatribe.

Diatribe 

You would expect $f(x)$ by itself to mean the value of $f$ at $x$, but in (F1) the $x$ has the property of a bound variable.  In mathematical English, "let" binds variables. However, after the definition, in the text the "$x$" in the expression "$f(x)$" will be free, but the $f$ will be bound to the specific meaning.  It is reasonable to say that the term "$f(x)$" represents the expression "$x^3-x$" and that $f$ is the (temporary) name of the function. Nevertheless, it is very common to say "the function $f(x)$" to mean $f$.  

A fluent reader of mathematical English knows all this, but probably no one has ever said it explicitly to them.  Mathematical English and the symbolic language should be taught explicitly, including its peculiarities such as "the function $f(x)$".  (You may want to deprecate this usage when you teach it, but students deserve to understand its meaning.)

The positive integers

You have a mental representation of the positive integers $1,2,3,\ldots$.  In this discussion I will assume that "you" know a certain amount of math.  Non-mathematicians may have very different mental representations of the integers.

  • You have a concept of "an integer" in some operational way as an abstract object.
  • "Abstract object" needs a post of its own. Meanwhile see Mathematical Objects (abstractmath) and the Wikipedia articles on Mathematical objects and Abstract objects.
  • You have a connection in your brain between the concept of integer and the concept of listing things in order, numbering them by $1,2,3,\ldots$.
  • You have a connection in your brain between the concept of an integer and the concept of counting a finite number of objects.  But then you need zero!
  • You understand how to represent an integer using the decimal representation, and perhaps representations to other bases as well. 
  • Your mental image has the integer "$42"$ connected to but not the same as the decimal representation "42". This is not true of many students.
  • The decimal rep has a picture of the string "42" associated to it, and of course the picture of the string may come up when you think of the integer $42$ as well (it does for me — it is a an icon for the number $42$.)
  • You have a concept of the set of integers. 
  • Students need to be told that by convention "the set of integers" means the set of all integers.  This particularly applies to students whose native language does not have articles, but American students have trouble with this, too.
  • Your concept of  "the set of integers" may have the icon "$\mathbb{N}$" associated with it.  If you are a mathematician, the icon and the concept of the set of integers are associated with each other but not identified with each other.
  • For me, at least, the concept "set of integers" is mentally connected to each integer by the "element of" relation. (See third bullet below.)
  • You have a mental representation of the fact that the set of integers is infinite.  
  • This does not mean that your brain contains an infinite number of objects, but that you have a representation of infinity as a concept, it is brain-connected to the concept of the set of integers, and also perhaps to a proof of the fact that $\mathbb{N}$ is infinite.
  • In particular, the idea that the set of integers is mentally connected to each integer does not mean that the whole infinite number of integers is attached in your brain to the concept of the set of integers.  Rather, the idea is a predicate in your brain.  When it is connected to "$42$", it says "yes".  To "$\pi$" it says "No".
  • Philosophers worry about the concept of completed infinity.  It exists as a concept in your brain that interacts as a meme with concepts in other mathematicians' brains. In that way, and in that way only (as far as I am concerned) it is a physical object, in particular an object that exists in scattered physical form in a social network.

Graph of a function

This is a graph of the function $y=x^3-x$:

Graph of a cubic function

  • The graph is a physical object, either on a screen or on paper
  • It is processed by your visual system, the most powerful sensory management system in your brain
  • It also represents the graph in the mathematical sense (set of ordered pairs) of the function $y=x^3-x$
  • Both the mathematical graph and the physical graph are represented by modules in your brain, which associates the two of them with each other by a conceptual metaphor
  • The graph shows some properties of the function: inflection point, going off to infinity in a specific way, and so on.
  • These properties are made apparent (if you are knowledgeable) by means of the powerful pattern recognition system in your brain. You see them much more quickly than you can discover them by calculation.
  • These properties are not proved by the graph. Nevertheless, the graph communicates information: for example, it suggests that you can prove that there is an inflection point near $(0,0)$.
  • The graph does not determine or define the function: It is inaccurate and it does not (cannot) show all of the graph.
  • More subtle details about this graph are discussed in my post Representations 2.

Continuity

Example (C1) The $\epsilon-\delta$ definition of the continuity of a function $f:\mathbb{R}\to\mathbb{R}$ may be given in the symbolic language of math:

A function $f$ is continuous at a number $c$ if \[\forall\epsilon(\epsilon\gt0\implies(\forall x(\exists\delta(|x-c|\lt\delta\implies|f(x)-f(c)|\lt\epsilon)))\]

  • To understand (C1), you must be familiar with the notation of first order logic.  For most students, getting the notation right is quite a bit of work.  
  • You must also understand  the concepts, rules and semantics of first order logic.  
  • Even if you are familiar with all that, continuity is still a difficult concept to understand.
  • This statement does show that the concept is logically complicated. I don't see how it gives any other intuition about the concept. 

Example (C2) The definition of continuity can also be represented in mathematical English like this:

A function $f$ is continuous at a number $c$ if for any $\epsilon\gt0$ and for any $x$ there is a $\delta$ such that if $|x-c|\lt\delta$, then $|f(x)-f(c)|\lt\epsilon$. 

  • This definition doesn't give any more intuition that (C1) does.
  • It is easier to read that (C1) for most math students, but it still requires intimate familiarity with the quirks of math English.
  • The fact that "continuous" is in boldface signals that this is a definition.  This is a convention.
  • The phrase "For any $\epsilon\gt0$" contains an unmarked parenthetic insertion that makes it grammatically incoherent.  It could be translated as: "For any $\epsilon$ that is greater than $0$".  Most math majors eventually understand such things subconsciously.  This usage is very common.
  • Unless it is explicitly pointed out, most students won't notice that  if you change the phrase "for any $x$ there is a $\delta$"  to "there is a $\delta$ for any $x$" the result means something quite different.  Cauchy never caught onto this.
  • In both (C1) and (C2), the "if" in the phrase "A function $f$ is continuous at a number $c$ if…" means "if and only if" because it is in a definition.  Students rarely see this pointed out explicitly.  

Example (C3) The definition of continuity can be given in a formally defined first order logical theory

  • The theory would have to contain function symbols and axioms expressing the algebra of real numbers as an ordered field. 
  • I don't know that such a definition has ever been given, but there are various semi-automated and automated theorem-proving systems (which I know little about) that might be able to state such a definition.  I would appreciate information about this.
  • Such a definition would make the property of continuity a mathematical object.
  • An automated theorem-proving system might be able to prove that $x^3-x$ is continuous, but I wonder if the resulting proof would aid your intuition much.

Example (C4) A function from one topological space to another is continuous if the inverse of every open set in the codomain is an open set in the domain.

  • This definition is stated in mathematical English.
  • All definitions start with primitive data. 
  • In definitions (C1) – (C3), the primitive data are real numbers and the statement uses properties of an ordered field.
  • In (C4), the data are real numbers and the arithmetic operations of a topological field, along with the open sets of the field. The ordering is not mentioned.
  • This shows that a definition need not mention some important aspects of the structure. 
  • One marvelous example of this is that  a partition of a set and an equivalence relation on a set are based on essentially disjoint sets of data, but they define exactly the same type of structure.

Example (C4) "The graph of a continuous function can be drawn without picking up the chalk".

  • This is a metaphor that associates an action with the graph.
  • It is incorrect: The graphs of some continuous functions cannot be drawn.  For example, the function $x\mapsto x^2\sin(1/x)$ is continuous on the interval $[-1,1]$ but cannot be drawn at $x=0$. 
  • Generally speaking, if the function can be drawn then it can be drawn without picking up the chalk, so the metaphor provides a useful insight, and it provides an entry into consciousness-raising examples like the one in the preceding bullet.

References

  1. 1.000… and .999… (post)
  2. Conceptual blending (post)
  3. Conceptual blending (Wikipedia)
  4. Conceptual metaphors (Wikipedia)
  5. Convention (abstractmath)
  6. Definitions (abstractmath)
  7. Embodied cognition (Wikipedia)
  8. Handbook of mathematical discourse (see articles on conceptual blendmental representationrepresentationmetaphor, parenthetic assertion)
  9. Images and Metaphors (abstractmath).
  10. The interplay of text, symbols and graphics in math education, Lin Hammill
  11. Math and the modules of the mind (post)
  12. Mathematical discourse: Language, symbolism and visual images, K. L. O’Halloran.
  13. Mathematical objects (abmath)
  14. Mathematical objects (Wikipedia)
  15. Mathematical objects are “out there?” (post)
  16. Metaphors in computing science ​(post)
  17. Procept (Wikipedia)
  18. Representations 2 (post)     
  19. Representations and models (abstractmath)
  20. Representations II: dry bones (post)
  21. Representation theorems (Wikipedia) Concrete representations of abstractly defined objects.
  22. Representation theory (Wikipedia) Linear representations of algebraic structures.
  23. Semiotics, symbols and mathematical visualization, Norma Presmeg, 2006.
  24. The transition to formal thinking in mathematics, David Tall, 2010
  25. Theory in mathematical logic (Wikipedia)
  26. What is the object of the encapsulation of a process? Tall et al., 2000.
  27. Where mathematics comes from, by George Lakoff and Rafael Núñez, Basic Books, 2000. 
  28. Where mathematics comes from (Wikipedia) This is a review of the preceding book.  It is a permanent link to the version of 04:23, 25 October 2012.  The review is opinionated, partly wrong, not well written and does not fit the requirements of a Wikipedia entry.  I recommend it anyway; it is well worth reading.  It contains links to three other reviews.

Notes on Viewing  

This post uses MathJax. If you see mathematical expressions with dollar signs around them, or badly formatted formulas, try refreshing the screen. Sometimes you have to do it two or three times.

Representing and thinking about sets

Notes on viewing

Representations of sets

Sets are represented in the math literature in several different ways, some mentioned here.  Also mentioned are some other possibilities.  Introducing a variety of representations of any type of math object is desirable because students tend to assume that the representation is the object.

Curly bracket notation

The standard representation for a finite set is of the form "$\{1,3,5,6\}$". This particular example represents the unique set containing the integers $1$, $3$, $5$ and $6$ and nothing else. This means precisely that the statement "$n$ is an element of $S$" is true if $n=1$, $n=3$, $n=5$ or $n=6$, and it is false if $n$ represents any other mathematical object. 

In the way the notation is usually used, "$\{1,3,5,6\}$", "$\{3,1,5,6\}$", "$\{1,5,3,6\}$",  "$\{1,6,3,5,1\}$" and $\{ 6,6,3,5,1,5\}$ all represent the same set. Textbooks sometimes say "order and repetition don't matter". But that is a statement about this particular representation style for sets. It is not a statement about sets.

It would be nice to come up with a representation for sets that doesn't involve an ordering. Traditional algebraic notation is essentially one-dimensional and so automatically imposes an ordering (see Algebra is a difficult foreign language).    

Let the elements move

In Visible Algebra II, I experimented with the idea of putting the elements at random inside a circle and letting them visibly move around like goldfish in a bowl.  (That experiment was actually for multisets but it applies to sets, too.)  This is certainly a representation that does not impose an ordering, but it is also distracting.  Our visual system is attracted to movement (but not as much as a cat's visual system).  

Enforce natural ordering

One possibility would be to extend the machinery in a visible algebra system that allows you to make a box you could drag elements into. 

This box would order the elements in some canonical order (numerical order for numbers, alphabetical order for strings of letters or words) with the property that if you inserted an element in the wrong place it would rearrange itself, and if you tried to insert an element more than once the representation would not change.  What you would then have is a unique representation of the set.

An example is the device below.  (If you have Mathematica, not just CDF player, you can type in numbers as you wish instead of having to use the buttons.) 

This does not allow a representation of a heterogenous set such as $\{3,\mathbb{R},\emptyset,\left(\begin{array}{cc}1&2\\0&1\\ \end{array}\right)\}$.  So what?  You can't represent every function by a graph, either.

Hanger notation

The tree notation used in my visual algebra posts could be used for sets as well, as illustrated below. The system allows you to drag the elements listed into different positions, including all around the set node. If you had a node for lists, that would not be possible.

This representation has the pedagogical advantage of shows that a set is not its elements.

  • A set is distinct from its elements
  • A set is completely determined by what the elements are.

Pattern recognition

Infinite sets are sometimes represented using the curly bracket notation using a pattern that defines the set.  For example, the set of even integers could be represented by $\{0,2,4,6,\ldots\}$.  Such a representation is necessarily a convention, since any beginning pattern can in fact represent an infinite number of different infinite sets.  Personally, I would write, "Consider the even integers $\{0,2,4,6,\ldots\}$", but I would not write,  "Consider the set $\{0,2,4,6,\ldots\}$".

By the way, if you are writing for newbies, you should say,"Consider the set of even integers $\{0,2,4,6,\ldots\}$". The sentence "Consider the even integers $\{0,2,4,6,\ldots\}$" is unambiguous because by convention a list of numbers in curly brackets defines a set. But newbies need lots of redundancy.

Representation by a sentence

Setbuilder notation is exemplified by $\{x|x>0\}$, which denotes the positive reals, given a convention or explicit statement that $x$ represents a real number.  This allows the representation of some infinite sets without depending on a possibly ambiguous pattern. 

A Visible Algebra system needs to allow this, too. That could be (necessarily incompletely) done in this way:

  • You type in a sentence into a Setbuilder box that defines the set.
  • You then attach a box to the Setbuilder box containing a possible element.
  • The system then answers Yes, No, or Can't Tell.

The Can't Tell answer is a necessary requirement because the general question of whether an element is in a set defined by a first order sentence is undecidable. Perhaps the system could add some choices:

  • Try for a second.
  • Try for an hour.
  • Try for a year.
  • Try for the age of the universe.

Even so, I'll bet a system using Mathematica could answer many questions like this for sentences referring to a specific polynomial, using the Solve or NSolve command.  For example, the answer to the question, "Is $3\in\{n|n\lt0 \text{ and } n^2=9\}$?" (where $n$ ranges over the integers) would be "No", and the answer to  "Is $\{n|n\lt0 \text{ and } n^2=9\}$ empty?" would also be "No". [Corrected 2012.10.24]

References

  1. Explaining “higher” math to beginners (previous post)
  2. Algebra is a difficult foreign language (previous post)
  3. Visible Algebra II (previous post)
  4. Sets: Notation (abstractmath article)
  5. Setbuilder notation (Wikipedia)

Notes on Viewing  

  • This post uses MathJax. If you see mathematical expressions with dollar signs around them, or badly formatted formulas, try refreshing the screen. Sometimes you have to do it two or three times.
  • To manipulate the demos in this post, you must have Wolfram CDF Player installed on your computer. It is available free from the Wolfram website. The code for the demos is in the Mathematica notebook Representing sets.nb.  

Algebra is a difficult foreign language

Note: This post uses MathJax.  If you see mathematical formulas with dollar signs around them, or badly formatted formulas, try refreshing the screen. Sometimes you have to do it two or three times.

Algebra

In a previous post, I said that the symbolic language of mathematics is difficult to learn and that we don't teach it well. (The symbolic language includes as a subset the notation used in high school algebra, precalculus, and calculus.) I gave some examples in that post but now I want to go into more detail.  This discussion is an incomplete sketch of some aspects of the syntax of the symbolic language.  I will write one or more posts about the semantics later.

The languages of math

First, let's distinguish between mathematical English and the symbolic language of math. 

  • Mathematical English is a special register or jargon of English. It has not only its special vocabulary, like any jargon, but also used ordinary English words such as "If…then", "definition" and "let" in special ways. 
  • The symbolic language of math is a distinct, special-purpose written language which is not a dialect of the English language and can in fact be read by mathematicians with little knowledge of English.
    • It has its own symbols and rules that are quite different from spoken languages. 
    • Simple expressions can be pronounced, but complicated expressions may only be pointed to or referred to.
  • A mathematical article or book is typically written using mathematical English interspersed with expressions in the symbolic language of math.

Symbolic expressions

A symbolic noun (logicians call it a term) is an expression in the symbolic language that names a number or other mathematical object, and may carry other information as well.

  • "3" is a noun denoting the number 3.
  • "$\text{Sym}_3$" is a noun denoting the symmetric group of order 3.
  • "$2+1$" is a noun denoting the number 3.  But it contains more information than that: it describes a way of calculating 3 as a sum.
  • "$\sin^2\frac{\pi}{4}$" is a noun denoting the number $\frac{1}{2}$, and it also describes a computation that yields the number $\frac{1}{2}$.  If you understand the symbolic language and know that $\sin$ is a numerical function, you can recognize "$\sin^2\frac{\pi}{4}$" as a symbolic noun representing a number even if you don't know how to calculate it.
  • "$2+1$" and "$\sin^2\frac{\pi}{4}$" are said to be encapsulated computations.
    • The word "encapsulated" refers to the fact that to understand what the expressions mean, you must think of the computation not as a process but as an object.
    • Note that a computer program is also an object, not a process.
  • "$a+1$" and "$\sin^2\frac{\pi x}{4}$" are encapsulated computations containing variables that represent numbers. In these cases you can calculate the value of these computations if you give values to the variables.  

symbolic statement is a symbolic expression that represents a statement that is either true or false or free, meaning that it contains variables and is true or false depending on the values assigned to the variables.

  • $\pi\gt0$ is a symbolic assertion that is true.
  • $\pi\lt0$ is a symbolic assertion that it is false.  The fact that it is false does not stop it from being a symbolic assertion.
  • $x^2-5x+4\gt0$ is an assertion that is true for $x=5$ and false for $x=1$.
  • $x^2-5x+4=0$ is an assertion that is true for $x=1$ and $x=4$ and false for all other numbers $x$.
  • $x^2+2x+1=(x+1)^2$ is an assertion that is true for all numbers $x$. 

Properties of the symbolic language

The constituents of a symbolic expression are symbols for numbers, variables and other mathematical objects. In a particular expression, the symbols are arranged according to conventions that must be understood by the reader. These conventions form the syntax or grammar of symbolic expressions. 

The symbolic language has been invented piecemeal by mathematicians over the past several centuries. It is thus a natural language and like all natural languages it has irregularities and often results in ambiguous expressions. It is therefore difficult to learn and requires much practice to learn to use it well. Students learn the grammar in school and are often expected to understand it by osmosis instead of by being taught specifically.  However, it is not as difficult to learn well as a foreign language is.

In the basic symbolic language, expressions are written as strings of symbols.

  • The symbolic language gives (sometimes ambiguous) meaning to symbols placed above or below the line of symbols, so the strings are in some sense more than one dimensional but less than two-dimensional.
  • Integral notation, limit notation, and others, are two-dimensional enough to have two or three levels of symbols. 
  • Matrices are fully two-dimensional symbols, and so are commutative diagrams.
  • I will not consider graphs (in both senses) and geometric drawings in this post because I am not sure what I want to write about them.

Syntax of the language

One of the basic methods of the symbolic language is the use of constructors.  These can usually be analyzed as functions or operators, but I am thinking of "constructor" as a linguistic device for producing an expression denoting a mathematical object or assertion. Ordinary languages have constructors, too; for example "-ness" makes a noun out of a verb ("good" to "goodness") and "and" forms a grouping ("men and women").

Special symbols

The language uses special symbols both as names of specific objects and as constructors.

  • The digits "0", "1", "2" are named by special symbols.  So are some other objects: "$\emptyset$", "$\infty$".
  • Certain verbs are represented by special symbols: "$=$", "$\lt$", "$\in$", "$\subseteq$".
  • Some constructors are infixes: "$2+3$" denotes the sum of 2 and 3 and "$2-3$" denotes the difference between them.
  • Others are placed before, after, above or even below the name of an object.  Examples: $a'$, which can mean the derivative of $a$ or the name of another variable; $n!$ denotes $n$ factorial; $a^\star$ is the dual of $a$ in some contexts; $\vec{v}$ constructs a vector whose name is "$v$".
  • Letters from other alphabets may be used as names of objects, either defined in the context of a particular article, or with more nearly global meaning such as "$\pi$" (but "$\pi$" can denote a projection, too).

This is a lot of stuff for students to learn. Each symbol has its own rules of use (where you put it, which sort of expression you may it with, etc.)  And the meaning is often determined by context. For example $\pi x$ usually means $\pi$ multiplied by $x$, but in some books it can mean the function $\pi$ evaluated at $x$. (But this is a remark about semantics — more in another post.)

"Systematic" notation

  • The form "$f(x)$" is systematically used to denote the value of a function $f$ at the input $x$.  But this usage has variations that confuse beginning students:
    • "$\sin\,x$" is more common than "$\sin(x)$".
    • When the function has just been named as a letter, "$f(x)$" is more common that "$fx$" but many authors do use the latter.
  • Raising a symbol after another symbol commonly denotes exponentiation: "$x^2$" denotes $x$ times $x$.  But it is used in a different meaning in the case of tensors (and elsewhere).
  • Lowering a symbol after another symbol, as in "$x_i$"  may denote an item in a sequence.  But "$f_x$" is more likely to denote a partial derivative.
  • The integral notation is quite complicated.  The expression \[\int_a^b f(x)\,dx\] has three parameters, $a$, $b$ and $f$, and a bound variable $x$ that specifies the variable used in the formula for $f$.  Students gradually learn the significance of these facts as they work with integrals. 

Variables

Variables have deep problems concerned with their meaning (semantics). But substitution for variables causes syntactic problems that students have difficulty with as well.

  • Substituting $4$ for $x$ in the expression $3+x$ results in $3+4$. 
  • Substituting $4$ for $x$ in the expression $3x$ results in $12$, not $34$. 
  • Substituting "$y+z$" in the expression $3x$ results in $3(y+z)$, not $3y+z$.  Some of my calculus students in preforming this substitution would write $3\,\,y+z$, using a space to separate.  The rules don't allow that, but I think it is a perfectly natural mistake. 

Using expressions and writing about them

  • If I write "If $x$ is an odd integer, then $3+x$ is odd", then I am using $3+x$ in a sentence. It is a noun denoting an unspecified number which can be constructed in a specified way.
  • When I mention substituting $4$ for $x$ in "$3+x$", I am talking about the expression $3+x$.  I am not writing about a number, I am writing about a string of symbols.  This distinction causes students major difficulties and teacher hardly ever talk about it.
  • In the section on variables, I wrote "the expression $3+x$", which shows more explicitly that I am talking about it as an expression.
    • Note that quotes in novels don't mean you are talking about the expression inside the quotes, it means you are describing the act of a person saying something.
  • It is very common to write something like, "If I substitute $4$ for $x$ in $3x$ I get $3 \times 4=12$".  This is called a parenthetic assertion, and it is literally nonsense (it says I get an equation).
  • If I pronounce the sentence "We know that $x\gt0$" we pronounce "$x\gt0$" as "$x$ is greater than zero",  If I pronounce the sentence "For any $x\gt0$ there is $y\gt0$ for which $x\gt y$", then I pronounce the expression "$x\gt0$" as "$x$ greater than zero$",  This is an example of context-sensitive pronunciation
  • There is a lot more about parenthetic assertions and context-sensitive pronunciation in More about the languages of math.

Conclusion

I have described some aspects of the syntax of the symbolic language of math. Learning that syntax is difficult and requires a lot of practice. Students who manage to learn the syntax and semantics can go on to learn further math, but students who don't are forever blocked from many rewarding careers. I heard someone say at the MathFest in Madison that about 25% of all high school students never really understand algebra.  I have only taught college students, but some students (maybe 5%) who get into freshman calculus in college are weak enough in algebra that they cannot continue. 

I am not proposing that all aspects of the syntax (or semantics) be taught explicitly.  A lot must be learned by doing algebra, where they pick up the syntax subconsciously just as they pick up lots of other behavior-information in and out of school. But teachers should explicitly understand the structure of algebra at least in some basic way so that they can be aware of the source of many of the students' problems. 

It is likely that the widespread use of computers will allow some parts of the symbolic language of math to be replaced by other methods such as using Excel or some visual manipulation of operations as suggested in my post Mathematical and linguistic ability.  It is also likely that the symbolic language will gradually be improved to get rid of ambiguities and irregularities.  But a deliberate top-down effort to simplify notation will not succeed. Such things rarely succeed.

References

 

 

Abstract objects

Some thoughts toward revising my article on mathematical objects.  

Mathematical objects are a kind of abstract object.  There are lots of abstract objects that are not mathematical objects,  For example, if you keep a calendar or schedule for appointments, that calendar is an abstract object.  (This example comes from [2]). 

It may be represented as a physical object or you may keep it entirely in your head.  I am not going to talk about the latter possibility, because I don't know what to say.

  1. If it is a paper calendar, that physical object represents the information that is contained in your calendar.  
  2. Same for a calendar on a computer, but that is stored as magnetic bits on a disk or in flash memory. A computer program (part of the operating system) is required to present it on the screen in such a way that you can read it.  Each time you open it, you get a new physical representation of the calendar.

Your brain contains a module (see [5], [7]) that interprets the representation in (1) or (2) and which has connections with other modules in your brain for dates, times, locations and whether the appointment is for a committee, a medical exam, or whatever.  

The calendar-interpreter module in your brain is necessary for the physical object to be a calendar.  The physical object is not in itself your calendar.  The calendar in this sense does not exist in the physical world.  It is abstract.  Since we think of it as a thing, it is an abstract object.

The abstract object "my calendar" affects the physical world (it causes you to go to the dentist next Tuesday).  The relation of the abstract object to the physical world is mediated by whatever physical object you call your calendar along with the modules in the brain that relate to it.  The modules in the brain are actions by physical objects, so this point of view does not involve Cartesian style dualism.

Note:  A module is a meme.  Are all memes modules?  This needs to be investigated.  Whatever they are, they exist as physical objects in people's brains.

Mathematical objects

A rigorous proof of a theorem about a mathematical object tends to refer to the object as if it were absolutely static and did not affect anything in the physical world.  I talked about this in [10], where I called it the dry bones representation of a mathematical object.  Mathematical objects don't have to be thought of this way, but (I suggest) what makes them mathematical objects is that they can be thought of in dry bones mode.  

If you use calculus to figure out how much fuel to use in a rocket to make it go a mile high, then actually use that amount in the rocket and send it off, your calculations have affected your physical actions, so you were thinking of the calculations as an abstract object.  But if you sit down to check your calculations, you concentrate on the steps one by one with the rules of algebra and calculus in mind.  You are looking at them as inert objects, like you would look at a bone of a dinosaur to see what species it belongs to. From that point of view your calculations form a mathematical object, because you are using the dry-bones approach.

Caveat

All this blather is about how you should think about mathematical objects.  It can be read as philosophy, but I have no intention of defending it as philosophy.  People learning abstract math at college level have a lot of trouble thinking about mathematical objects as objects, and my intention is to start clarifying some aspects of how you think about them in different circumstances.  (The operative word is "start" — there is a lot more to be said.)

About the exposition of this post (a commercial)

You will notice that I gave examples of abstract objects but did not define the word "abstract object".  I did the same with mathematical objects.  In both cases, I put the word "abstract object" or "mathematical object" in boldface at a suitable place in the exposition.

That is not the way it is done in math, where you usually make the definition of a word in a formal way, marking it as Definition, putting the word in bold or italics, and listing the attributes it must have.  I want to point out two things:

  • For the most part, that behavior is peculiar to mathematics.
  • This post is not a presentation of mathematical ideas.  

This gives me an opportunity for a commercial:  Read what we have written about definitions in References [1], [3] and [4].

References

  1. Atish Bagchi and Charles Wells, Varieties of Mathematical Prose, 1998.
  2. Reuben Hersh, What is mathematics, really? Oxford University Press, 1997
  3. Charles Wells, Handbook of Mathematical Discourse.
  4. Charles Wells, Mathematical objects in abstractmath.org
  5. Math and modules of the mind (previous post)
  6. Mathematical Concepts (previous post)
  7. Thinking about abstract math (previous post)
  8. Terrence W. Deacon, Incomplete Nature.  W. W. Norton, 2012. [I have read only a little of this book so far, but I think he is talking about abstract objects in the sense I have described above.]
  9. Gideon Rosen, Abstract Objects.  Stanford Encyclopedia of Philosophy.
  10. Representations II: Dry Bones (previous post)

 

http://plato.stanford.edu/entries/abstract-objects/

Freezing a family of functions

To manipulate the diagrams in this post, you must have Wolfram CDF Player installed on your computer. It is available free from the Wolfram website. The Mathematica notebooks used here are listed in the references below.

Some background

  • Generally, I have advocated using all sorts of images and metaphors to enable people to think about particular mathematical objects more easily.
  • In previous posts I have illustrated many ways (some old, some new, many recently using Mathematica CDF files) that you can provide such images and metaphors, to help university math majors get over the abstraction cliff.
  • When you have to prove something you find yourself throwing out the images and metaphors (usually a bit at a time rather than all at once) to get down to the rigorous view of math [1], [2], [3], to the point where you think of all the mathematical objects you are dealing with as unchanging and inert (not reacting to anything else).  In other words, dead.
  • The simple example of a family of functions in this post is intended to give people a way of thinking about getting into the rigorous view of the family.  So this post uses image-and-metaphor technology to illustrate a way of thinking about one of the basic proof techniques in math (representing the object in rigor mortis so you can dissect it).  I suppose this is meta-math-ed.  But I don’t want to think about that too much…
  • This example also illustrates the difference between parameters and variables. The bottom line is that the difference is entirely in how we think about them. I will write more about that later.

 A family of functions

This graph shows individual members of the family of functions \( y=a\sin\,x\) for various values of $latex a$. Let’s look at some of the ways you can think about this.

  • Each choice of  “shows the function for that value of the parameter $latex a$”.  But really, it shows the graph of the function, in fact only the part between $latex x=-4$ and $latex x= 4$.
  • You can also think of it as showing the function changing shape as $latex a$ changes over time (as you slide the controller back and forth).

Well, you can graph something changing over time by introducing another axis for time.  When you graph vertical motion of a particle over time you use a two-dimensional picture, one axis representing time and the other the height of the particle. Our representation of the function $latex y=a\sin\,x$ is a two-dimensional object (using its graph) so we represent the function in 3-space, as in this picture, where the slider not only shows the current (graph of the) function for parameter value $latex a$ but also locates it over $latex a$ on the $latex z$ axis.

The picture below shows the surface given by $latex y=a\sin\,x$ as a function of both variables $latex a$ and $latex x$. Note that this graph is static: it does not change over time (no slide bar!). This is the family of functions represented as a rigorous (dead!) mathematical object.

If you click the “Show Curves” button, you will see a selection of the curves in middle diagram above drawn as functions of $latex x$ for certain values of $latex a$. Each blue curve is thus a sine wave of amplitude $latex a$. Pushing that button illustrates the process going on in your mind when you concentrate on one aspect of the surface, namely its cross-sections in the $latex x$ direction.

Reference [4] gives the code for the diagrams in this post, as well as a couple of others that may add more insight to the idea. Reference [5] gives similar constructions for a different family of functions.

References

  1. Rigorous view in abstractmath.org 
  2. Representations II: Dry Bones (post)
  3. Representations III: Rigor and Rigor Mortis (post)
  4. FamiliesFrozen.nb,  FamiliesFrozen.cdf (Mathematica file used to make this post)
  5. AnotherFamiliesFrozen.nbAnotherFamiliesFrozen.cdf (Mathematica file showing another family of functions)

 

Thinking about abstract math

 

The abstraction cliff

In universities in the USA, a math major typically starts with calculus, followed by courses such as linear algebra, discrete math, or a special intro course for math majors (which may be taken simultaneously with calculus), then go on to abstract algebra, analysis, and other courses involving abstraction and proofs.

At this point, too many of them hit a wall; their grades drop and they change majors.  They had been getting good grades in high school and in calculus because they were strong in algebra and geometry, but the sudden increase in abstraction in the newer courses completely baffles them. I believe that one big difficulty is that they can't grasp how to think about abstract mathematical objects.  (See Reference [9] and note [a].)   They have fallen off the abstraction cliff.  We lose too many math majors this way. (Abstractmath.org is my major effort to address the problems math majors have during or after calculus.)

This post is a summary of the way I see how mathematicians and students think about math.  I will use it as a reference in later posts where I will write about how we can communicate these ways of thinking.

Concept Image

In 1981, Tall and Vinner  [5] introduced the notion of the concept image that a person has about a mathematical concept or object.   Their paper's abstract says

The concept image consists of all the cognitive structure in the individual's mind that is associated with a given concept. This may not be globally coherent and may have aspects which are quite different from the formal concept definition.

The concept image you may have of an abstract object generally contains many kinds of constituents:

  • visual images of the object
  • metaphors connecting the object to other concepts
  • descriptions of the object in mathematical English
  • descriptions and symbols of the object in the symbolic language of math
  • kinetic feelings concerning certain aspects of the object
  • how you calculate parameters of the object
  • how you prove particular statements about the object

This list is incomplete and the items overlap.  I will write in detail about these ideas later.

The name "concept image" is misleading [b]), so when I have written about them, I have called them metaphors or mental representations as well as concept images, for example in [3] and [4].

Abstract mathematical concepts

This is my take on the notion of concept image, which may be different from that of most researchers in math ed. It owes a lot to the ideas of Reuben Hersh [7], [8].

  • An abstract mathematical concept is represented physically in your brain by what I have called "modules" [1] (physical constituents or activities of the brain [c]).
  • The representation generally consists of many modules.  They correspond to the list of constituents of a concept image given above.  There is no assumption that all the modules are "correct".
  • This representation exists in a semi-public network of mathematicians' and students' brains. This network exercises (incomplete) control over your personal representation of the abstract structure by means of conversation with other mathematicians and reading books and papers.  In this sense, an abstract concept is a social object.  (This is the only point of view in the philosophy of math that I know of that contains any scientific content.)

Notes

[a]  Before you object that abstraction isn't the only thing they have trouble with, note that a proof is an abstract mathematical object. The written proof is a representation of the abstract structure of the proof.  Of course, proofs are a special kind of abstract structure that causes special problems for students.

[b] Cognitive science people use "image" to include nonvisual representations, but not everyone does.  Indeed, cognitive scientists use "metaphor" as well with a broader meaning than your high school English teacher.  A metaphor involves the cognitive merging of parts of two concepts (specifically with other parts not merged). See [6].

[c] Note that I am carefully not saying what the modules actually are — neurons, networks of neurons, events in the brain, etc.   From the point of view of teaching and understanding math, it doesn't matter what they are, only that they exist and live in a society where they get modified by memes  (ideas, attitudes, styles physically transmitted from brain to brain by speech, writing, nonverbal communication, appearance, and in other ways).

References

  1. Math and modules of the mind (previous post)
  2. Mathematical Concepts (previous post)
  3. Mental, physical and mathematical representations (previous post)
  4. Images and Metaphors (abstractmath.org)
  5. David Tall and Schlomo Vinner, Concept Image and Concept Definition in Mathematics with particular reference to limits and continuity, Journal Educational Studies in Mathematics, 12 (May, 1981), no. 2, 151–169.
  6. Conceptual metaphor (Wikipedia article).
  7. What is mathematics, really? by Reuben Hersh, Oxford University Press, 1999.  Read online at Questia.
  8. 18 Unconventional Essays on the Nature of Mathematics, by Reuben Hersh. Springer, 2005.
  9. Mathematical objects (abstractmath.org).

 

 

Representations 2

Introduction

In a recent post I began a discussion of the mental, physical and mathematical representations of a mathematical object. The discussion continues here. Mathematicians, linguists, cognitive scientists and math educators have investigate some aspects of this topic, but there are many subtle connections between the different ideas which need to be studied.

I don’t have any overall theoretical grasp of these relationships. What I will do here is grope for an overall theory by mentioning a whole bunch of fine points. Some of these have been discussed in the literature and some (as far as I know) have not been discussed.  Many of them (I hope)  can be described as “an obvious fact about representations but no one has pointed it out before”.  Such fine points could be valuable; I think some scholars who have written about mathematical discourse and math in the classroom are not aware of many of these facts.

I am hoping that by thrashing around like this here (for graphs of functions) and for other concepts (set, function, triangle, number …) some theoretical understanding may emerge of what it means to understand math, do math, and talk about math.

The graph of a function

Let’s look at the graph of the function $latex {y=x^3-x}&fg=000000$.

What you are looking at is a physical representation of the graph of the function. The graph creates in your brain a mental representation of the graph of the function. These are subtly related to each other and to the mathematical definition of the graph.

Fine points

  1. The mathematical definition [2] of the graph of this function is: The set of ordered pairs of numbers $latex {(x,x^3-x)}&fg=000000$ for all real numbers $latex {x}&fg=000000$.
  2. In the physical representation, each point $latex {(x,x^3-x)}&fg=000000$ is shown in a location determined by the conventional $latex {x-y}&fg=000000$ coordinate system, which uses a straight-line representation of the real numbers with labels and ticks.
    • The physical representation makes use of the fact that the function is continuous. It shows the graph as a curving line rather than a bunch of points.
    • The physical representation you are looking at is not the physical representation I am looking at. They are on different computer screens or pieces of paper. We both expect that the representations are very similar, in some sense physically isomorphic.
    • “Location” on the physical representation is a physical idea. The mathematical location on the mathematical graph is essentially the concept of the physical location refined as the accuracy goes to infinity. (This last statement is a metaphor attached to a genuine mathematical construction, for example Cauchy sequences.)
  3. The mathematical definition of “graph” and the physical representation are related by a metaphor. (See Note 1.)
    • The physical curve in blue in the picture corresponds via the metaphor to the graph in the mathematical sense: in this way, each location on the physical curve corresponds to an ordered pair of the form $latex {(x,x^3-x)}&fg=000000$.
    • The correspondence between the locations and the pairs is imperfect. You can’t measure with infinite accuracy.
    • The set of ordered pairs $latex {(x,x^3-x)}&fg=000000$ form a parametrized curve in the mathematical sense. This curve has zero thickness. The curve in the physical representation has positive thickness.
    • Not all the points in the mathematical graph actually occur on the physical curve: The physical curve doesn’t show the left and right infinite tails.
    • The physical curve is drawn to show some salient characteristics of the curve, such as its extrema and inflection points. This is expected by convention in mathematical writing. If the graph had left out a maximum, for example, the author would be constrained (by convention!) to say so.
    • An experienced mathematician or advanced student understands the fine points just listed. A newbie may not, and may draw false conclusions about the function from the graph. (Note 2.)
  4. If you are a mathematician or at least a math student, seeing the physical graph shown above produces a mental image(see Note 3.) of the graph in your mind.
  5. The mathematical definition and the mental image are connected by a metaphor. This is not the same metaphor as the one that connects the physical representation and the mathematical definition.
    • The curve I visualize in my mental representation has an S shape and so does the physical representation. Or does it? Isn’t the S-ness of the shape a fact I construct mentally (without consciously intending to do so!)?
    • Does the curve in the mental rep have thickness? I am not sure this is a meaningful question. However, if you are a sufficiently sophisticated mathematician, your mental image is annotated with the fact that the curve has zero thickness. (See Note 4.)
    • The curve in your mental image of the curve may very well be blue (just because you just looked at my picture) but you must have an annotation to the effect that that is irrelevant! That is the essence of metaphor: Some things are identified with each other and others are emphatically not identified.
    • The coordinate axes do exist in the physical representation and they don’t exist in the mathematical definition of the graph. Of course they are implied by the definition by the properties of the projection functions from a product. But what about your mental image of the graph? My own image does not show the axes, but I do “know” what the coordinates of some of the points are (for example, $latex {(-1,0)}&fg=000000$) and I “see” some points (the local maximum and the local minimum) whose coordinates I can figure out.

Notes

1. This is metaphor in the sense lately used by cognitive scientists, for example in [6]. A metaphor can be described roughly as two mental images in which certain parts of one are identified with certain parts of another, in other words a pushout. The rhetorical use of the word “metaphor” requires it to be a figure of speech expressed in a certain way (the identification is direct rather than expressed by “is like” or some such thing.)  In my use in this article a metaphor is something that occurs in your brain.  The form it takes in speech or writing is not relevant.

2. I have noticed, for example, that some students don’t really understand that the left and right tails go off to infinity horizontally as well as vertically.   In fact, the picture above could mislead someone into thinking the curve has vertical asymptotes: The right tail looks like it goes straight up.  How could it get to x equals a billion if it goes straight up?

3. The “mental image” is of course a physical structure in your brain.  So mental representations are physical representations.

4. I presume this “annotation” is some kind of physical connection between neurons or something.  It is clear that a “mental image” is some sort of physical construction or event in the brain, but from what little I know about cognitive science, the scientists themselves are still arguing about the form of the construction.  I would appreciate more information on this. (If the physical representation of mental images is indeed still controversial, this says nothing bad about cognitive science, which is very new.)

References

[1] Mental Representations in Math (previous post).

[2] Definitions (in abstractmath).

[3] Lakoff, G. and R. E. Núñez (2000), Where Mathematics Comes From. Basic Books.

Mental, Physical and Mathematical Representations

For a given mathematical object, a mathematician may have:

  • A mental representation of the object. This can be a metaphor, a mental image, or a kinesthetic understanding of the object.
  • A physical representation of the object. This may be a (physical) picture or drawing or three-dimensional model of the object.
  • A mathematical definition and one or more mathematical representations for the object. Such a representation is itself a mathematical object.

The boldface things in this list are related to each other in lots of ways, and they are fuzzy and overlap and don’t include every phenomenon connected with a math object.

I have written about these things ([1], [2], [3], [4]). So have lots of other people. In this post I summarize these ideas. I expect to write about particular examples later on and will use this as a reference.

Two Examples

The following examples point out a few of the relationships between the ideas in boldface above. There is much more to understand.

Function as black box

The idea that a function is a black box or machine with input and output is a metaphor for a function.

A is a metaphor for B means that A and B are cognitively pasted together in such a way that the behavior of A is in many ways like the behavior of B. Such a thing is both useful and dangerous, dangerous because there will be ways in which A behaves that suggest inappropriate ideas about B.

The function as machine is a good metaphor: for example functional composition involves connecting the output of one machine to the input of another, and the inverse function is like running the machine backward.

The function as machine is a bad metaphor: For example, it is wrong to think you could build a machine to calculate any given function exactly. But you can still imagine such a machine, given by a specification (it outputs the value of the function at a given input) and then, in your imagination, connecting the input of one to the output of another must perforce calculate the composite of the corresponding functions.

Like any metaphor, this is a mental representation. That means the metaphor has a physical instantiation in your brain. So a metaphor has a physical representation.

Different people won’t have quite the same concept of a particular metaphor. So a metaphor will have lots of slightly different physical representations, but mathematicians form a community, and communication between mathematicians fine-tunes the different physical instantiations so that they correspond more closely to each other. This is the sense in which mathematical objects have a shared existence in a community as Reuben Hersh has suggested.

A function is a mathematical object, which can be rigorously specified as a set of ordered pairs together with a domain and a codomain. There is a cognitive relationship between the concepts of function as math object and function as black box with input and output.

Triangle

A triangle can be drawn, or created on a computer and a physical image printed out. You may also have a mental image of the triangle.

The physical and the mental images are not the same thing, but they are definitely related. The relationship is mediated by the neuronal circuitry behind your retinas, which performs a highly sophisticated transformation of the pixels on your retina into an organized physical structure in your brain, connected to various other neurons.

This circuitry exists because it helps us get a useful understanding of the world through our eyes. So a picture of a triangle takes advantage of pre-existing neuron structure to generate a useful mental representation that helps us understand and prove things about triangles.

This mental representation also lives in a community of mathematician. Like any community, it has subgroups with “dialects” — varying understanding of representation.

For example, a mathematician who looks at the triangle below sees a triangle that looks like a right triangle. A student sees a triangle that is a right triangle.

This is “sees” in the sense of what their brain reports after all that processing. The mathematician’s brain connects the “triangle I am seeing” module (in their brain) to the “looks like a right triangle” module, but does not connect it to the “is a right triangle” module because they don’t see any statement in the surrounding text that it is a right triangle. The student, on the other hand, fallaciously makes the connection to “is a right triangle” directly.

In some sense, a student who does not make that connection directly is already a mathematician.

A triangle also exists as a mathematical object in your and my brain. It is described by a formal mathematical definition. The pictures of triangles you see above do not fit this definition. For one thing, the line segments in the pictures have thickness. But the pictures trigger a reaction in your neurons that causes your brain to cognitively paste together the line segments in the drawing to the segments required by the formal definition. This is a kind of metaphor of concrete-to-abstract that connects drawings to math objects that mathematicians use all the time.

Note that this “concrete-to-abstract metaphor” itself has a physical existence in your brain.  It drops, for example, the property of thickness that the line segments in the drawing have when matching them (in the metaphor) with the line segments in the corresponding abstract triangle.  On the other hand, it preserves the sense the all three angles in the triangle are acute.  The abstract mathematical concept of triangle (the generic triangle) has no requirement on the angles except that they add up to pi.

Summary

The discussions above describe a few of the complex and subtle relationships that exist between

  • Mental representations of math objects
  • Physical representations of math objects
  • Formally defined math objects and their formally defined representations.

I have purported to discuss how mathematics is understood (especially in connection with language) in several articles and a book but only a few of the relationships I just described are mentioned in any of those articles. Perhaps one or two things I said caused you to react: “Actually, that’s obviously true but I never thought of it before”. (Much the way I had mathematicians in the ’60′s tell me, “I see what you mean that addition is a function of two variables, but I never thought of it that way before”.) (I was a brash category theorist wannabe then.)

A lot of research has been done on understanding math, and some research has been done on mathematical discourse. But what has been done has merely exposed the fin of the shark.

References

[1] Images and metaphors (in abstractmath).

[2] Representations and Models (in abstractmath).

[3] Mathematical Concepts (previous blog).

[4] Mental Representations in Math (previous blog).

Thinking about mathematical objects revisited

How we think about X

It is notable that many questions posted at MathOverflow are like, “How should I think about X?”, where X can be any type of mathematical object (quotient group, scheme, fibration, cohomology and so on).  Some crotchety contributors to that group want the questions to be specific and well-defined, but “how do I think about…” questions  are in my opinion among the most interesting questions on the website.  (See note [a]).

Don’t confuse “How do I think about X” with “What is X really?” (pace Reuben Hersh).  The latter is a philosophical question.  As far as I am concerned, thinking about how to think about X is very important and needs lots of research by mathematicians, educators, and philosophers — for practical reasons: how you think about it helps you do it.   What it really is is no help and anyway no answer may exist.

Inert and eternal

The idea that mathematical objects should be thought of as  “inert” and “eternal”  has been around for awhile.  (Never mind whether they really are inert and eternal.)  I believe, and have said in the past [1], that thinking about them that way clears up a lot of confusion in newbies concerning logical inference.

  • That mathematical objects are “inert” means that the do not cause anything. They have no effect on the real world or on each other.
  • That they are “eternal” means they don’t change over time.

Naturally, a function (a mathematical object) can model change over time, and it can model causation, too, in that it can describe a process that starts in one state and achieves stasis in another state (that is just one way of relation functions to causation).  But when we want to prove something about a type of math object, our metaphorical understanding of them has to lose all its life and color and go dead, like the dry bones before Ezekiel started nagging them.

It’s only mathematical reasoning if it is about dead things

The effect on logical inference can be seen in the fact that “and” is a commutative logical operator. 

  • “x > 1 and x < 3″ means exactly the same thing as “x < 3 and x > 1″
  • “He picked up his umbrella and went outside” does not mean the same thing as “He went outside and picked up his umbrella”.

The most profound effect concerns logical implication.  “If  x > 1 then x > 0″ says nothing to suggest that x > 1 causes it to be the case that x > 0.  It is purely a statement about the inert truth sets of two predicates lying around the mathematical boneyard of objects:  The second set includes the first one.  This makes vacuous implication perfectly obvious.  (The number -1 lies in neither truth set and is irrelevant to the fact of inclusion).

Inert and eternal rethought

There are better metaphors than these.  The point about the number 3 is that you think about it as outside time. In the world where you think about 3 or any other mathematical object, all questions about time are meaningless.

  • In the sentence “3 is a prime”, we need a new tense in English like the tenses ancient (very ancient) Greek and Hebrew were supposed to have (perfect with gnomic meaning), where a fact is asserted without reference to time.
  • Since causation involves this happens, then this happens, all questions about causation are meaningless, too.  It is not true that 3 causes 6 to be composite, while being irrelevant to the fact that 35 is composite.

This single metaphor “outside time” thus can replace the two metaphors “inert” and “eternal” and (I think) shows that the latter two are really two aspects of the same thing.

Caveat

Thinking of math objects as outside time is a Good Thing when you are being rigorous, for example doing a proof.  The colorful, changing, full-of-life way of thinking of math that occurs when you say things like the statements below is vitally necessary for inspiring proofs and for understanding how to apply the mathematics.

  • The harmonic series goes to infinity in a very leisurely fashion.
  • A function is a machine — when you dump in a number it grinds away and spits out another number.
  • At zero, this function vanishes.

Acknowledgment

Thanks to Jody Azzouni for the italics (see [3]).

Notes

a.  Another interesting type of question  “in what setting does such and such a question (or proof) make sense?” .  An example is my question in [2].

References

1.  Proofs without dry bones

2. Where does the generic triangle live?

3. The revolution in technical exposition II.