Category Archives: exposition

Liberal-artsy people

I graduated from Oberlin College with a B.A. as a math major and minors in philosophy and English literature, with only three semesters of science courses.  I was and am "liberal-artsy".   As professor of math at Case Western Reserve University,  I had lots of colleagues in both pure and applied math who started out with B.Sc. degrees. We did not always understand each other very well!

Caveat: "Liberal-artsy" and "Narrowly Focused B.Sc. type" (I need a better name) are characteristics that people may have in varying amounts, and many professors in science and math have both characteristics.   I do, myself, although I am more L.A. that B.Sc.  Furthermore, I know nothing about any sociological or cognitive-science research on these characteristics.  I am making it all up as I write.  (This is a blog post, not a tome.)

I recently posted on secants and  tangents.  These articles were deliberately aimed to tickle the interests of L.A.  students.

Liberal-artsy types want to know about connections between concepts.  In each post, I wrote on both common meanings of the words (secant line and function, tangent line and function) and the close connections between them.  Some trig teachers / trig texts tell students about these connections but too many don't.   On the other hand, many B.Sc. types are left cold by such discussions.  B.Sc. types are goal-oriented and want to know a) how do I use it? b) how do I calculate it?  They get impatient when you talk about anything else.  I say point out these connections anyway.

L.A. types want to know about the reason for the name of a concept.  The post on secants refers to the metaphor that "secant" means "cutting". This is based on the etymology of "secant", which is hidden to many students  because it is based on Latin.  The post makes the connection that the "original" definition of "secant" was the length of a certain line segment generated by an angle in the unit circle. The post on tangents makes an analogous connection, and also points out that most tangent lines that students see touch the curve at only a single point, which is not a connotation of the English word "touch".

Many people think they have learned something when they know the etymology of a word.  In fact, the etymology of a word may have little or nothing to do with its current meaning, which may have developed over many centuries of metaphors that become dead, generate new metaphors that become dead, umpteen times, so that the original meaning is lost.  (The word "testimony" cam from a Latin phrase meaning hold your testicles, which is really not related to its meaning in present-day English.)

So I am not convinced that etymologies of names can help much in most cases.  In particular, different mathematical definitions of the same concept can be practically disjoint in terms of the data they use, and there is no one "correct" definition, although there may be only one that motivates the name.  (There often isn't a definition that motivates the name.  Think "group".)  But I do know that when I mention the history of a name of a concept in class, some students are fascinated and ask me questions about it.

L.A. types are often fascinated by ETBell-like stories about the mathematician who came up with a concept, and sometimes the stories illuminate the mathematical idea.  But L. A. types often are interested anyway.  It's funny when you talk about such a thing in class, because some students visibly tune out while others noticeably perk up and start paying attention.

So who should you cater to?  Answer:  Both kinds of students.  (Tell interesting stories, but quickly and in an offhand way.)

The posts on secants and tangents also experimented with using manipulable diagrams to illustrate the ideas.  I expect to write about that more in another post.

For more about the role of definitions, check out the abmath article and also Timothy Gowers' post on definitions (one of a series of excellent posts on working with abstract math).


Send to Kindle

Tangents

The interactive examples in this post require installing Wolfram CDF player, which is free and works on most desktop computers using Firefox, Safari and Internet Explorer, but not Chrome. The source code is the Mathematica Notebook Tangent Line.nb, which is available for free use under a Creative Commons Attribution-ShareAlike 2.5 License. The notebook can be read by CDF Player if you cannot make the embedded versions in this post work.

This is an experiment in exposition of the mathematical concepts of tangent.  It follows the same pattern as my previous post on secant, although that post has explanations of my motivation for this kind of presentation that are not repeated here.

Tangent line

A line is tangent to a curve (in the plane) at a given point if all the following conditions hold (Wikipedia has more detail.):

  1. The line is a straight line through the point.
  2. The curve goes through that point.
  3. The curve is differentiable in a neighborhood of the point.
  4. The slope of the straight line is the same as the derivative of the curve at that point.

In this picture the curve is $ y=x^3-x$ and the tangent is shown in red. You can click on the + signs for additional controls and information.

Etymology and metaphor

The word “tangent” comes from the Latin word for “touching”. (See Note below.) The early scholars who talked about “tangent” all read Latin and knew that the word meant touching, so the metaphor was alive to them.

The mathematical meaning of “tangent” requires that the tangent line have slope equal to the derivative of the curve at the point of contact. All of the red lines in the picture below touch the curve at the point (0, 1.5). None of them are tangent to the curve there because the curve has no derivative at the point:

The curve in this picture is defined by

The mathematical meaning restricts the metaphor. The red lines you can generate in the graph all touch the curve at one point, in fact at exactly at one point (because I made the limits on the slider -1 and 1), but there are not tangent to the curve.

Tangents can hug!

On the other hand, “touching” in English usage includes maintaining contact on an interval (hugging!) as well as just one point, like this:

The blue curve in this graph is given by

The green curve is the derivative dy/dx. Notice that it has corners at the endpoints of the unit interval, so the blue curve has no second derivative there. (See my post Curvature).

Tangent lines in calculus usually touch at the point of tangency and not nearby (although it can cross the curve somewhere else). But the red line above is nevertheless tangent to the curve at every point on the curve defined on the unit interval, according to the definition of tangent. It hugs the curve at the straight part.

The calculus-book behavior of tangent line touching at only one point comes about because functions in calculus books are always analytic, and two analytic curves cannot agree on an open set without being the same curve.

The blue curve above is not analytic; it is not even smooth, because its second derivative is broken at $x=0$ and $x=1$. With bump functions you can get pictures like that with a smooth function, but I am too lazy to do it.

Tangent on the unit circle

In trigonometry, the value of the tangent function at an angle $ \theta$ erected on the x-axis is the length of the segment of the tangent at (1,0) to the unit circle (in the sense defined above) measured from the x-axis to the tangent’s intersection with the secant line given by the angle. The tangent line segment is the red line in this picture:


This defines the tangent function for $ -\frac{\pi}{2} < x < \frac{\pi}{2}$.

The tangent function in calculus

That is not the way the tangent function is usually defined in calculus. It is given by \tan\theta=\frac{\sin\theta}{\cos\theta}, which is easily seen by similar triangles to be the same on -\frac{\pi}{2} < x < \frac{\pi}{2}.

We can now see the relationship between the geometric and the $ \frac{\sin\theta}{\cos\theta}$ definition of the tangent function using this graph:


The red segment and the green segment are always the same length.
It might make sense to extend the geometric definition to $ \frac{\pi}{2} < x < \frac{3\pi}{2}$ by constructing the tangent line to the unit circle at (-1,0), but then the definition would not agree with the $ \frac{\sin\theta}{\cos\theta}$ definition.

References

Send to Kindle

Case Study in Exposition: Secant

The interactive examples in this post require installing Wolfram CDF player, which is free and works on most desktop computers using Firefox, Safari and Internet Explorer, but not Chrome. The source code comes from several Mathematica notebooks lists in the References. The notebooks are available for free use under a Creative Commons Attribution-ShareAlike 2.5 License. The notebook can be read by CDF Player if you cannot make the embedded versions in this post work.

Pictures, metaphors and etymology

Math texts and too many math teachers do not provide enough pictures and metaphors to help students understand a concept.  I suspect that the etymology of the technical terms might also be useful. This post is an experimental exposition of the math concept of “secant” that use pictures, metaphors and etymology to describe the concept.

The exposition is interlarded with comments about what I am doing and why.  An exposition directly aimed at students would be slimmer — but some explanations of why you are doing such and such in an exposition are not necessarily out of place every time!

Secant Line

The word “secant” is used in various related ways in math.  To start with, a secant line on a curve is the unique line determined by two distinct points on the curve, like this:


The word “secant” comes from the Latin word for “cut”, which came from the Indo-European root “sek”, meaning “cut”.  The IE root also came directly into English via various Germanic sound changes to give us “saw” and “sedge”.

The picture

Showing pictures of mathematical objects that the reader can fiddle with may make it much easier to understand a new concept.  The static picture you get above by keeping your mitts off the sliders requires imagining similar lines going through other pairs of points. When you wiggle the picture you see similar lines going through other pairs of points.  You also get a very strong understanding of how the secant line is a function of the two given points.  I don’t think that is obvious to someone without some experience with such things.

This belief contains the hidden claim that individuals vary a lot on how they can see the possibilities in a still picture that stands as an example of a lot of similar mathematical objects.  (Math books are full of such pictures.)  So people who have not had much practice learning about possible variation in abstract structures by looking at one motionless one will benefit from using movable parametrized pictures of various kinds.  This is the sort of claim that is amenable to field testing.

The metaphor

Most metaphors are based on a physical phenomenon.  The mathematical meanings of “secant” use the metaphor of cutting.  When the word “secant” was first introduced by a European writer (see its etymology) in the 16th century, the word really was a metaphor.   In those days essentially every European scholar read Latin. To them “secant” would transparently mean “cutting”.  This is not transparent to many of us these days, so the metaphor may be hidden.

If you examine the metaphor you realize that (like all metaphors) it involves making some remarkably subtle connections in your brain.

  • The straight line does not really cut the curve.  Indeed, the curve itself is both an abstract object that is not physical, so can’t be cut, and also the picture you see on the screen, which is physical, but what would it mean to cut it?  Cut the screen?  The line can’t do that.
  • You can make up a story that (for example) the use was suggested by the mental image of a mark made by a knife edge crossing the plane at points a and b that looks like it is severing the curve.
  • The metaphor is restricted further by saying that it is determined by two points on the curve.   This restriction turns the general idea of secant line into a (not necessarily faithful!) two-parameter family of straight lines.  You could define such a family by using one point on the curve and a slope, for example.  This particular way of doing it with two points on the curve leads directly to the concept of tangent line as limit.

Secant on circle

Another use of the word “secant” is the red line in this picture:


This is the secant line on the unit circle determined by the origin and one point on the circle, with one difference: The secant of the angle is the line segment between the origin and the point on the curve.  This means it corresponds to a number, and that number is what we mean by “secant” in trigonometry.

To the ancient Greeks, a (positive) number was the length of a line segment.

The Definition

The secant of an angle $\theta$ is usually defined as $\frac{1}{\cos\theta}$, which you can see by similar triangles is the length of the red line in the picture above.

In many parts of the world, trig students don’t learn the word “secant”. They simply use $\frac{1}{\cos\theta}$.

This illustrates important facts about definitions:

  • Different equivalent definitions all make the same theorems true.
  • Different equivalent definitions can give you a very different understanding of the concept.

The red-line-segment-in-picture definition gives you a majorly important visual understanding of the concept of “secant”.  You can tell a lot from its behavior right off (it goes to infinity near $\pi/2$, for example).

The definition $\sec\theta=\frac{1}{\cos\theta}$ gives you a way of computing $\sec\theta$.  It also reduces the definition of $ \sec\theta$ to a previously known concept.

It used to be common to give only the $ \frac{1}{\cos\theta}$ definition of secant, with no mention of the geometric idea behind it.  That is a crime.  Yes, I know many students don’t want to “understand” stuff, they only want to know how to do the problems.  Teachers need to talk them out of that attitude.  One way to do that in this case is to test them on the geometric definition.

Etymology

This idea was known to the Arabs, and brought into European view in the 16th century by Danish mathematician Thomas Fincke in “Geometria Rotundi” (1583), where the first known use of the word “secant” occurs.  I have not checked, but I suspect from the title of the book that the geometric definition was the one he used in the book.

It wold be interesting to know the original Arabic name for secant, and what physical metaphor it is based on.  A cursory search of the internet gave me the current name in Arabic for secant but nothing else.

Graph of the secant function

The familiar graph of the secant function can be seen as generated by the angle sweeping around the curve, as in the picture below. The two red line segments always have the same length.


References

Mathematica notebooks used in this post:

 

Send to Kindle

Etymology

Retire

I was recently asked about the etymology of the English word “retire”(in connection with quitting work).  It comes from Old French “retirer”, compounded from “re” (meaning “back”, a prefix used in Latin) and the Old French verb “tirer” meaning something like “pull” (which comes from a Germanic language, not Latin, and is related to “tier”, but not apparently to “tire”).

Its earliest citations in the Oxford English Dictionary show meanings such as

  • Pull back or retreat from the enemy.
  • To move back for safety or storage (“they retired to their houses”).
  • Leave office or work permanently.

All these meanings appear in print in the 16th century.

What good does it do to know this?  Not much.  You can’t explain the modern meaning of a word knowing the meaning of its ancient roots.

In the case of “retire”, I can make up a story of meanings changing using a chain of metaphors.

  1. “Retirer” in French meant literally “pull back” in the physical sense, for example pulling on a dog’s leash to drag it back so it won’t get into a fight with another dog. This literal meaning has not survived in the English word “retire” (nor, I think, in the French word “retirer”).
  2. In the 12th century (sez the OED without citation) the French word was used to refer to an army pulling back from a battle.  This is clearly a metaphor based on the literal meaning.  In a phrase such as “The Army retired from battle” it has become intransitive, but perhaps people once said things like “The General retired the Army from battle”.  Note that in modern English we could use the exact same metaphor with “pull back”: “The General pulled the Army back from battle”, although “withdrew” would be more common.
  3. Now someone comes along and uses the metaphor “going to work is like being in a battle”, and says things like “He retired from his job”.   This happened in English before 1533 and the usage has survived to this day.  It is probably the commonest meaning of the word “retire” now.

Now all that is a story I made up.  It is plausible, but it might have happened in a different way.  It is not at all likely we will discover the workings of metaphors in the minds of people who lived 600 years ago.  (Conceivably someone could have written down their thoughts about the word “retire” and it will be discovered in an odd subcrypt of Durham Cathedral and some linguist would get very excited, but I could win the lottery, too).

That’s why knowing the original literal meaning of the roots of a modern English word really means nothing about the modern meaning.  There could have been many steps along the way where a metaphorical usage became the standard meaning, then someone took the standard meaning and used it in another metaphor, maybe many times.  And metaphors aren’t the only method.  Words can change meaning because of misunderstanding, specialization, generalization, use in secret languages that become public, and so on.

I didn’t include etymology in the Handbook, mainly for this reason.  But there are certain mathematical words where knowing the metaphor or even the literal meaning can be of help.  I’ll write about that in a separate article.

 

Send to Kindle

Some demos of families of functions

I have posted on abstractmath.org a CDF file of families of functions whose parameters you can control interactively. It is fascinating to play with them and see phenomena you (or at least I) did not anticipate.  Some of them have questions of the sorts you might ask students to discuss or work out.  Working out explanations for many of the phenomena demand some algebra skills, and sometimes more than that.

The Mathematica command that sets up one of the families looks like this:

Manipulate[
Plot[{Sin[a x], a Cos[a x]}, {x, -2 Pi, 2 Pi},
PlotRange -> {{-4, 4}, {-4, 4}}, PlotStyle -> {Blue, Red},
AspectRatio -> 1], {{a, 1}, -4, 4, Appearance -> “Labeled”}]

It would be straightforward to make a command something like

PlotFamily[functionlist, domain, plotrange]

with various options for colors, aspect ratio and so on that would do these graphs.  But I found it much to easy to simply cut and paste and put in the new inputs and parameters as needed.

This sort of Mathematica programming is not hard if you have an example to copy, but you do need to get over the initial hump of learning the basic syntax.   I know of no other language where it would be as easy as the example above to produce an interactive plot of a family of functions.

But many people simply hate to learn a new language.  If this sort of interactive example turns out to be worthwhile, someone could design an interface that would allow you to fill in the blanks and have the command constructed for you.  (I could say the same about some of other cdf files I have posted on this blog recently.) But that someone won’t be me.  I have too much fun coming up with new ideas for math  exposition to have to spend time working out all the details.  And all my little experiments are available to use under the Creative Commons License.

I would appreciate comments and suggestions.

 

Send to Kindle

Unless

Mark Meckes recently wrote (private communication):

I’m teaching a fairly new transition course at Case this term, which involves explicitly teaching students the basics of mathematical English along with the obvious things like logic and proof techniques.  I had a student recently ask about how to interpret “A unless B”.  After a fairly lively discussion in class today, we couldn’t agree on the truth table for this statement, and concluded in the end that “unless” is best avoided in mathematical writing.  I checked the Handbook of Mathematical Discourse to see if you had anything to say about it there, but there isn’t an entry for it.  So, are you aware of a standard interpretation of “unless” in mathematical English?

I did not consider  “unless” while writing HMD.   What should be done to approach a subject like this is to

  • think up examples  (preferably in a bull session with other mathematicians) and try to understand what they mean logically, then
  • do an extensive research of the mathematical literature to see if you can find examples that do and do not correspond  with your tentative understanding.  (Usually you find other uses besides the one you thought of, and sometimes you will discover that what you came up with is completely wrong.)  

What follows is an example of this process.

I can think of three possible meanings for “P unless Q”:

1.  “P if and only if not Q”,
2.  “not Q implies P”
3.  “not P implies Q”.

An example that satisfies (1) is “x^2-x is positive unless 0 \leq x \leq 1“.  I have said that specific thing to my classes — calculus students tend not to remember that the parabola is below the line y=x on that interval. (And that’s the way you should show them — draw a picture, don’t merely lecture.  Indeed, make them draw a picture.)

An example of (2) that is not an example of (1) is “x^2-x is positive unless x = 1/2“.  I don’t think anyone would say that, but they might say “x^2-x is positive unless, for example, x = 1/2“.  I would say that is a correct statement in mathematical English.  I guess the phrase “for example” translates into telling you that this is a statement of form “Q implies not P”, where Q is now “x = 1/2”.   Using the contrapositive, that is equivalent to “P implies not Q”, but that is neither (2) nor (3).

An example of (3) that is not an example of (1) is “x^2-x is positive unless -1 < x < 1“.  I think that any who said that (among math people) would be told that they are wrong, because for example (\frac{-1}{2})^2-\frac{-1}{2} = \frac{3}{4}.  That reaction amounts to saying that (3) is not a correct interpretation of “P unless Q”.

Because of examples like these, my conjecture is that “P unless Q” means “P if and only if not Q”.  But to settle this point requires searching for “unless” in the math literature and seeing if you can find instances where “P unless Q” is not equivalent to “P if and only if not Q”.  (You could also see what happens with searching for “unless” and “example” close together.)

Having a discussion such as the above where you think up examples can give you a clue, but you really need to search the literature.  What I did with the Handbook is to search JStor, available online at Case.  I have to say I had definite opinions about several usages that were overturned during the literature search. (What “brackets” means is an example.)

My proxy server at Case isn’t working right now but when I get it repaired I will look into this question.

Send to Kindle

Picturing derivatives

The CDF files in G&G posts no longer work. I have been unable to find out why.I expect to produce another document on abstractmath.org that will include this example and others. A link willl be posted here when it is done.

This is my first experiment at posting an active Mathematica CDF document on my blog. To manipulate the graph below, you must have Wolfram CDF Player installed on your computer. It is available free from their website.

This is a new presentation of old work. It is a graph of a certain fifth degree polynomial and its first four derivatives.

The buttons allow you to choose how many derivatives to show and the slider allows you to show the graphs from x=-4 up to a certain point.

How graphs like this could be used for teaching purposes

You could show this in class, but the best way to learn from it would be to make it part of a discussion in which each student had access to a private copy of the graph.  (But you may have other ideas about how to use a graph like this.  Share them!)

Some possible discussion questions:

  1. Click button 1. Now you see the function and the derivative. Move the slider all the way to the left and then slowly move it to the right.  When the function goes up the derivative is positive.  What other things do you notice when you do this?
  2. If you were told only that one of the functions is the derivative of the other, how would you rule out the wrong possibility?
  3. What can you tell about the zeroes of the function by looking at the derivative?
  4. Look at the interval between x=1.5 and x=1.75.  Does the function have one or two zeroes in that interval?  On my screen it looks as if the curve just barely  gets above the x axis in that interval.  What does that say about it having one or two zeroes?  How could you verify your answer?
  5. Click button 2.  Now you have the function and first and second derivatives.  What can you say about maxima, minima and concavity of the function?
  6. Find relationships between the first and second derivatives.
  7. Now click button 4.  Evidently the 4th derivative is a straight line with positive slope.  Assume that it is.  What does that tell you about the graph of the third derivative?
  8. What characteristics of the graph of the function can you tell from knowing that the fourth derivative is a straight line of positive slope?
  9. What can you say about the formula for the function knowing that the fourth derivative is a straight line of positive slope?
  10. Suppose you were given this graph and told that it was a graph of a function and its first four derivatives and nothing else.  Specifically, you do not know that the fourth derivative is a straight line.  Give a detailed explanation of how to tell which curve is the function and which curve is each specific derivative.

Making this manipulable graph

I posted this graph and a lot of others several years ago on abstractmath.org.  (It is the ninth graph down).  I fiddled with this polynomial until I got the function and all four derivatives to be separated from each other.  All the roots of the function and all its derivatives are real and all are shown.  Isn’t this gorgeous?

To get it to show up properly on the abmath site I had to thicken the graph line.  Otherwise it still showed up on the screen but when I printed it on my inkjet printer the curves disappeared. That seems to be unnecessary now.

Mathematica 8.0 has default colors for graphs, but I kept the old colors because they are easier to distinguish, for me anyway (and I am not color blind).

Inserting CDF documents into html

A Wolfram document explains how to do this.  I used the CDF plugin for WordPress.  WordPress requires that, to use the plugin, you operate your blog from your own server, not from WordPress.com.  That is the main reason for the recent change of site.

The Mathematica files are New5thDegreePolynomial.nb and New5thDegreePolynomial.cdf on my public folder of Mathematica files.  You may download the .cdf file directly and view it using CDF player if you have trouble with the embedded version. To see the code you need to download the .nb file and open all cells.

Here are some notes and questions on the process.  When I find learn more about any of these points I will post the information.

  1. At the moment I don’t know how to get rid of the extra space at the top of the graph.
  2. I was surprised that I could not click on the picture and shrink or expand it.
  3. It might be annoying for a student to read the questions above and have to go up and down the screen to see the graph.  I had envisioned that the teacher would ask the questions and have the students play with the graph and erupt with questions and opinions.  But you could open two copies of the .cdf file (or this blog) and keep one window showing the graph while the other window showed the questions.
  4. Which raises a question:  Could it be possible to program the graph with a button that when pushed would make the graph (only) appear in another window?

Other approaches

  1. I have experimented with Khan Academy type videos using CDF files.  I made a screen shot and at a certain point I pressed a button and the graph appropriately changed.   I expect to produce an example video which I can make appear on this blog (which supposedly can show videos, but I haven’t tried that yet.)
  2. It should be possible to have a CDF in which the student saw the graph with instructional text underneath it equipped with next and back buttons.  The next button would trigger changes in the picture and replace the text with another sentence or two.  This could be instead of spoken stuff or additional to it (which would be a lot of work).  Has anyone tried this?

Note

My reaction to Khan Academy was mostly positive.  One thing that struck me that no one seems to have commented on is that the lectures are short. They cover one aspect (one definition or one example or what one theorem says) in what felt to me like ten or fifteen minutes.  This means that you can watch it and easily go back and forth using the controls on the video display.  If it were a 50-minute lecture it would be much harder to find your way around.

I think most students are grasshoppers:  When reading text, they jump back and forth, getting the gist of some idea, looking ahead to see where it goes, looking back to read something again, and so on.  Short videos allow you to do this with spoken lectures. That seems to me remarkably useful.

Send to Kindle

A tiny step towards killing string-based math

I discussed endographs of real functions in my post  Endographs and cographs of real functions.  Endographs of finite functions also provide another way of thinking about functions, and I show some examples here.  This is not a new idea; endographs have appeared from time to time in textbooks, but they are not used much, and they have the advantage of revealing some properties of a function instantly that cannot be seen so easily in a traditional graph or cograph.

In contrast to endographs of functions on the real line, an endograph of a finite function from a set to itself contains all the information about the function.  For real functions, only some of the arrows can be shown; you are dependent on continuity to interpolate where the infinite number of intermediate arrows would be, and of course, it is easy to produce a function, with, say, small-scale periodicity, that the arrows would miss, so to speak.  But with an endograph of a finite function, WYSIATI (what you see is all there is).

Here is the endograph of a function.  It is one function.  The graph has four connected components.

You can see immediately that it is a permutation  of the set \{1,2,3,4,5,6\}, and that it is involution (a permutation f for which f f=\text{id}).  In cycle notation, it is the permutation (1 2)(5 6), and the connected components of the endograph correspond to the cycle structure.

Here is another permutation:

You can see that to get f^n=\text{id} you would have to have n=6, since you have to apply the 3-cycle 3 times and the transposition twice to get the identity.   The cycle structure (1 2 4)(0 3) tells you this, but you have to visualize it acting to see that.  The endograph gives the newbie a jumpstart on the visualization.  “The power to understand and predict the quantities of the world should not be restricted to those with a freakish knack for manipulating abstract symbols” (Brett Victor).   This is an argument for insisting that this permutation is the endograph, and the abstract string of symbols (1 2 4)(0 3) is a representation of secondary importance.  [See Note 1.]

Here is the cograph of the same function.  It requires a bit of visualization or tracing arrows around to see its cycle structure.

If I had rearranged the nodes like this

the cycle structure would be easier to see.  This does not indicate as much superiority of the endograph metaphor over the cograph metaphor as you might think:  My endograph code [Note 2] uses Mathematica’s graph-displaying algorithm, which automatically shows cycles clearly.   The cograph code that I wrote specifies the placement of the nodes explicitly, so I rearranged them to obtain the second cograph above using my knowledge of the cycle structure.

The following endographs of functions that are not permutations exhibit the general fact that the graph of a finite function consists of cycles with trees attached.   This structure is obvious from the endographs, and it is easy to come up with a proof of this property of finite functions by tracing your finger around the endographs.

This is the endograph of the polynomial 2 n^9+5 n^8+n^7+4 n^6+9 n^5+1 over the finite field of 11 elements.

Here is another endograph:

I constructed this explicitly by writing a list of rules, and then used Mathematica’s interpolating polynomial to determine that it is given by the polynomial

6 x^{16}+13 x^{15}+x^{14}+3 x^{13}+10 x^{12}+5  x^{11}\\ +14 x^{10}+4 x^9+9 x^8+x^7+14 x^6\\ +15  x^5+16 x^4+14 x^3+4 x^2+15 x+11

in GF[17].

Quite a bit is known about polynomials over finite fields that give permutations.  For example there is an easy proof using interpolating polynomials that a polynomial that gives a transposition must have degree q-2.  The best reference for this stuff is Lidl and Niederreiter, Introduction to Finite Fields and their Applications

The endographs above raise questions such as what can you say about the degree or coefficients of a polynomial that gives a digraph like the function f below that is idempotent (f f=f).  Students find idempotence vs. involution difficult to distinguish between.  Digraphs show you almost immediately what is going on.  Stare at the digraph below for a bit and you will see that if you follow f to a node and then follow  it again you stay where you are (the function is the identity on its image).  That’s another example of the insights you can get from a new metaphor for a mathematical object.

The following function is not idempotent even though it has only trivial loops.  But the digraph does tell you easily that it satisfies f^4=f^3.

Notes

[1] Atish Bagchi and I have contributed to this goal in Graph Based Logic and Sketches, which gives a bare glimpse of the possibility of considering that the real objects of logic are diagrams and their limits and morphisms between them, rather than hard-to-parse strings of letters and logical symbols.  Implementing this (and implementing Brett Victor’s ideas) will require sophisticated computer support.  But that support is coming into existence.  We won’t have to live with string-based math forever.

[2] The Mathematica notebook used to produce these pictures is here.  It has lots of other examples.

Send to Kindle

Experiment with abstractmath.org

This is a rant about technical problems with creating abstractmath.org.  You will not get great new insights into mathematical language.  You will not get any purty pictures, either.  But if you read the following anyway and have suggestions, I would appreciate them no end.

I have long been frustrated with the process I used to create articles for abstractmath.  The process has been this:  I write the article in Word using MathType, then use their facility for generating an html file that uses pictures (stored in separate files) for the more complicated math expressions, then load them into the abstractmath website.

There are many good and bad things about this. Two of the most aggravating:

  • It is difficult to change links if I reorganize something.  With a TeX file I could write WinEdt macros to do it, but the Word macro language makes manipulating links (and doing many other things) a %#!!*.
  • The documents look different in different browsers.  IE Explorer 8 does the best job, Chrome looks uglier, and Firefox is the ugliest.  After a document has been posted for a while, sometimes I open it to discover weird things, such as the recent discovery that some of my bullets had turned into copyright signs (Firefox turns bullets into double hyphens that are close to invisible).

For the past few weeks I have experimented with generating PDF documents using PDFLaTeX.   I did this with the section called Functions: Notation and Terminology, the only one that is posted so far.  Posting it required 45 minutes  of fixing links in other articles by hand.

Creating that section in TeX was a pain.  I used GrindEQ Math to convert the original document to TeX, which required a great deal of preprocessing (mostly to recover the links, but for some formatting things too) and postprocessing to fix many many things.  I also had to recreate the sidebars by hand; I used wrapfig. Wrapfig does not work well.

In the process of converting this and some other files that I may post later, I created a bunch of macros in the Word macro language (horrible, although in principle I believe in OOP) and the WindEdt macro language, which is pretty good.  Even with the macros, it is a lot of labor to do the conversion.

I have decided to abandon the effort.  I may post a few more articles that I have already transferred to  PDF, but I doubt I will revise any more from scratch.

One thing that has changed since I started doing the conversions was that a recent revision of MathType allows you to type the equations directly in TeX and to toggle back and forth between MathType form and TeX form.  Before that you had to select symbols from a palette.  That was the single most frustrating thing about using Word with MathType.  I type fluently in TeX but I couldn’t use it.  Now that I can type the TeX in directly the prospect of editing articles and writing new ones using Word is much less painful.

So one direction I will go in is to revise the articles already on the web using Word and MathType.  I also expect to kill a good many of the incomplete or less well thought out ones in favor of links to Wikipedia.

But there is another direction, opened up by Mathematica’s Computable Document files.  I have published some experiments with this in the last four posts here.  I expect to be able to turn some of the articles in abstractmath.org into computable documents.  The reader will have to have to have the free CDF Player on their computer, but if you download it once you have it forever.

Send to Kindle