This post has been turned into a page on WordPress, accessible in the upper right corner of the screen. The page will be referred to by all topic posts for Abstracting Algebra.
This post has been turned into a page on WordPress, accessible in the upper right corner of the screen. The page will be referred to by all topic posts for Abstracting Algebra.
using the style of presenting trees used in academic computing science. The parentheses are a clue to the structure; omitting them results in $latex 4x-2=6$, which has the different structure
By the time students take calculus they supposedly have learned to perceive and work with this invisible structure, but many of them still struggle with it. They have a lot of trouble with more complex expressions, but even something like $latex \sin x + y$ gives some of them trouble.
The tree expression makes the invisible structure explicit. Some math educators such as Jason Dyer and Bret Victor have experimented with the idea of students working directly with a structured form of an algebraic expression, including making the structured form interactive.
How could the tree structure be used to help struggling algebra students?
1) If they are learning on the computer, the program could provide the tree structure at the push of a button. Lessons could be designed to present algebraic expressions that look similar but have different structure.
2) You could point out things such as:
a) “inside the parentheses pushes it lower in the tree”
b) “lower in the tree means it is calculated earlier”
3) More radically, you could teach algebra directly using the tree structure, with the intention of introducing the expression-as-a-string form later. This is analogous to the use of the initial teaching alphabet for beginners at reading, and also the use of shape notes to teach sight reading of music for singing. Both of these methods have been shown to help beginners, but the ITA didn’t catch on and although lots of people still sing from shape notes (See Note 1) they are not as far as I know used for teaching in school.
4) You could produce an interactive form of the structure tree that the student could use to find the value or solve the equation. But that needs a section to itself.
When I discovered the TreeForm command in Mathematica (which I used to make the trees above), I was inspired to use it and the Manipulate command to make the tree interactive.

This is a screenshot of what Mathematica shows you. When this is running in Mathematica, moving the slide back and forth causes the dependent values in the tree also change, and when you slide to 3.5, the slot corresponding to $latex 4(x-2)$ becomes 6 and the slot over “Equals” becomes “True”:
As seen in this post, these are just screen shots that you can’t manipulate. The Mathematica notebook Expressions.nb gives the code for this and lets you experiment with it. If you don’t have Mathematica available to you, you can still manipulate the tree with the slider if you download the CDF form of the notebook and open it in Mathematica CDF Player, which is available free here. The abstractmath website has other notebooks you may want to look at as well.
Moving the slider back and forth constitutes finding the correct value of x by experiment. This is a peculiar form of bottom-up evaluation. With an expression whose root node is a value rather than an equation, wiggling the slider constitutes calculating various values with all the intermediate steps shown as you move it. Bret Victor ‘s blog shows a similar system, though not showing the tree.
Another way to use the tree is to arrange to show it with the calculated values blank. (The constants and the labels showing the operation would remain.) The student could start at the top blank space (over Times) and put in the required value, which would obviously have to be 6 to make the space over Equals change to “True”. Then the blank space over Plus would have to be 1.5 in order to make multiplying it by 4 be 6. Then the bottom left blank space would have to be 3.5 to make it equal to 1.5 when -2 is added. This is top down evaluation.
You could have the student enter these numbers in the blank spaces on the computer or print out the tree with blank spaces and have them do it with a pencil. Jason Dyer’s blog has examples.
My example code in the notebook is a kludge. If you defined a special VertexRenderingFunction for TreeForm in Mathematica, you could create a function that would turn any algebraic expression into a manipulatable tree with a slider like the one above (or one with blank spaces to be filled in). [Note 2]. I expect I will work on that some time soon but my main desire in this series of blog posts is to through out ideas with some Mathematica code attached that others might want to develop further. You are free to reuse all the Mathematica code and all my blog posts under the Creative Commons Attribution – ShareAlike 3.0 License. I would like to encourage this kind of open-source behavior.
1. Including me every Tuesday at 5:30 pm in Minneapolis (commercial).
2. There is a problem with Equals. In the hacked example above I set the increment the value jumps by when the slider is moved to 0.1, so that the correct value 3.5 occurs when you slide. If you had an equation with an irrational root this would not work. One thing that should work is to introduce a fuzzy form of Equals with the slide-increment smaller that the latitude allowed in the fuzzy Equals.
When I was a junior taking advanced calculus under Fuzzy Vance at Oberlin College, I sat one night staring at the book trying to understand how the Riemann Sum of a function “converged” to the value of the integral. The book said that it converges as the mesh goes to 0, and that any Riemann Sum was included in the convergence theorem, with any choice of partition, even irregular, and with any choice of points-to-evaluate-at in each subinterval. [Note 1.]
I had a satori [Note 2]. I felt like the guy in the ads who sits in front of his new ultrafast computer with the wind blowing his hair back and bracing himself by holding onto the desk. (My hair was dark then but I certainly was not wearing a tie.)
That convergence theorem was talking about something BIG.
I visualized a Cloud of Riemann Sums floating around and swerving closer to the Right Answer as their meshes decreased.
A Riemann Sum has a lot of parameters:
Those are three independent parameters, except for the constraint imposed by the mesh on each choice of subintervals. [Note 3].
I tell my students that we have to zoom in and zoom out [Reference 2] from a problem. When we zoom out a complicated structure is thought of as a point in a certain relationship with other structures-as-points. Then to understand something we zoom in (selectively) to see the details that make it work. What I remember from my satori is that I didn’t visualize them as points but rather as little blurs, sort of like the blurs in Mumford’s red book [Reference 3], which I think was the first non-constipated math text I had ever seen.
In the nineties, I was on a grant to create Mathematica programs for students, and one of the notebooks I created allowed you to easily exhibit Riemann sums with various parameters. I also included code that would show a cloud.
Below is a cloud. It is a plot of the values of 300 Riemann sums for $latex \int_0^{\pi} \sin x \,dx$. They have randomly chosen meshes from $latex 0$ to $latex \pi/2$ and the subintervals and individual evaluation points for each subinterval are also chosen randomly.
The cloud below is a plot of the values of 300 Riemann sums for the area of the upper right quarter circle of radius 2 with center at origin. Its meshes range from 0 to 1, and other properties are similar to the one above. The vertical spread of the points is considerably bigger, presumably because of the vertical tangent line at the right hand end of the integral.
When you click on the code for either of these you get a different cloud with the same parameters.
You can access the notebook containing the code for this via Abmath Gate. Be sure to read the ReadMe file.
[1] This was 1961. Of course the book didn’t say things such as “with any choice of points-to-evaluate-at”. It said what it had to say in stilted academic prose which required reading it two or three times before understanding it. Academic prose is much better these days. See Reference [1].
I was quite good at reading complicated prose. My ACT scores were a tad higher in English or Language or whatever it is called that they were in Math. With the Internet, math exposition should do much more with pictures, interactive things, and lots of examples (which don’t waste paper now). But that is another diatribe…
[2] This is a snooty word for lightbulb flashing over your head. Every once in awhile I give in to the temptation to use some obscure word to impress people as to the variety of things I know about. Teachers, don’t do this to your students. Other professors are fair game.
[3] The same choice of subinterval can correspond to many different meshes, if your definition of mesh requires only that each subinterval be narrower than the mesh, rather than requiring that the mesh be the size of the biggest subinterval. (I had never thought about that until I wrote this.)
[4] The Mathematica Demonstrations website has several other notebooks that exhibit Riemann Sums.
[1] The Revolution in Technical Exposition II, post on this blog.
[2] Zooming and Chunking in abmath.
[3] D. Mumford, The Red Book of Varieties and Schemes (second expanded ed.), Springer Lecture Notes in Math 1358, Springer-Verlag, Berlin, 1999. (I have not seen this edition. What I remember is the Red Book as it was in the 1967 Algebraic Geometry Summer School at Bowdoin. I hope the smudges survive in the new version. As I remember the smudges were bigger for points that were more generic or something like that. Those smudges caused me a kind of sartori, too.)
I have written a lot about math exposition in the past. [Note 1.] Lately I have been thinking about the effect of technological change on exposition.
Texting
A lot of commentators have complained that their students’ writing style has “deteriorated” because of texting, specifically their use of abbreviations and acronyms.
Last January I resumed teaching mathematics after an exactly ten year lapse. My students and I email a lot, post on message boards, hand in homework, write up tests. I have seen very few “lol”s and “cu”s and the like, mostly in emails and almost entirely from students whose native language is not English. (See Note 1.)
As far as I can see the students’ written language has not deteriorated. In fact I think native English speakers write better English than they did ten years ago. (But Minnesota has a considerably better educational system than Ohio.)
Besides, if lol and cu become part of the written language, so what? Many Old Fogies may find it jarring, but Old Fogies die and their descendants talk however they want to.
Bulleted lists
I have been using Powerpoint part of the time in teaching (I had already given some talks using it). People complain about that affecting our style, too. But I think that in particular bulleted and numbered lists are great. I wish people would use them more often. Consider this passage from a recent version of Thomas’ Calculus [1]:
$latex \displaystyle \int_a^bx\,dx=\dfrac{b^2}{2}-\dfrac{a^2}{2}\quad (a< b)\quad\quad\quad(1)&fg=000000$
This computation gives the area of a trapezoid. Equation (1) remains valid when $latex {a}&fg=000000$ and $latex {b}&fg=000000$ are negative. When $latex {a<b<0}&fg=000000$, the definite integral value … is a negative number, the negative of the area of the trapezoid dropping down to the line $latex {y=x}&fg=000000$ below the $latex {x}&fg=000000$-axis. When $latex {a<0}&fg=000000$ and $latex {b>0}&fg=000000$, Equation (1) is still valid and the definite integral gives the difference between two areas …
It would be much better to write something like this:
Equation (1) is valid for any $latex {a}&fg=000000$ and $latex {b}&fg=000000$.
- When $latex {a}&fg=000000$ and $latex {b}&fg=000000$ are positive, Equation (1) gives the area of a trapezoid.
- When $latex {a}&fg=000000$ and $latex {b}&fg=000000$ are both negative, the result is negative and is the negative of the area…
- When $latex {a<0}&fg=000000$ and $latex {b>0}&fg=000000$, the result is the difference between two areas…
That is much easier to read than the first version, in which you have to parse through the paragraph detecting that it states parallel facts. That is not terribly difficult but it slows you down. Especially in this case where the sentences are not written in parallel and contain remarks about validity in scattered places when in fact the equation is valid for all cases.
This book does use numbered or lettered lists in many other places.
The future is upon us
Lots of lists and illustrations require more paper. This will go away soon. Some future edition of the book on an e-reader could contain this list of facts as a nicely spaced list, much easier to grasp, and could contain three graphs, with $latex {a}&fg=000000$ and $latex {b}&fg=000000$ respectively left of the $latex {x}&fg=000000$-axis, straddling it, and to the right of it. This will cost some preparation time but no paper and computer memory at the scale of a book is practically free.
I use bulleted lists a lot in abstractmath, as here. Abstractmath is intended to be read on the computer. It is not organized linearly and a paper copy would not be particularly useful.
By the way, since the last time I looked at this page all the bullets have been replaced with copyright signs. (In three different browsers!) Somebody’s been Messing With Me. AArgH.
The Irish mystery writer Ken Bruen regularly uses lists, without bullets or numbers. Look at page 3 of The Killing of the Tinkers.
Some people find bulleted lists jarring simply because they are new. I think some are academic snobs who diss anything that sounds like something a business person would do. See my remarks at the end of the section on texting.
Notes
1. You can see much of what I have said on this blog about exposition by reading the posts labeled “exposition” (scroll down to the list of categories in the left column.) See also Varieties of Mathematical Prose by Atish Bagchi and me.
2. Foreign language speakers also write things like “Hi Charles” instead of “Dear Professor Wells” or using no greeting at all (which is probably the best thing to do). Dealing with a foreign language requires familiarity with the local social structure and customs of address, of being aware of levels of the various formal and informal registers, and so on. When we lived in Switzerland, how was I to know that “Ciao” went with “du” and “wiederluege” went with “Sie”? (If I remember correctly. Ye Gods, that was 35 years ago.)
References
1. Thomas’ Calculus, Early Transcendentals, Eleventh Edition, Media Upgrade. Pearson Education, 2008.