e-mail: [email protected]
PACS numbers: 05.70.Ln, 11.30.Er, 05.45.Pq
This paper is devoted to an analysis of one aspect of the second law of thermodynamics, namely its statement about irreversibility. However, I am not going to prove essentially new facts in this respect. What I actually want to do is to illustrate the origin of the irreversible behavior of time-reversible systems using an extremely simple model. The reason for such an approach is that, in my view, the second law (at least in its most general formulations) is not the expression of a new fundamental law of nature, it does not formulate new facts about dynamical systems, it just describes their properties in a new form — that is why it is being derived from the underlying dynamics. One can say that the problems usually associated with this law are not so much of a physical as of a conceptual nature. In such a context it may be more appropriate to rely on demonstrations rather than on proofs, since the topic is more a question of semantics and interpretation.
Microscopic laws of molecular dynamics are invariant with respect to the time reversal: only the momenta change their signs upon the transformation t � -t. This implies that if these laws allow some evolution of a system, they allow also an evolution in which the system passes through the same spatial configurations as the original ones but in the reversed order (and with reversed directions of momenta). If the same systems are viewed macroscopically, they evolve, on the contrary, in one direction only: they demonstrate irreversibility which is formulated in the second law. According to the latter, only evolution leading to growth or preservation of disorder is observable, or in other words, there is an arrow of time. This conceptual asymmetry represents a fundamental problem, called the problem of irreversibility — known for more than a century — which does not seem to be completely solved up to the present time.
There is a plethora of theories trying to explain the origin of macroscopic irreversibility. The approaches can be roughly subdivided into those treating ensembles (for overview see e.g., [1] and references therein) and the others studying individual systems (see, e.g., [2]).
In isolated systems, the irreversibility is usually being reduced to asymmetry in possibilities to prepare initial states which would evolve to equilibrium as compared to those evolving away from it [3,4]. In open systems, the classical approach is to view the environment as a source of random perturbations [5,6], thus actually substituting deterministic systems by stochastic ones. In such a way, however, the most appealing aspect of the problem — the reconciliation of microscopic reversibility with macroscopic irreversibility — is being lost.
Recently, a new promising approach to solution of the problem has been undertaken, studying, among other things, the simple model called the rotated baker map [7], defined on the unit square in the following way:
| (1) |
| (2) |
In this context, we will be interested in further generalization of Eq. (1), which represents a very simple model enabling to demonstrate many features of interrelation between reversibility and irreversibility.
In the following discussion, we will be frequently encountering the notions of measure and dimension, so that I find it meaningful here to give a brief description of what is meant by them.
Avoiding the use of excessively technical language, we can say that measure m on R n is a set function assigning a non-negative number to any subset of R n in such a way that the measure of the empty set is zero, the measure of a subset A � B is m(A) � m(B), and the measure of the union of subsets is less than or equal to the sum of the measures (the strict equality holding only in the case of disjoint subsets). This notion is a generalization of that of area or volume and it usually reduces to it.
A specific kind of measure is the dimension of a set, which is again a generalization of our intuitive notion concerning physical (topological) dimension. There are many ways to define and calculate it, leading to different values. The standard set of dimensions [8] is based on partitioning the phase space into a finite number N(e) of disjoint e-cells (boxes) and considering the probability pi(e) (the so-called natural measure) of finding points of the set in the box i. Of course, the probability has to be normalized so that
|
| (3) |
|
Specifically, for s � 0 we obtain the Hausdorff (or box-counting) dimension
| (4) |
| (5) |
It may be worth mentioning that for s � 2 we obtain the correlation dimension. One can also prove under rather general conditions that Dq � Dr for q < r.
To introduce shortly the notion of (ir)reversibility, consider a continuous dynamical system (a flow) defined in a phase space G and described by differential equations. If St denotes the evolution operator, taking the present state g0 � G to the future one, gt = St g0, then the present state can be retraced into the past as well. If this can be achieved by applying a well-defined operator
Another definition is not so much concerned with changing the direction of evolution as it is with the possibility to "reverse" the final state gt to a reversed one T gt (typically by reversing the directions of all momenta) and with application of St to the latter. The dynamics is then said to be time-reversible if
| (6) |
Obviously, the properties of the reversal operator T will depend on St in general. We expect to have different operators T for different evolution operators St. Therefore, T can be any transformation which transforms a final state of evolution into the initial state for the same evolution—fulfilling, of course, the property (6). Every such transformation will be evidently idempotent, i.e., T 2 = I, and therefore T -1 = T.

FIG. 1: Commutativity as the defining property of reversibility.
In real physical systems, the operator T has the evident meaning of the change of momenta: pi � -pi. The fact that for time-reversible systems we have St T gt = T g0, is just the consequence of the physical laws there. If, however, we want to speak about reversibility of systems in which there is no analogy to momenta — as is, e.g., the case of two-dimensional maps which will interest us in the following — we have to accept this consequence as a definition and denote as reversible all maps M, for which there exists an operator T, making the diagram in Fig. 1 commutative, i.e., such that M n T gn = T g0, or more generally
| (7) |
The operator T is defined here not by the physical essence of M, but just by the latter requirement, and we have to find it. If it does not exist, the map M is not reversible. For reversible M, the inverse map M -1 will then be, evidently, M -1 = T -1MT = TMT.
We will generalize the classical map (2) to what may be called the "generalized" baker map Bw (GBM for short), in a way similar to the one described in [9] and denoted there as the "slightly generalized baker's transformation". The map is defined 1 for any
| (8) |
| (9) |
Later it will be advantageous to have the above definitions rewritten in the form
| (10) |

FIG. 2: Bw transforms the filled rectangle with the area (w − 1)/w into the rectangle with the area 1/w. Similarly, the complementary empty rectangle with the area 1/w is transformed into the one with the area (w − 1)/w.
Bw is a piecewise linear mapping (see Fig. 2) behaving differently to the left and to the right of the vertical line x = (w - 1)/w, which we will call the dividing line. One can easily check that in order for the map be time-reversible, i.e., fulfill the time reversal condition Bwn T gn = T g0, one has to define the reversal operator T as "rotation" around (or reflection with respect to) the second diagonal y = 1 - x, i.e., as
| (11) |
The map can be analyzed using modern computer algebra systems, which enable us to compute the action of Bw with absolute precision. One can then observe time-reversed evolution on a computer screen, and analyze the reversibility. However, to prevent rounding errors, such computations require using rational coordinates and rational w, instead of finite-precision decimal values [10].
All simulations described in the paper were performed under the above conditions. The analysis was further simplified by restricting w to integers. The latter has no effect on the results presented in the paper, so that in the following I will mostly suppose integer values of w.
The expansions caused by Bw in the x direction and contractions in the y direction are characterized by local logarithmic rates lx and ly,
| (12) |
| (13) |
| (14) |
The map Bw represents probably the simplest possible model exhibiting "microscopic" reversibility and "macroscopic" irreversibility, which strongly motivates its study as that of an example illustrating the foundations of irreversible thermodynamics, see, e.g., [11]. To start with, we mention that all the points of E can be subdivided into fixed points, cycles (periodic orbits), and attractors, as well as points approaching those sets. Let us consider each of the sets separately.
The map Bw possesses two hyperbolic fixed points, (0, 0) and (1, 1). The local stable manifold W sloc [12] for the point (1, 1) contains any subset of the vertical line x = 1 in E, since Bwn g0 approaches (1, 1) for g0 = (1, y0) with any y0 within this subset. Similarly, the local unstable manifold for the point (0, 0) contains any subset of the horizontal line y = 0 in E (which we shall denote as the primary line in view of its later role) contained in (0, w - 1/w), because Bwn g0 departs from (0,0) for g0 = (x0, 0) with any x0 within this subset. We will discuss corresponding global manifolds W s and W u later.
The generalized baker map appears to have a rich structure of periodic orbits, or cycles. Consider a point g = (x, y). Any combination of operators Lx, Rx acting consecutively on x will yield an expression linear in x, so that setting it equal to x will give us an equation with a unique solution. As an example, the equation Lx Rx Lx x = x leads to
|
| (15) |
| (16) |
There are only two structures which do not generate cycles, namely Ln and Rn, leading to stationary points (0, 0) or (1, 1), respectively. But for any other structure there exists precisely one cycle visiting both sides of the dividing line in the given order. This shows that there is a one-to-one correspondence between the x coordinate of arbitrary point on the cycle and the sequence of positions of remaining points with respect to the dividing line. Therefore, the x coordinate of a point of the cycle "encodes" the sequence of operators Lx and Rx (operator structure), and vice versa, in a way very similar to that of the Bernoulli shift [8]. This is related to what is usually referred to as "symbolic dynamics" [11].
The set of all possible cycles has interesting structure and deserves a detailed study. Here I just mention some relevant results, starting with the remark that due to contractivity of Bw in the y direction, every cycle is a limit set for a subset of points in E.
The number of cycles grows exponentially with the cycle length. There are 2 p possible combinations of two operators Lx and Rx, having the length p. Two of them correspond to fixed points, and in the case in which p is not prime, some of the combinations (denote their number r) may be further reduced to ones corresponding to shorter periods. Every p of the remaining 2 p - r - 2 combinations represents cyclic permutations, so that they correspond to the same cycle. The total number of different cycles of the length p is then (2 p - r - 2)/p.
For any value of w we have, consequently, an infinite number of all possible cycles. The cycles created by the same sequences of Lx and Rx are topologically independent of w, and with growing w they are just scaled to smaller dimensions (with growing denominators) and translated towards the point (1, 0). This implies that there is the same number of cycles for any w. In the case of w = 2, every point with an odd denominator of the x coordinate belongs to some cycle. For w > 2, however, we do not have such a simple rule due to the above-mentioned scaling.
Consider now a period with very large p. Only a very small proportion out of 2 p sequences of corresponding operators Lx and Rx will be ordered in any sense — the majority will look like random sequences. The orbits they will generate will therefore be indistinguishable from random sets of points on the trajectories.
It is evident that the set of points belonging to cycles, and of points approaching them, is of zero measure in E and consequently the behavior of cycles is not what we could observe in GBM frequently.
We now come to the most important subset of E, which is — as we shall see — dense in E and therefore the behavior of its points represents typical properties of GBM.
Averaging the logarithmic expansion rates (13) and (14) over typical trajectories in E (see [13]), one obtains what is usually called Lyapunov, or time-averaged, exponents — the positive one
| (17) |
| (18) |
The existence of the positive Lyapunov exponent suggests that one should expect chaotic behavior of the iterates of Bwn g0 (for almost every
It follows from previous remarks that exceptions to this general statement include points of W sloc, approaching the hyperbolic fixed point (1, 1), and points belonging to cycles, or approaching them.
One can prove that the attractor consists of an infinite set of lines parallel to the x coordinate, see Fig. 4, and can be generated by successive applications of Bw to the primary line. The construction is based on the iterated function system, see [10]. It is well-known that in the case of
FIG. 3: (left) Iteration of the point (95/100, 95/100) by the action of B4 shows a distinct attractor with clearly visible self-similarity.
FIG. 4: (right) Iterative generation of the attractor of Bw (w = 4). First, Bw is applied to the line segment
From the definition of an unstable manifold, it follows that the points lying exactly on the attractor (i.e., points generated by the described construction) represent the global unstable manifold W u for the point (0, 0). In this sense, the latter represents the source for the attractor.
The attractor is markedly inhomogeneous (strange) for greater values of w. With growing y, its density decreases (for w > 2 but increases for w < 2, if we permit rational non-integer values for w). With decreasing w > 2, the inhomogeneity is less and less pronounced, until at last, for w = 2, the limit object becomes the set of equidistant horizontal lines. The latter specific case will be treated separately in Sec. IX.
| w | 2 | 3 | 4 | 5 | 10 | 100 | 1000 |
| D1(w) | 2.000 | 1.734 | 1.506 | 1.376 | 1.156 | 1.012 | 1.001 |
As with any attractor, one would like to know its dimensions Ds(w), s � 0. The Hausdorff dimension (4) is evidently D0(w) = 2 (see e.g. [14]). The information dimension is calculated with the help of formula (5), taking into account that one can use, in place of probability, the density of iterates of a typical single point 2. Repeating the calculations of Hoover and Posch [13], but for general w, one arrives at the following expression for the information dimension (5):
| (19) |
Table I gives the dimension for a few values of w, and the graph in Fig. 5 enables us to see the overall dependence of D1(w) on w. Evidently, one can obtain any value of D1(w) from the interval 2 � D1(w) > 1, by controlling w. Since D1 can be viewed as the measure of attractor inhomogeneity, the approach of the dimension to the value of 1, with growing w, suggests that the iterated points tend to accumulate (condense) on smaller subsets of E.
FIG. 5: Functional dependence of information dimension D1(w) on the value of parameter w. One sees that the dimension will approach 1 with infinite growth of w. The behavior for
It can be proved [15] that every attractor generated by an iterated function system is the closure of its periodic points. This suggests that the evolution generated by Bw may show Poincaré recurrences.
Since our main preoccupation is the study of interrelation between reversibility and irreversibility, we cannot be interested only in the "future" of a state defined by an initial point g0, i.e., in the trajectory beginning in this point, but we have to consider its "past" as well, viz. the trajectory ending there. Here we will therefore try to find out what is possible to tell about the whole trajectory going through a given point g0.
Repeated application of Bw to arbitrary point It is clear that T g0 is arbitrary (or random) in exactly the same sense as g0, so that the iterates Bwk T g0 will approach the attractor as well. This has the following consequences. Since Bw is contracting in the y direction, the distance between any two points, having exactly the same This, however, means that the past of a point will approach (under backward iteration) an object symmetrical with respect to the attractor. As this object is composed of vertical lines, every point in its vicinity will move away from it under forward iteration, since the differences in We have thus come to an important result: an orbit going through arbitrarily chosen point g0 has its past close to the repellor, and its future close to the attractor. In other words, a typical trajectory generated by the action of Bw departs (in the past) from the repellor and approaches (in the future) the attractor. Such unidirectional behavior is sometimes characterized as demonstrating the existence of the "arrow of time" and here it is the direct consequence of the dynamics defined by Bw. The time reversal of such a trajectory will again be going from the repellor to the attractor, since T transforms future points into the past ones (and attractors into repellors) and vice versa. The time reversal is thus not able to change this global aspect of evolution.
FIG. 6: Symmetry between the past and the future of the trajectory (for This state of affairs can best be seen when we choose a point d0 lying exactly on the second diagonal, i.e., a point d0 = (x0, 1 - x0). Such an initial point remains unchanged upon reversal and therefore its past will be the reversal of its future: past and future parts of the trajectory will be exactly symmetric with respect to the diagonal, Considering the explicit construction of the attractor, described in the previous section, we immediately see that the points lying exactly on the repellor represent the global stable manifold W s for the point Returning to cycles, we discover an interesting property, applying to any w. There are trajectories which unwind from a p1 cycle in the past and approach a p2 cycle in the future; see the example in Fig. 7. In my view, such behavior illustrates best the discussed difference between the past and the future, since it demonstrates that there exist two different limits in the behavior of Bw, separated by intermediate states. Let us mention in passing that the period of the future and the past cycle is determined — in accordance with the general rule expressed above symbolically as g = (xfut, ypast) — by denominators of the coordinates x and y, respectively.
FIG. 7: Trajectory unwinding from a 6-cycle in the past and approaching a 4-cycle in the future Bw can be viewed as mimicking the behavior of a thermodynamic system (a nonequilibrium thermostatted one for w � 2, and an isolated one for w = 2) and therefore it could be employed to explain the origin of irreversibility. In this way, one is lead to the tentative conclusion that the evolution of some physically relevant systems, sharing common properties with Bw, is such that their states approach an attractor and in the past they depart from a repellor. This being the case, we arrive at an alternative explanation of the interrelation between reversibility and irreversibility, avoiding some paradoxes usually derived from the second law. For example, the question of why we do not observe the evolution going in the direction from attractor to repellor now becomes meaningless, because such a possibility is ruled out by the system's dynamics. The problem is instead being shifted to the question of why we encounter an approach to the attractor but not the departure from the repellor — both proceeding in one and the same direction of time. In other words, the question is, why do we observe only one part of the full (and in principle possible) evolutionary trajectory? The next sections are devoted to tackling the problem of irreversibility in this alternative formulation.
Looking at the definition (8) and (9) of Bw with rational w, we see that if we choose arbitrary point g0 with rational coordinates, the coordinates of Expressing an arbitrary point on the trajectory as g = (x, y) � (p/q, r/s), we realize that in the (distant) future the mapping of the y coordinates of points is dominated by y � y/w, whereas that of the x coordinates is dominated by x � x/(w-1). This means that the y coordinates will have greater denominators than the x coordinates (s > q), whereas in the (distant) past it will be the opposite, since the role of coordinates becomes exchanged. We could thus base the notion of "age" on the difference s - q. For future points, the latter will be positive; for the past ones, negative. However, except for the points lying exactly on the diagonal (see Fig. 6), the presence will not be sharp: we will not observe monotonous behavior in its neighborhood and different trajectories may have present states of different ages. This is especially true if we choose present points with To illustrate the situation, consider the following typical subsequence of values s - q for w = 3 and k = -8 �8, with initial point g0 = (1/4, 3/5):
The subsequence is centered around the value 1, denoted in bold, corresponding to the initial point g0. The difference s - q lacks the most important aspect of age, namely its linearity. We can obtain the latter by defining the age as
which approaches, with growing k, equidistant values of "age", separated by exactly unit steps. One has to note that corresponding points on different trajectories can have slightly different ages.
These reasonings demonstrate that in a microscopically described system of GBM, the age of any point can be found in an unambiguous way and with reasonable precision; see Fig 8. It will increase in equal steps from past to future — except the small neighborhood of the "present" state. This age is absolute in the sense that it does depend only on the distance from the present, i.e., on the number of iteration steps. This is due to the fact that coordinates of points contain "traces" of their evolution, which enable us, at least in principle, to recover the iteration index k from the coordinates of arbitrary point. It is appropriate to stress here that the case of the classical baker map (w = 2) is no exception to this general rule.
FIG. 8: Dependence of "absolute age" t on iteration index k, denoted by circles. Although one of the worst cases has been chosen here intentionally, with initial point The described approach to age might seem extremely artificial, but it can be viewed, nevertheless, as representing what may be called "microscopic" age. Of course, one should not overemphasize this model description, but neither should it be underestimated. In my view, it demonstrates—together with the above-mentioned behavior of cycles, illustrated in Fig. 7—that there is, at least qualitatively, a difference between past and future already at the microscopic level. This then implies the existence of the property of the system, which I would call microscopic irreversibility.
If we now consider the evolution of a set of N points under the action of Bw, the above reasonings will apply to any point of the set. We could therefore — at least in principle — define in any evolutionary trajectory of the set the "age" as the average of ages of all individual points. The "present" state would then be the one with the (averaged) age closest to zero.
However, looking at the set without the knowledge of the coordinates of individual points, it is possible to know its "age" also by observing its shape. Specifically, in the case of a subset of E having a simple boundary (e.g., the case of points lying within a small square), the Bw—causing the growth of fractional expression of coordinates of all the points—will cause also the "distortion" of the boundary. This then means that such distortion is commensurate with the age of the set. To make this idea more definite, we turn to the distribution functions on E, giving the density (of probability) of points in E.
FIG. 9: Two states in the evolution of a distribution function (under the mapping B 3 ), the present state of which is defined as constant on the unit square.
The map defined by Bw on E will translate in a straightforward way into the evolution of distribution functions having E as their support. The distribution function, constant on a compact subset of E, will in the forward evolution approach a function independent of x, and in the backward one, independent of y (see Fig. 9). But even general distribution functions will very quickly approach functions almost independent of x, or y. That is why one can characterize, with sufficient precision, the future limit behavior of any distribution function with the help of the function FIG. 10: Section across the distribution function B35f0 by a plane perpendicular to the x-axis, the starting function being the constant f0 º 1.
In quite a similar way one can characterize the past limit behavior of any distribution function with the help of the variation Vx(f) of the function f(x, y0), representing a section going through arbitrary point y0. The difference Vy(f) - Vx(f) of variations along y and x will then grow steadily for n from -� to +�, and we can use it to estimate the "age" of the state, similarly as we used the difference of denominators in the case of a single point.
This then gives us the general picture of the evolution of a distribution function under the action of the map Bw. The absolute value of its variation will grow indefinitely for |n| � �, attaining somewhere in between its minimum. Choosing one of the states having low variation for the "present" state and denoting the corresponding iteration index by k = 0, we will again introduce what may be called the "age" k of the system in question. We could also go a bit further and make this age linear by taking the logarithm of the difference of variations, similarly as in the case of a single point. Such an age would then be mathematically well defined everywhere, with the possible exception of the neighborhood of the "initial" distribution function.
Let us note in passing that the distribution function that evolved from the standard "testing" function f0(x, y) � 1 will have the future maximum values (for w > 2) close to the line y = 0 and the minimum ones close to the line y = 1. After n iterations, the values will be (w - 1)n and (w - 1)-n, respectively. This can be readily seen by taking into account that the area (w -1)/w to the left of the dividing line is contracted by the action of Bw to the area 1/w, i.e., by the factor of w - 1. In a similar way, one will understand the behavior of the minimum.
The ratio of the maximum to the minimum value is then (w - 1)2n, which becomes—for w > 2—a huge number after just a few iterations. This illustrates very well the growing inhomogeneity of the distribution functions under the action of GBM.
Having introduced the notion of absolute age in a way formally different from the one used by the Brussels school [16,17], but conceptually—as it seems—rather close to it, we are ready to explain the apparent contradiction between microscopic reversibility and macroscopic irreversibility of generalized baker maps, or in other words, to explain the essence of the second law of thermodynamics as applied to this simple model.
The so called "problem of irreversibility" is usually expressed in two distinct formulations: one concerns the evolution of a state prepared by an experimenter, the other applies to the observation of a system without any question about its origin, i.e., of a system with an unknown past. We are thus facing two questions (both equivalent to the Loschmidt objection) which are being posed since the first explanation of the second law by Boltzmann in 1872: if both directions of evolution are microscopically equally possible, then (a) why are we not able to prepare states which would evolve in a direction "prohibited" by this law, and (b) why do we never observe such evolution in systems with an unknown past? In the language of the absolute age, these questions reduce to the following ones: why are we not able to prepare, nor to observe, states which belong to the distant past on the evolutionary trajectory of the system under consideration?
We will treat the two questions separately, because the explanations will turn out to be different for the two cases. We will also treat separately two different approaches, corresponding to microscopic and macroscopic descriptions.
We have shown that if we choose a point (with rational coordinates) in E in any way, the coordinates of its future and past iterations will be almost always (i.e., with the exception of zero measure of points) more complex than the present ones. The claim that we can prepare only absolutely "present" points will then become actually a self-evident tautology, stating that we can prepare "simple" coordinates more easily than the "complex" ones. This tautology, however, expresses exactly the essence of the problem. If we would ask why it is not possible to prepare an "old" point, the coordinates of which would become simpler in the future, we could first of all point out that if that were even possible in principle, such simplification would last only for a finite number of iterations—until the coordinates would become relatively simple (the state would become "present"). However, to prepare ab initio a point close to the repellor, i.e., a point with specific and thus very complicated coordinates, means to prepare an exceptional point, and this is essentially more difficult than to generate a point at random. I propose to call this state of affairs, in the given context, microscopic irreversibility.
It is evidently possible to prepare such a point, even numerically, only by choosing a point g0, letting it evolve to the point Bwk g0, and at last inverting the latter. The point T Bwk g0, having the age of -k iterations, would then approach, under the action of Bw, the point T g0 and the coordinates of its k iterations would be simplified. But it is not possible to find a general construction nor formula defining a set of such "old" points.
From this point of view, the microscopic irreversibility is related to substantially different requirements for preparation of the present and past states, respectively. Evolution leaves "traces" on the coordinates of the evolved point, and inverse evolution would require us, even in the case of a single point, to reconstruct precisely its whole history. But nothing similar is being required to prepare the present state. This crucial asymmetry lies, in my opinion, at the root of the explanation of irreversibility 3. One should not, however, forget that this situation is the result of chaotic properties of the evolution induced by the map Bw. If the map did not lead to chaotic dynamics, there would be no tautology; nothing would be self-evident in this sense.
We assumed that phase points have rational coordinates. The considerations of the last two paragraphs hold, however, equally for irrational coordinates. In particular, the fact remains that inversion of evolution would require us to recover exactly its whole history. In this sense, the argument applies to any coordinates.
Here we come to a variance with traditional view of irreversibility as formulated, e.g., by Bricmont [18]: "All the familiar examples of irreversible behavior involve systems with a large number of particles (degrees of freedom). If one were to make a movie of the motion of one molecule, the backward movie would look completely natural." Certainly, playing back the movie of a single point moving under the action of GBM, we would not be able to distinguish the backward movie from the forward one. But why should the physical properties depend on our common way of observation or on our observational possibilities? And what finite number of points is sufficient to distinguish between forward and backward motion? Superimposing all the phase points of the trajectory of a single particle onto one frame, as in Fig. 3, we would be able to make the distinction between the past and the future, between impossible and natural.
The simple model system of N noninteracting particles in a rectangular box [19] shows that large number of particles alone is not sufficient for irreversibility. The arguments of this section support this line of thought in that the dynamics of some systems may bring about the unidirectionality of microscopic evolution, manifesting itself in the appearance of attractors (and repellors). Observing then systems of N particles in the usual way, the inherent microscopic irreversibility becomes more and more pronounced with growing N, but it is present in the system all the time.
Moreover, the approach connecting irreversibility with a large number of particles is not straightforward either: it requires, e.g., the thermodynamic limit N � � to obtain irreversibility. For further details, see e.g., [20], giving a nice treatment of the difference between chaotic and irreversible behavior of Hamiltonian systems.
It might be appropriate to mention at this point the Poincaré recurrence theorem, stating that any isolated, locally measure-preserving system will eventually return close to any of its previous states. Approach to an attractor, an object dense in E, has such a recurrent property. But the property of being close in phase space does not mean being of a similar age: in an arbitrary close neighborhood of any point, there can be points of any age. The mixing property of Bw implies that the distance in the sense of metrics does not imply the distance in time, nor vice versa. In this sense the recurrence does not contradict irreversibility.
The analysis of behavior of a single point may be of interest on its own, but the essential properties of the model are seen much better when observing the action of GBM on a set of points.
The initial state can be, in the case of GBM, prepared essentially by defining a subset of E and filling it uniformly with random points (or according to an appropriate distribution function). The evolution of such a set can be observed in the author's animation 4, where the subset can be chosen as a rectangle. Watching the evolution, one can see that points contained originally in the bounded subset will in the future get into a growing number of horizontal stripes, and in the past into vertical ones. Going from the past to the future, they will pass through the "present" subset. This situation is depicted also in Fig. 11.
FIG. 11:Three states in evolution (w = 4) of an initial set of 2500 randomly chosen phase points, concentrated in the square subset of E. We are readily able to distinguish between the past, the present, and the future. The projections of points on the diagonal y = 1 - x represent an analogy to the positions of physical particles.
Consider a set W-k of N points from which a set W0, located within a small square, will evolve after k iterations. The greater the k, the closer The first question pertaining to the second law can be formulated in this case as asking why it is not possible to prepare a state corresponding to a distant past. To answer it, it is sufficient to note that the evolution of a set of points, although being reversible, does not offer an equal possibility for preparation of all initial states. Present states—e.g., points filling the small square in E—can be prepared without the slightest problems, but to prepare a state n steps before would require localization of points in 2n vertical stripes of E, and in every stripe with different, and well specified, density (cf. Fig. 10). How impossible this is can be best understood when we realize that this task is equivalent to the one of preventing the occurrence of points in the complementary subset.
Then the answer to the above question is that preparation of past states would require us to create them according to a distribution function with wild variation, which is practically impossible. Moreover, this function is extremely close to the other one, obtained by averaging of the former, and describing a readily preparable present state. So the preparation would have to be, in addition, extremely precise. Here again we see the crucial role of a sensitive dependence on initial conditions, since two distribution functions, differencing "macroscopically" only slightly, have an essentially different future.
The second question, why we do not observe states typical for the distant past, can also be answered easily: we do not observe them because any such state quickly becomes "present" or future, and observation means that we are watching a system with an unknown past without influencing it.
Let us find out now what we can learn about GBM if we want to consider it as a model of a physical system, exhibiting the difference between microscopic and macroscopic behavior. The difference derives typically from the distinction between the full description in phase space and various reduced descriptions. In the case of GBM, the unit square E can be viewed as the phase space of one particle (denoted usually as the We depart from the inversion operator T, the effect of which is analogous to the inversion of momenta in classical mechanics. If the GBM has to resemble, at least remotely, the dynamics of a 1D gas, then the particle position should be represented by a variable which is an invariant of T. The projection of points in E onto the "second" diagonal, y = 1 - x, has this property and we therefore assume that it represents positions. The projection of the attractor looks like a stationary (non)equilibrium distribution of points, which can be called the "limit state". Due to the symmetry with respect to the diagonal, the projection of the repellor will be identical to that of the attractor. The orthogonal projection of a point in E onto the "first" diagonal, y = x, could be viewed, with certain reservation, as the "momentum" of a particle.
FIG. 12: The projection of evolution depicted in Fig. 11 onto the second diagonal (scaled to unity length). Only 500 points were used to make the structure visible. The left-hand side of the line corresponds to the point (0, 1) in E. We are evidently not able to distinguish between the past and the future of the projected set, but we readily see the difference between both on the one hand, and the "present" on the other.
The "macroscopic" behavior of the system of particles is then described by what can be seen on the second diagonal (see Fig. 12). There we no longer observe the approach to the attractor, nor departure from the repellor. In the direction from present to future, we see only the change of inhomogeneous distribution of positions into the time-independent limit state. In the direction from past to present, we see the opposite change of almost homogeneous (non)equilibrium state into the inhomogeneous present one. Moreover, we are not able to distinguish between the past and the future due to the attractor/repellor symmetry. The transition from past to present contradicts the second law for a gas on a line; the transition from present to future is in accord with it.
We can now answer the two questions posed traditionally with respect to the second law. We already know that we cannot prepare microscopically the "past" state because this is practically impossible. But the macroscopic preparation offers still fewer possibilities, since in this case we can "assign" only positions to particles, but not "momenta". If we were even somehow able to create some specific arrangement of points on the diagonal, we would know nothing about their "momenta", so the latter would be necessarily random, and consequently the phase points would not be on the repellor. This explains why it is not possible to prepare macroscopically a state which would evolve contrary to the second law, and it seems to be the full explanation.
We cannot observe evolution going from the repellor to the present state for different reasons. First, any observation of a closed (isolated or thermostatted) system has finite duration. If we regard the trajectory generated by Bwk as a sequence of states with unbounded past and future, then the probability of observing any specific finite sequence of states is zero. The observation of the present state is by definition specific and finite, so the probability to find it at random (on a randomly chosen trajectory) is vanishingly small. But the probability of picking at random arbitrary interval with exclusively past states (on the repellor) is the same as that of seeing one with exclusively future states (on the attractor)—i.e., 1/2. However, we cannot distinguish macroscopically between the past and the future states. This explains why we are not able to know that we have seen a system in the past even if it would be there. This holds even under the assumption that systems can exist under unchanged outer conditions indefinitely.
But the trajectories of closed systems do not have unbounded past. In the finitely remote past a state "came into existence" as a result of interaction of the system with the surroundings, and afterwards its boundary conditions remained unchanged. That "first" state was actually the "present" one, and from the moment of its appearance the system tends to the attractor. Therefore, the unbounded past of the presently observed state is actually an illusion: from the existence of the present state we can assume the existence of the past states, we can even in principle recapture them, but the mere existence of the present state does not guarantee the real existence of past states on the repellor. Thus we can find the system always only in the future, approaching the attractor. This explains the second law in the form stating the properties of observed states, because it describes the behavior of a system with an unknown past.
This leads us to formulate the macroscopic irreversibility of the GBM as meaning that the evolution of this system approaches an attractor, and if it would be possible to recover its evolution from the past, the system would depart from the repellor. Such formulation is not subject to the Loschmidt paradox; it is free of any seeming contradictions. In my view, these are sufficient reasons to accept it. However, to do this, we have first to take into account that the usual statement of irreversibility applies to conservative (isolated) systems which should be simulated by B2—the classical baker map.
In the previous reasonings, we were dealing mostly with the full interval of parameter values, w > 1. However, the specific case of w = 2, corresponding to the classical baker map, does not share all general properties stated above, and the question therefore arises as to which of the properties do not carry over to this specific case and what consequences this may have.
The attractor of the GBM reduces, for w = 2, to the set of horizontal lines with yik = ci /2k, where k denotes the generation number and ci < 2k is an integer. Similarly, the repellor becomes the set of vertical lines with x coordinates of the same type, xik = ci /2k. Speaking mathematically, these sets represent an attractor/repellor pair as well (see e.g. [15], p. 82), therefore we will continue to denote them so.
From the physical point of view, however, those objects are usually considered irrelevant, since they are not "strange". All dimensions (3) of the attractor and repellor namely reduce to 2, so that the latter are homogeneous and consequently do not represent readily "visible" objects. Moreover, the sum of Lyapunov exponents
Returning now for a while to cycles, we have observed that the number of periodic orbits containing points with denominator not exceeding a given number d grows with falling w � 2, until in the case of w = 2 all points with a rational x coordinate, differing from those of the attractor/repellor ones, lie on trajectories approaching cycles. This can be understood considering that the Bx reduces in this specific case to Accepting the terminology according to which the attractor (repellor) exists even in the classical baker map, being only "invisible" there, we can formulate the obtained results in a unified manner, treating classical baker map, caricaturing an isolated system approaching equilibrium, in the same way as thermostatted systems approaching nonequilibrium steady states.
The notion of absolute age has the same meaning for w = 2 as for w > 2. The present states are defined by simple coordinates, and the present distribution functions have small variation. One can tell the difference between past and future in evolution of a point, and choosing an inhomogeneous initial (present) distribution function, one can distinguish microscopically between past and future not only by absolute age but also by predominantly vertical or horizontal stripes of maxima, respectively—at least in the initial stages of evolution, not too far from the present states.
Absolute age is essential, in my view, for the explanation of irreversibility. What was told about it above applies equally to the specific case of In this context, we should not leave unnoticed the Zermélo paradox, relying on the Poincaré recurrence theorem. According to it, even if a nonequilibrium state of an isolated system evolves to equilibrium, it must eventually return close to the initial state, thus violating the second law. This means, as applied to B2, that a system of points, starting e.g., in a small subset of E, will eventually return close to it. Apart from the standard reply that the recurrence time is enormously long, the model of B2 enables us to give further refutation of the objection. Since all rational points approach cycles, such recurrent behavior, albeit not excluded, would be possible only for extremely exceptional sets. Choosing namely the same denominator of x coordinates of all points, one obtains—as a rule—the sets "condensing" on a limited number of points in E. However, it is not too difficult to find also points, which will approach periodically moving sets. This confirms the existence of recurrences, but also demonstrates how rare and physically irrelevant they are 5. In this sense, the recurrence does not contradict the formulation of the second law about the arrow of time pointing all the time from the past to the future.
I believe that the model of generalized baker map is interesting even in itself. However, it was studied with the hope that its analysis will shed light on the properties of realistic physical systems. By the latter I mean mathematical models, all essential aspects of which mimic the properties of corresponding real physical objects. Let us start, therefore, the concluding discussion by a reconsideration of the basic assumptions used in the study of the generalized baker map.
All my numerical simulations were restricted to rational coordinates. The reason was to circumvent artificial irreversibility, which would otherwise appear as a numerical artifact due to rounding errors. Rational numbers are a dense subset in the field of real numbers, and almost everything mentioned above for rational coordinates of points will apply to real ones as well. There is actually only one major problem which the exclusive use of rational numbers could cause in our study: the absence of periodic orbits for trajectories containing points with irrational coordinates. This to me does not seem to be an essential problem, if we are aware of it.
Many "realistic" (in the above sense) N-particle model systems (e.g., dilute gas models with Lennard-Jones [21] or hard-sphere [22] potential) share a sensitive dependence on initial conditions with GBM. B2 is known to be strongly mixing [14] and the same is confirmed for many classical systems [23]. The mixing property is a consequence of Lyapunov instability, discovered in many interesting models [22]. The evolution of systems with this property leads to local contractivity of phase volume (measure) in the forward direction of time (implying the opposite when tracking the evolution backwards), or to uniform coverage of phase space by the trajectory. Approach to an attractor in GBM is then analogous, in realistic systems, to approach to nonequilibrium steady state, or to equilibrium.
The notion of absolute age enables us to view the past and the future as related to (strange or homogeneous) repellors and attractors—the structures in phase space, defining the arrow of time. The age, as introduced here, relies on the property of Bw, that the two coordinates of phase points behave differently under Bw. It is evident that this results from different contractivity of the map in different directions. We can therefore expect some signs of the absolute age in all systems with strange or homogeneous attractors.
We pointed to the different role of positions and momenta in the transition to macroscopic description. The momenta are responsible for the microscopic difference between past and future steady states, but they do not help us in their macroscopic identification. This state of affairs lies at the basis of macroscopic indiscernibility between past and future and thus supports our view that macroscopic description is intimately connected to the loss of information about the system under consideration [3]. This may be best seen in the fact that we are readily able to impose macroscopic constraints on positions, but we are not able to do the same (or at least not to a comparable extent) with momenta. This dramatically reduces our ability to influence the states of physical systems, thus leading to apparent irreversibility.
In this sense, the GBM manifests features which coincide with the general properties of e.g., rarefied real gases, as regards the global behavior in the distant past and future—as far as the interrelation between reversibility and irreversibility is concerned. The "mechanisms" by which irreversibility appears in reversible systems are common to GBM and strongly mixing models of N-particle systems. We may thus say that GBM represents an appropriate—albeit extremely simple—model, demonstrating essential aspects of interrelations between reversibility and irreversibility. The properties which became apparent in the study of GBM are therefore applicable to more realistic models, as follows.
In real experiments, we prepare a state by some manipulations with particles. The mathematical counterpart of such preparation is the localization of phase point w0 of a realistic system in its phase space. The evolution of the state then results from physical laws and is represented as the action of mathematical operator St. Whatever the physical preparation of a state may be, the localization will always result in a set of numbers having absolute precision. Of course, we do not know any state with infinite precision, but from the viewpoint of classical mechanics, any state must be absolutely precise per se. Application to w0 of St, simulating relevant physical law, yields absolutely precise numerical results for wt, for any value of t > 0, provided the system is deterministic.
As far as GBM is concerned, one can observe (on a monitor) even evolution going from repellor—i.e., from the distant past—to the present. Such evolution will take only finite number of iteration steps: after reaching w0, the point will move (forever) to the attractor. However, we cannot reconstruct past evolution in real physical experiments, since we cannot prepare a state corresponding to the precise localization of the point T wt . The same applies to the preparation of states according to a distribution function. In both cases, we prepare states and functions which are "present" and we are not able to prepare "past" states nor functions, which would eventually evolve into the present ones. Our possibilities to prepare states of physical systems are rather crudely restricted to states which are in the absolute sense "present".
In my view, this explains the seeming contradiction between microscopic reversibility and macroscopic irreversibility as rooted already at the microscopic level. Or still in other words, the problem of irreversibility has primarily nothing to do with the micro/macro dichotomy. It is connected to the past/future dichotomy instead, manifesting itself in the asymmetry of preparation of states close to the repellor/attractor. The macroscopic level just adds a further restriction, consisting in our impossibility to observe or practically influence momenta of particles.
The view presented in this paper may seem to contradict the orthodox approach, stressing the role of a large number of particles in obtaining macroscopic physical behavior, but this role is, actually, not disputed here, it is just not given the utmost significance. One certainly recognizes that the possibility to prepare a "past" state depends crucially also on how large the system is. Moreover, what is claimed, e.g., in [20], is that "the equality between the physical entropy production and the dynamical phase space contraction rates of Hamiltonian thermostatted systems only holds for macroscopic systems". Restricting the scope of the paper to the investigation of irreversibility only, such arguments do not apply to it.
In an effort to view the map as a model—albeit a very crude one—of real systems, and to apply the results of the model to them, one sees that a kind of correspondence can be established between GBM and macroscopic systems, the dynamics of which has an attractor/repellor pair (visible or hidden). The second law of thermodynamics for such systems can then be explained in the same way as the unique direction of evolution in GBM. Namely, it can be regarded as a consequence of the dynamics which—if continued to encompass the past—would lead from the past repellor to the future attractor. The observed properties of the systems then result from this simple symmetry. Macroscopic observation of systems—the sole viewpoint available to us—hides this fundamental microscopic property and causes a series of seeming contradictions between the two directions of time. Accepting this standpoint, we can claim that there is no real "problem" with the law and that the objections posed against it (including the classical ones of Loschmidt and Zermélo) are simply the results of misunderstanding.
I think that the described model essentially explains the origin of observed (macroscopic) irreversibility in unobservably (microscopically) time-reversible systems. The observed macroscopic irreversibility is actually reversible in principle, or the microscopic reversibility is observationally irreversible. The driving force of such behavior is a sensitive dependence of the evolution on initial conditions, which represents the hallmark of chaos.
1The action of Bw for 1 � w � 2 does not differ essentially from that for w � 2. In the former interval the contraction grows towards y = 1 as opposed to the y = 0 for the latter. That is why we will be analyzing Bw almost exclusively for w � 2.
2Calculation of the Kaplan-Yorke dimension DKY = 1 + l1/l2 will yield the identical result.
3It is equally difficult to prepare points belonging to a given distant future, but the law does not apply to this.
4The animation can be found at the web addresses http://www.sjf.tuke.sk/ket/kumicak/BakAnima.html and http://www.geocities.com/kumicak/BakAnima.html
5To give a simple example: the denominator 397 will yield, for all points in the subset x < 1/10, the set of 27 points returning to the subset every 44 iterations.


5. Age of states
Since the typical full phase trajectory (considered unbounded both in the future and in the past) consists of points moving from the past repellor to the future attractor, it is quite natural to call the points between the past and the future the present ones. This leads us to consider the possibility of introducing an "age" that could be applied along the trajectory.
and skipping eventual (and rare) zeroes for which this expression is not defined. In such a way, we get a really very precise approximation to linear age — a result which can be understood by noticing that Bw always maps s � ws, whereas q either remains unchanged or q� (w - 1)q. Then for gk we have on average s - q � wk - (w - 1)k-1 for k > 0. The logw of the latter approaches, with growing k, the value of k + const with very high precision. The above sample (sub)sequence is then transformed into the following one:
t = sgn(s - q)logw | s - q | (20)
taken over all possible dissections of the interval by m > 1 points. We can regard this expression as an acceptable approximation of the variation of the distribution function along the y axis.
Vy(f) =
sup
m
�
j=1 |f(x0, yj ) - f(x0, yj-1)|, (21)
6.
The problem of irreversibility
7. Microscopic irreversibility
7.1 Irreversibility in the evolution of a single point
7.2 Irreversibility in the evolution of many points
8. Macroscopic irreversibility
9.
The case of the classical baker map
which should be negative for strange attractors, becomes now zero, which means that the measure of subsets of E is not being contracted under B2. Such a homogeneous attractor, nevertheless, "attracts" the points iterated by B2 in the important sense that the iterates visit the neighborhood of any point of E with equal probability.
l1 + l2 = 2 - w
ln (w - 1), (22)
10. From the simple model to real-world physics
11. Conclusions
We have analyzed the parametrized set of generalized baker maps as simple models of time-reversible evolution, exhibiting signs of irreversibility. The analysis has shown that it is possible to define past, present, and future states and distribution functions on the definition domain of the map. The question about the origin of the preferred orientation of evolution—the arrow of time—can then be answered by stating that it is rooted in contractivity of the map: the latter produces a unique direction of evolution which can be brought into relation with the arrow. This applies equally to the measure-preserving classical baker map.
Acknowledgments
The final stage in the preparation of the paper has profited much from the discussions with William G. Hoover and Harald A. Posch. This work was partly supported by the Scientific Grant Agency (VEGA) of the Slovak Academy of Science and Ministry of Education of the Slovak Republic under the Grant No. 1/0428/03. The support is herewith greatly acknowledged.
APPENDIX: Animation of GBM
References
Footnotes:
File translated from TEX
by TTH, version 3.12.
On 1 Oct 2004, 21:06.