PhyloBotanist: Aubert's analysis of phylogenetic terminology, part 2: definitions

Tuesday, December 29, 2015

Continuing the discussion of this paper from here. Originally I had the idea of going through the pages in sequence, but it may be more productive to tackle one by one what appear to be the main claims of the paper as I see them:

The various definitions provided in the paper are in some way better than the ones that are currently accepted.

There is no relevant difference between the systematics-relevant relationships and structures existing at any level of the diversity of life. (E.g. mother > daughter is completely equivalent to bony fish > land animals - they can all be drawn as diamonds and arrows, right?)

A strictly phylogenetic classification is formally impossible.

Cladism is part of structuralism and therefore characterised by "anti-realism and a metaphysical way of thinking".

Cladism is built on biologically unrealistic assumptions that have been empirically falsified.

There exists an objective approach to delimiting paraphyletic groups.

It would be preferable to have two parallel classifications, one of clades and one that includes taxa that are allowed to be non-monophyletic.

So this might turn into seven separate posts, although the last few will perhaps be short. This is the first one, dealing with the definitions, and it got rather long.

The definitions themselves

About the first two thirds or so of the present paper consist of a mixture of "lemmas", "definitions" and "observations" interspersed with helpful figures. As mentioned before, the latter consist of diamonds connected by arrows and serve to illustrate the definitions. To get a flavour of the text, consider this example:

This is accompanied by a figure showing diamonds connected by arrows and a group of diamonds marked to illustrate the concept of "lineage". Here is an alternative way of providing the same information:

A lineage is a line of descendants without gaps.

This alternative is (a) shorter, (b) easier to understand, and (c) fully equivalent in the sense of not missing an iota of information or precision compared to the original. The question is then why one would choose to express what boils down to perhaps two to three pages of really simple, one-sentence ideas with thirty pages of lemmas, indexed placeholders, set theory and suchlike.

Perhaps this is a bona fide attempt to be 'precise', but a reader who suspected that it represents an attempt to intimidate and bamboozle the reader with math, to make them think that something much more profound is going on than there is, could at least not easily be called unreasonable. Curiously, Aubert himself asks in a later part of his paper "whether the criticisms against [sic] classical evolutionary systematics do not come from an illusion of precision due to the use of mathematical tools in cladistic analysis". (Pot, kettle, black?)

Anyway, some of the definitions are trivial or irrelevant and clearly only included for the sake of completeness, while others are directly relevant to the controversy around whether partial clades should be formally accepted as taxa. Concentrating on the latter:

Lineage = As mentioned above, a line of descent without gaps. Aubert illustrates it in a way that strongly implies a linear, unbranched shape, with only one item per generation. I find this odd because to me such a lineage would be incomplete, but I guess that just means that I am a cladist. What is even more unusual, however, is that Aubert also defines the term in such a way that a single individual satisfies it. Surely that does not fit the way this word is generally understood.

Ancestor = An item is an ancestor of another item if they are connected by a lineage. This is soon followed by an interesting "observation": An ancestor is apparently also its own ancestor, at least if I interpret the wording correctly. This is called at the same time "counter-intuitive" (true), "trivial" (not to me, perhaps because I do not believe that a single item is a lineage), and "greatly facilitat[ing] the formulation of many properties thereafter". It is nice that Aubert will find this to make his definitional jiu jitsu easier, but that does not necessarily mean that it makes biological sense. I mean, if I define rock as cheese I will also find it easy to prove that the moon is made of cheese, but most people would quite sensibly take issue with my definition.
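For readers who prefer code to diamonds and arrows, the two definitions above can be sketched in a few lines of Python. This is only my reading of them, not Aubert's formalism: the toy genealogy CHILDREN, the function names and the reflexive switch are all invented for illustration. The sketch suggests that the "observation" follows directly from allowing a lineage that consists of a single item.

```python
from collections import deque

# Hypothetical toy genealogy: each key is a parent, each value lists its children.
CHILDREN = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": [],
    "D": [],
}

def descendants(children, start):
    """Return every item reachable from `start` by following parent -> child arrows."""
    seen = set()
    queue = deque(children.get(start, []))
    while queue:
        node = queue.popleft()
        if node not in seen:
            seen.add(node)
            queue.extend(children.get(node, []))
    return seen

def is_ancestor(children, a, b, reflexive=True):
    """True if a gap-free line of descent leads from `a` to `b`.

    With reflexive=True a single item counts as a lineage of length one,
    so every item is its own ancestor (the convention discussed above).
    """
    if a == b:
        return reflexive
    return b in descendants(children, a)
```

With reflexive=True, is_ancestor(CHILDREN, "A", "A") holds; with reflexive=False it does not, and nothing else about the genealogy changes. Whether the convention is biologically sensible is a separate question from whether it is convenient.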

Group = In Aubert's usage, any random assemblage of items in his graphs that he draws a line around. A cladist would, of course, consider anything non-monophyletic to not really be a group in any useful or meaningful sense. Often in this context people are asked to consider similarly nonsensical non-groups outside of the tree of life that are likewise defined only by lacking a trait, such as non-Hindus or shirts that are not white.

'Monophyletic' group = A group that contains its own common ancestor. Clearly here the currently accepted definition in the vast majority of scientific journals, conference talks, textbooks and university courses is a different one. I will subsequently scare quote this definition, so if the quote marks are not there I mean the widely accepted Hennigian definition.

At this point Aubert worries about this definition allowing groups that contain items and their common ancestor but not the intermediate ancestors, so he consequently clarifies:

Continuous 'monophyletic' group = A group that contains its own common ancestor, and all of whose members are connected to the common ancestor by descent through other members.

Ancestral group = A 'monophyletic' group containing a common ancestor of another group. We are now really, really, really deep in begging the question territory as far as the controversy between cladists and their opponents is concerned. Not only is such a group not really a group in any meaningful sense, most of it isn't ancestral to the subclade Aubert would exclude from the 'group' in the first place.

Directly ancestral group = The same but the ancestor-descendant relationship between the ancestral item in the so-called 'ancestral group' and the common ancestor of the other group is direct, without intermediaries.

Exclusively ancestral group = The descendant group has only this one ancestral group, i.e. it is not derived from a cross between two different ancestral groups. Here Aubert seems particularly concerned with endosymbiosis. I will never understand why so many people believe that this is relevant. The chloroplasts are a monophyletic group that has colonised eukaryotes, and the plants are a monophyletic group of eukaryotes that has been colonised by a clade of cyanobacteria. This appears to be another case of conflation, in this case of systematics-relevant lines of descent with systematics-irrelevant horizontal gene flow. One could just as well claim that a human needs a new family name after getting a heart transplant. Curiously, once more there is a place in the paper where Aubert warns against precisely that kind of conflation himself.

A mere four pages after 'monophyletic' we come to paraphyletic, polyphyletic, and holophyletic, the former two defined the usual way, the last the suggested alternative to what nearly everybody today calls monophyletic. Aubert now provides two 'utilitarian' arguments for changing monophyletic to holophyletic.

First, if the paraphylists cannot use the word monophyletic to mean either monophyletic or paraphyletic, they "[have] no other term to account for their concept" and "cannot assert their point of view". From a cladist perspective, that is a feature, not a bug, because they consider the concept unhelpful and the point of view wrong. Again: like non-Catholics, not really a group, etc. One wonders also, given how frequently paraphylists make up new terms, why they could not just have made one up for this eventuality.

Second, the either monophyletic or paraphyletic meaning would be "useful in studies of unrooted phylogenetic trees where precisely [sic] it is de facto impossible to distinguish holophyly and paraphyly". Maybe there are dozens of people out there who have that issue, but I am not aware of a single situation like that. Towards the end of the relevant section Aubert mentions that somebody has already made up a new word for that situation (clan), thus undermining his own utilitarian argument.

Heterophyletic = What the rest of the world calls non-monophyletic.

Canonical holophyletic group = Associated with a paraphyletic group, what it would turn into if all its missing sublineages were added.

Complementary group = If I understand correctly, the subclades left out of a clade to circumscribe a paraphyletic group.

Degree of paraphyly = Number of subclades that had to be left out of a clade to circumscribe a paraphyletic group. A similar degree of polyphyly is introduced later.
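The last three terms are simple set operations on a tree, which a short Python sketch can make concrete. It assumes a strictly branching tree (one parent per item), a group that is continuous in the sense defined earlier, and my own reading that the degree of paraphyly counts the maximal subclades left out. The toy tree and all names are hypothetical and not taken from the paper.

```python
# Hypothetical toy tree, written as parent -> children.
CHILDREN = {
    "anc": ["a", "n1"],
    "n1": ["b", "n2"],
    "n2": ["c", "d"],
}

def clade(ancestor):
    """The ancestor together with all of its descendants."""
    members, stack = {ancestor}, [ancestor]
    while stack:
        for child in CHILDREN.get(stack.pop(), []):
            members.add(child)
            stack.append(child)
    return members

def describe(group, ancestor):
    """Compare a continuous group with the full clade of its common ancestor."""
    group = set(group)
    canonical = clade(ancestor)            # "canonical holophyletic group"
    if ancestor not in group or not group <= canonical:
        raise ValueError("group must contain its common ancestor and only its descendants")
    complementary = canonical - group      # "complementary group": what was left out
    parent = {child: p for p, kids in CHILDREN.items() for child in kids}
    # Each left-out item whose parent is still in the group roots one excluded subclade.
    degree = sum(1 for item in complementary if parent[item] in group)
    kind = "holophyletic" if not complementary else "paraphyletic"
    return {"canonical": canonical, "complementary": complementary,
            "degree": degree, "kind": kind}
```

For example, describe({"anc", "a", "n1", "b"}, "anc") leaves out the single subclade rooted at n2, so under these assumptions the group is paraphyletic with degree 1, whereas describe(clade("n2"), "n2") leaves nothing out.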

On page 19 Aubert defines holoclady and heteroclady to solve the same imaginary problem in exactly the same way as Podani and Vanderlaan et al. before him. From a cladist paradigm, the question whether a valid group includes only extant terminals or goes right down to the ancestor is a complete non-issue; it is only if one desperately wants to accept paraphyletic taxa that it even arises. What is more, we now have three sets of terms for the same thing. Decrease the confusion? Improve communication? I wish.

It is interesting to note that Aubert comes perilously close to a cladist view when discussing monophyletic, paraphyletic and polyphyletic groups of terminals:

This means that a holocladic pattern suggests the existence of a real situation of holophyly, whereas a heterocladic pattern only indicates a lack of holophyly: it is not then possible to formally decide between a real situation of paraphyly or polyphyly.

Translated: para- and polyphyletic groups have the same shape on a tree, while monophyletic groups are significantly different from both. Yes. That is one of the cladist arguments for accepting only the latter, and for considering para- and monophyletic so different that one shouldn't lump them into a term like 'monophyletic' sensu Aubert.
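The point can be sketched in Python for groups of terminals only. The toy tree and function names below are assumptions made for illustration, and the test used (a set of terminals is holocladic if it comprises all terminals descended from its most recent common ancestor) is my paraphrase rather than Aubert's wording.

```python
# Hypothetical toy tree, written as parent -> children; a, b, c and d are terminals.
CHILDREN = {
    "anc": ["a", "n1"],
    "n1": ["b", "n2"],
    "n2": ["c", "d"],
}
PARENT = {child: parent for parent, kids in CHILDREN.items() for child in kids}

def path_to_root(node):
    """The node followed by its successive ancestors up to the root."""
    path = [node]
    while node in PARENT:
        node = PARENT[node]
        path.append(node)
    return path

def mrca(terminals):
    """Most recent common ancestor of a non-empty set of terminals."""
    terminals = list(terminals)
    shared = set(path_to_root(terminals[0]))
    for terminal in terminals[1:]:
        shared &= set(path_to_root(terminal))
    # Walking up from any terminal, the first shared node is the most recent one.
    return next(node for node in path_to_root(terminals[0]) if node in shared)

def terminals_under(node):
    """All terminals descended from `node` (or the node itself if it is a terminal)."""
    kids = CHILDREN.get(node, [])
    if not kids:
        return {node}
    result = set()
    for kid in kids:
        result |= terminals_under(kid)
    return result

def pattern(terminals):
    """Classify a set of terminals by shape alone."""
    terminals = set(terminals)
    if terminals == terminals_under(mrca(terminals)):
        return "holocladic"
    return "heterocladic"
```

Here pattern({"c", "d"}) is holocladic, while both {"a", "b"} (a grade-like set) and {"a", "c"} (a scattered set) come out as heterocladic. The function has no way to tell those two apart, which is the situation the quoted passage describes.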

I must admit that I got somewhat lost around the definitions of stem group, crown group and basal group. Stem and crown appear to be used in the same way as by everybody else, although there is a strange excursus arguing that because people find it useful to have a word for the branch between the lineage split that created a clade and the first surviving lineage split in the clade, we should formally accept paraphyletic taxa. This does not follow, to say the least. But 'basal group' seems to be defined as all species of the clade that have already produced other descendant species. As Aubert believes that species can split off other species and still stay the same species (see Composite Species Concept) and thus be terminals themselves, the utility of this term is unclear to me.

Pages 26-27 reintroduce another previously seen paraphylist attempt at defining oneself to victory: Cladists do not classify, they only arrange, because a classification is about affinity, and of course the only meaning of affinity that Aubert accepts is phenetic similarity. Being related is not an affinity, apparently. Also, classifications should contain "similar objects". Like species, for example, which just happen to be the basal units of phylogenetic systematics?

The definitions peter out with 'cladogram', and are followed by a more structured discussion of history and philosophy of science.

So, are Aubert's definitions preferable to the currently accepted ones?

The real question is then, if the definitions provided by Aubert are supposed to be better, better in what sense? Explicitly the abstract claims:

First, that the current terminology "prevents proper communication between the proponents of either side". This is a red herring, because any set of definitions facilitates communication as soon as it is universally accepted. As the people using Haeckel's meaning of monophyly, for example, are clearly in the minority, the best course of action for somebody genuinely concerned about communication would be to tell them to use Hennig's meaning instead.

Second, that "consequently, the research in phylogenetics is globally erratic and the taxonomic classification is highly unstable." I have already given the only possible answer: Instability arises necessarily from progress in our scientific understanding of the diversity of life, regardless of what terminology we use. Classifications changed before cladism, only there was no clear criterion to settle the issue once enough data are in.

The rest of the paper supplies two additional claims, sometimes implicitly.

Third, that Aubert's definitions are to be preferred for 'morphosemantic' reasons. It is true that monophyletic is perhaps not ideal from that perspective compared to holophyletic, because the latter does stress the inclusion of all descendants of the common ancestor.

However, at least in my eyes such arguments are handily trumped by the practical consideration of minimising the confusion and disruption that would be caused by changing a widely accepted meaning. It could also be pointed out that Aubert himself is not consistent in that regard: hetero- means different, so to use heterophyletic as a synonym for what is now called non-monophyletic (~non-holophyletic sensu Aubert) seems odd from a 'morphosemantic' perspective. By his own logic, it should probably be a synonym for polyphyletic.

Fourth, that the new definitions and terms capture something or clarify or add precision to something in a way that had previously been overlooked. This and the first one (to facilitate communication) are also what I personally would consider to be the two legitimate reasons for any attempt at altering a terminology.

As should be clear from the above, I don't think they do. Several of them are clearly instances of circular argumentation, like lineage or ancestral group; more on that perhaps in subsequent posts dealing with the supposed impossibility of phylogenetic classifications etc. Others lump very different biological realities into one term, such as 'monophyletic' sensu Aubert, so they do the opposite of capturing something about nature as it is. Yet others would only lead to more confusion, such as again redefining 'monophyletic' or the new words for describing group shapes in synchronous classifications where others already exist. Many simply appear to be irrelevant.

Most importantly, however, it is all beside the point. No matter how much definitions are shuffled around, the real questions are along the lines of what classification would be most predictive, most useful, most natural, most objective, and so on. The real question is not whether I can write down "rock is a kind of cheese" and get it published somewhere, but whether I can demonstrate that the moon is indeed made from milk products.

As far as I can tell, there is essentially one actual argument in favour of paraphyletic taxa in the 54-page paper (spoiler: it is the same as Brummitt's). Of course, one argument would be enough if it were sound. But enough for now, this has got long enough.

30 comments:

Unknown, January 3, 2016 at 10:15 AM
Hi Alexander,
The first part of your post seems to be a trial of intent, i.e. an ad hominem fallacy. The entire math I use can be understood by a 17-year-old pupil, so there is no intimidation for a senior researcher. Moreover, it is perfectly impossible to study the algebraic properties of a mathematical object without mathematically defining it. So, your suggestion that I should have used only plain language definitions is at best an ad populum fallacy. The fact that I warn myself against empty precision is not a case against precision in absolute. The aim of my analysis of terminology is indeed to assess the consistency (or nonemptiness) of the associated concepts.

The fact of calling “paraphylists” evolutionary systematists is quite provocative. Evolutionary systematists are not obsessed with paraphyly; but cladists are really obsessed with holophyly. Cladists could be called holophylists, whereas evolutionary systematists would be better called monophylists.

So now, concentrating on the more relevant parts of your post. The word lineage can be understood at an individual level as well as at a populational level. My definition fits both. The way I define an ancestor has absolutely no consequence on our debates. You may be unfamiliar with math papers, but it is a very common trick used by many. Kwok (2011) uses the same, you should read it.

Again, your worry about my definition of a group is a non-issue. A set exists as long as its elements exist. The cladist usage of this verb (as well as quotes…), like “fish don’t exist”, is only propaganda. The set of my cat and my keys in my pocket exists because both exist. Relevancy and existence are clearly distinct concepts.

Moreover, a group can be ancestral to another without all the members of the former being ancestral to the latter. Just like you can be the parent of a child without all your cells being ancestral to the zygote. In the limit case, the colony of cells you are calling “me” becomes a paraphyletic group when you have children.

Concerning chloroplasts, you don’t seem to understand that in order to make cyanobacteria truly holophyletic you must include all plants. Plants didn’t arise through gene transfer, but through complete lineage merging.
Unknown, January 3, 2016 at 10:15 AM
Your treatment of my utilitarian argument is quite unfair. Evolutionists need the concept of monophyly sensu Haeckel to express their ideas. But you say that since the concept of monophyly sensu evolutionists is wrong, there shall be no word to express it, and this is an intended feature. So “you are wrong, now shut up!” Maybe I should have called it the anti-authoritarian argument. This kind of hijacking of language so as to prevent opponents from making their point makes me think more of Orwell’s Newspeak than a fair scientific debate. Secondly, the fact that “clan” was coined to fill this gap doesn’t undermine my argument since “clan” does not fit in the X-phyletic series of words. Your suggestion that evolutionists should coin a new word themselves is again provocative. Ok, polyphyletic means “many origins”, and now I would like to coin a new word meaning “one origin”, let’s see how we can say “one” in Greek… Ah yes, it’s “mono-” (sarcasm) … Furthermore, several authors have used the words diphyletic and triphyletic for polyphylies of degree 2 and 3 respectively.

You recognize yourself that holophyletic is a better word for your concept of “monophyly”. So just use it. As monophyly is not a useful concept to you, you won’t use it, so there will be no ambiguity in cladist papers. Holophyly is recognized as a non-ambiguous synonym, so there is no reason not to use it, besides political (authoritarian) reasons. Moreover, early cladists didn’t worry so much about disrupting communication when they hijacked “monophyly” from its original meaning. On your side, you may be right that heterophyletic is not morphosemantically ideal, however this is really a side issue. I didn’t coin it (Zander did), merophyletic could have been better but “mero” and “hetero” have quite the same interpretation, whereas “holo” and “mono” have clearly opposite meanings. However, if you really want to correct this, it is really easy to do.

You seem to overlook that systematics is not only a biological science, but also a classificatory science. As such, it should use a precise classificatory terminology. For example, holoclady and holophyly are really different concepts, the former being based on set theory and the latter on graph theory. Their logical and biological properties are thus quite different, and your arguments against the “composite species concept” rely entirely on this unrealistic conflation. Furthermore, you do not seem to have noticed that I have argued that the distinction between synchronous and diachronous classification is irrelevant. Same remark concerning “classify” and “arrange”, and other classificatory terms. Some cladists have recognized that their “phylogenetic classification” should not be called a “classification”, but rather an arrangement or a systematization (see Griffith).

As a last point, I will simply point out that you can make scientific progress in understanding the relationships between things without changing the names of these things. Instability in taxonomy does not arise from progress, but from cladist dogma against paraphyly. Saying the contrary is again propaganda. Haeckel already knew 150 years ago that reptiles were paraphyletic. Saying today that “reptiles don’t exist” is not progress in knowledge. Furthermore, evolutionists do take into account phylogenetic results in order to make a better classification.
Unknown, January 6, 2016 at 8:40 AM
I disagree that if your concern is "clear communication, then stats on how many people use the various terms are the ONLY thing that is relevant". It is a political view. If you are a scientist, clear communication implies proper terminology, rigorous concepts and a fair way of rebutting your opponents. Again, I see your attachment to using "monophyly" instead of holophyly as an authoritarian issue.

Ranks are relative, not absolute, so defining what is a family or an order in absolute is of course nonsensical, as everyone knows.

I know punctualism is a variant of gradualism, I even said it in my very paper.

So, no, I don't need saltationism.

Of course there are no real gaps and long branches are full of transitional fossils. But it is irrelevant to the fact that there are stasis and revolutions. Revolutions don't need to be instantaneous. A revolution is an acceleration of history, not a point in time.

So how to cleave off paraphyletic residue in an objective manner? To do so, the best way is to search the fossil record for an acceleration of evolution followed by an adaptive radiation. The cut can be made at the key innovation, i.e. the one directly responsible for the burst. Interestingly, the case of birds matches exactly this scenario, see for example Brusatte et al. (2014) Gradual Assembly of Avian Body Plan Culminated in Rapid Rates of Evolution across the Dinosaur-Bird Transition. Patrocladistic classification and other methods can be considered as heuristics to discover these leaps.
Unknown, January 7, 2016 at 1:07 AM
You persist in not understanding that evolutionary systematists classify according to the PROCESS of evolution. Your "long branch" argument is only about PATTERN. So yes, it is completely irrelevant to me and to any evolutionist. Saying that patristic distance doesn't exist is nonsensical, there are simply more or less intermediate species along the different branches. So again, we don't need abrupt gaps. A leap is not a gap. Gaps are discrete whereas leaps are continuums. Not understanding this is what I call metaphysical thinking, the contrary being dialectical thinking. Or should I find a German word to express this idea? Aufhebung?
Unknown, January 8, 2016 at 6:31 PM
I do not conflate mother-daughter with fish-tetrapod, but I have shown that the topological properties of the former are inherited by the latter, simply by zooming out.

Why couldn't I cleave a continuum into two same-level groups? The continuum is not homogeneous, so I can perfectly decide by using heterogeneities. So Tiktaalik is a fish, since the rapid burst of tetrapod speciation happened just after. When a revolution happens there is always a before and an after. So first of all, there is no problem in contrasting before/after, i.e. paraphyly/autophyly. Secondly, the revolutionary period is necessarily fuzzy, so there is always a small degree of subjectivity when deciding exactly where to cut. But the fuzzy period itself can be clearly and objectively determined and it is not a problem to acknowledge reality; historians have the same kind of dilemma when cutting historical periods from the continuum of history. Finally, in practice it is very rare to have a fossil for which we cannot decide whether it is before or after the revolution; usually a key innovation can be causally linked to the adaptive radiation and so serves as an objective cut (not to mention when we don't have any fossil of that period).
Dan Haug, January 9, 2016 at 9:10 AM
For the moment, let's set aside the dubious merits of patrocladistic methods and Linnean ranks. The ranking of paraphyletic residues is a separate question from whether parent grade / daughter clade is a useful way to think about evolution. I think it can be.

For any living species, a nested set of clades represents an increasingly general categorization as well as an ancestor-descendant lineage. The ancestral Hyla is descended from the ancestral frog is descended from the ancestral lissamphibian; the ancestral Passer is descended from the ancestral bird is descended from the ancestral amniote; both are descended from the ancestral tetrapod. But if I want to talk about some fossil tetrapods that illustrate that divergence, I'll need to remove clades in order to be more specific. No, not just any tetrapod, a non-amniote non-lissamphibian tetrapod ("labyrinthodont?").

In most cases, yes, clades are more informative than grades. While they circumscribe the same extant species, the difference between the old paraphyletic "Amphibia" and the crown clade Lissamphibia is about 100 million years of evolution and a long list of apomorphies, some of which can't be inferred from fossils. In other cases, the differences are minimal to non-existent, depending on the preferred phylogenetic hypothesis (gymnosperms). A strongly supported grade (i.e. non-avialan maniraptorans) is better than a weakly supported potential side branch (deinonychosaurids) if you're studying bird evolution and would like stable names.

The important thing is having convenient and unambiguous words available to describe groups of evolutionary, ecological, or economic interest and make valid inferences about their characters and evolutionary relationships.