Topological constraints in early multicellularity favor reproductive division of labor

1. School of Physics, Georgia Institute of Technology, United States
2. Interdisciplinary Graduate Program in Quantitative Biosciences, Georgia Institute of Technology, United States
3. School of Biological Sciences, Georgia Institute of Technology, United States
Research Article

Abstract

Reproductive division of labor (e.g. germ-soma specialization) is a hallmark of the evolution of multicellularity, signifying the emergence of a new type of individual and facilitating the evolution of increased organismal complexity. A large body of work from evolutionary biology, economics, and ecology has shown that specialization is beneficial when further division of labor produces an accelerating increase in absolute productivity (i.e. productivity is a convex function of specialization). Here we show that reproductive specialization is qualitatively different from classical models of resource sharing, and can evolve even when the benefits of specialization are saturating (i.e. productivity is a concave function of specialization). Through analytical theory and evolutionary individual-based simulations, we demonstrate that reproductive specialization is strongly favored in sparse networks of cellular interactions that reflect the morphology of early, simple multicellular organisms, highlighting the importance of restricted social interactions in the evolution of reproductive specialization.

Introduction

The evolution of multicellularity set the stage for unprecedented increases in organismal complexity (Szathmáry and Smith, 1995; Knoll, 2011). A key factor in the remarkable success of multicellular strategies is the ability to take advantage of within-organism specialization through cellular differentiation (Queller and Strassmann, 2009; Brunet and King, 2017; Cavalier-Smith, 2017). Reproductive specialization, which includes both the creation of a specialized germ line during ontogeny (as in animals and volvocine green algae) and functional differentiation into reproductive and non-reproductive tissues (as in plants, green and red macroalgae, and fungi), may be especially important (Cooper and West, 2018; Michod et al., 2006; Ispolatov et al., 2012; Solari et al., 2013; Michod, 2007; West et al., 2015). Reproductive specialization is an unambiguous indication that biological individuality rests firmly at the level of the multicellular organism (Michod, 1999; Folse and Roughgarden, 2010), and is thought to play an important role in spurring the evolution of further complexity by inhibiting within-organism (cell-level) evolution (Buss, 1988) and limiting reversion to unicellularity (Libby and Ratcliff, 2014). Despite the central importance of reproductive specialization, its origin and further evolution during the transition to multicellularity remain poorly understood (McShea, 2000).

The origin of specialization has long been of interest to evolutionary biologists, ecologists, and economists. A large body of theory from these fields shows that specialization pays off only when it increases total productivity, compared to the case where each individual simply produces what they need (Szathmáry and Smith, 1995; Smith and Szathmáry, 1997; Goldsby et al., 2012; Corning and Szathmáry, 2015; Hidalgo and Hausmann, 2009; Boza et al., 2014; Taborsky et al., 2016; Page et al., 2006; Rueffler et al., 2012; Szekely et al., 2013; Findlay, 2008; Amado et al., 2018). Certain types of trading arrangements maximize the benefits of specialization; highly reciprocal interactions, which facilitate exchange between complementary specialists, amplify cooperation (Allen et al., 2017; Pavlogiannis et al., 2018). Still, previous work finds that even when groups grow in an ideal spatial arrangement, increased specialization and trade is only favored by natural selection when productivity increases as an accelerating function of the degree of specialization (i.e., productivity is a convex, or super-linear, function of the degree of specialization). Conversely, saturating functional returns (i.e. productivity is a concave, or sub-linear, function of the degree of specialization) should inhibit the evolution of specialization (Cooper and West, 2018; Michod et al., 2006; Ispolatov et al., 2012; Solari et al., 2013; Michod, 2007; West et al., 2015).

Reproductive specialization differs from classical models of trade in several key respects. Trade between germ (reproductive) and somatic (non-reproductive) cells is intrinsically asymmetric, because the cooperative action, multicellular replication, is not a product that is shared evenly. Selection acts primarily on the fitness of the multicellular group as a whole (Folse and Roughgarden, 2010). As a result, optimal specialization can result in behaviors that reduce the short-term fitness of some cells within the multicellular group (Michod et al., 2006; Michod, 2007), often manifest as reproductive altruism.

Understanding the evolution of cell-cell trade, a classic form of social evolution (Kirk, 2005), requires understanding the extent of between-cell interactions. Network theory has proven to be an exceptionally powerful and versatile technique for analyzing social dynamics (Wey et al., 2008; Lieberman et al., 2005), and indeed, is uniquely well suited to understanding the evolution of early multicellular organisms. When cells adhere through permanent bonds, sparse network-like bodies (i.e. filaments and trees) often result (Amado et al., 2018). This mode of group formation is not only common today among simple multicellular organisms (Umen, 2014; Claessen et al., 2014), but is the dominant mode of group formation in the lineages evolving complex multicellularity (i.e. plants, red algae, brown algae, and fungi, but not animals).

In this paper, we develop and investigate a model for how the network topology of early multicellular organisms affects the evolution of reproductive specialization. We find that under a broad class of sparse networks, complete functional specialization can be adaptive even when returns from dividing labor are saturating (i.e. concave/sub linear). Sparse networks impose constraints on who can share with whom, which counterintuitively increases the benefit of specialization (McShea, 2000). By dividing labor, multicellular groups can capitalize on high between-cell variance in behavior, ultimately increasing group-level reproduction. Further, we consider group morphologies that naturally arise from simple biophysical mechanisms and show that these morphologies strongly promote reproductive specialization. Our results show that reproductive specialization can evolve under a far broader set of conditions than previously thought, lowering a key barrier to major evolutionary transitions.

Model

Reproductive specialization can be modeled as the separation of two key fitness parameters, those related to either viability or fecundity, into separate cells within the multicellular organism (Michod, 2006; Folse and Roughgarden, 2010). The dichotomy of viability versus fecundity was originally used by Michod, 2006 to partition components of cellular fitness into actions that contribute to keeping a cell alive (viability), and actions that directly contribute to reproduction (fecundity). Multicellular organisms often have evolved to divide labor along these two lines (i.e. reproduction by germ cells and survival provided by somatic cells), while their unicellular ancestors had to do both. We define viability as activities keeping the cell alive (e.g. investing in cellular homeostasis or behaviors that improve survival), and fecundity as activities involved in cellular reproduction.

At the cellular level, there appears to be a fundamental asymmetry in how viability effort and fecundity effort can be shared among cells: while multicellular organisms readily evolve differentiated cells that are completely reliant on helper cells (i.e. glial cells that support neurons in animals or companion cells that support sieve tube cells in plants), no cell can directly share its ability to reproduce. To better understand the intuition behind this, consider a cell that elongates prior to fission. This cell must grow to approximately twice its original length. Two cells cannot elongate by 50% and then combine their efforts; elongation is an intrinsically single cell effort. We thus use a model in which viability can be shared across connected cells, but fecundity cannot be shared (note, in order to test the sensitivity of our predictions to this assumption, in a later section we will consider the more general case in which viability and fecundity can both be shared, but by different amounts).

We consider a model of multicellular groups composed of clonal cells that each invest resources into viability and fecundity. Because there is no within-group genetic variation, within-group evolution is not possible, though selection can act on group-level fitness differences. Specifically, we consider the pattern of cellular investment in fecundity and viability, and their sharing of these resources with neighboring cells within the group, to be the result of a heritable developmental program. Thus, selection is able to act on the multicellular fitness consequences of different patterns of cellular behavior within the group. We let v denote each cell’s investment into viability, and b denote each cell’s investment into fecundity. Each cell's total investment is constrained so that $v+b=1$. However, a cell's return on its investment is in general nonlinear. Here, we let α represent the ‘return on investment exponent': by tuning α above and below 1.0, we can simulate conditions with accelerating and saturating (i.e. convex and concave, or super- and sub-linear) returns on investment, respectively. We let $v~$ and $b~$ represent a cell’s return on viability and fecundity investments, respectively. Following Michod, 2006; Michod and Roze, 1997, we calculate a cell’s reproductive output as a multiplicative function of $v~$ and $b~$ (thus, both functions must be positive for a cell to grow). A single cell’s reproduction rate is $w=v~?b~=vα?bα$. At the group level, fitness is the total contribution of all cells in the group toward the production of new groups (i.e. group level reproduction). The group level fitness is thus the sum of $v~?b~$ over all cells.

Finally, cells may share the products of their investment in viability with other cells to whom they are connected.For a given group, the details about who may share with whom, and how much, is encoded in a weighted adjacency matrix $??$. The element $ci?j$ defines what proportion of viability returns cell i shares with cell j. Cells cannot give away all of their viability returns, as they would no longer be viable; mathematically, we count a cell among its neighbors and thus ensure that they always ‘share’ a positive portion of viability returns with themselves, so that $ci?i>0$. Furthermore, since a cell cannot share more viability returns than the total it possesses, we have $∑i=1Ncji=1$ for a group of N cells. For the networks we consider, each cell takes a fraction β of its viability returns and shares that fraction equally among all of its $ni$ neighbors (including itself), and keeps the rest of its returns $1-β$ for itself. Therefore cell i keeps a total fraction of $1-β+βni$ of its returns for itself and gives $βni$ to each of its non-self neighbors. In other words, $cii=1?β+βni$, $ci?j=βni$ if cells i and j are connected, and $ci?j=0$ if cells i and j are not connected. This means the total amount of returns kept by cell i depends on both the network topology and β. When $β=0$ there is no sharing, and when $β=1$ cells share everything equally among all connections and themselves. We refer to β as interaction strength. A given group topology (unweighted adjacency matrix) and β completely specify $??$.

Within a group of N cells, the overall returns on viability that a given cell enjoys, then, comprises its own returns as well as whatever is shared with it by other members of the group. This can be written as $v~i=viαcii+∑j≠invjαcji$, or equivalently, $v~i=∑jnvjαcji$. Note that this is a column sum, since it describes the total incoming viability returns a cell receives as a result of its own effort and trade with neighboring cells. Therefore, we write the group level reproduction rate (i.e. the group fitness) for a group of N cells as

(1) $\begin{array}{ll}W& =\sum _{i=1}^{i=N}{\stackrel{~}{b}}_{i}?{\stackrel{~}{v}}_{i}\\ W& =\sum _{i=1}^{i=N}{\stackrel{~}{b}}_{i}\sum _{j=1}^{j=N}{v}_{j}^{\alpha }{c}_{ji}\\ W& =\sum _{i=1}^{i=N}\sum _{j=1}^{j=N}{b}_{i}^{\alpha }{c}_{ji}{v}_{j}^{\alpha },\end{array}$

where all three of the above equations are equivalent. We investigate evolutionary outcomes under this definition of group level fitness for groups with different topologies (who shares with whom), and in scenarios with various return on investment exponents α.

Results

Fixed resource sharing

We first consider cases wherein cells within a group share across fixed intercellular interactions. In each case we vary the return on investment exponent, α, between 0.5 and 1.5, and the interaction strength, β, between 0.0 and 1.0, both in increments of 0.1. For each combination of topology, α, and β, the group investment strategy ($vi$ for all i) was allowed to evolve for 1000 generations.

We begin with simple topologies: groups with no connections and groups that are maximally connected. They represent, respectively, the case in which all cells within the group are autonomous and the case in which every cell interacts with all others (i.e. a ‘well-mixed’ group). In the absence of interactions, cells cannot benefit from functions performed by others and therefore must perform both functions v and b; hence specialization is not favored, and does not evolve. In the fully connected case, a high degree of specialization is observed for many values of α and β (Figure 1a). Consistent with classic results (Cooper and West, 2018; Michod et al., 2006; Ispolatov et al., 2012; Solari et al., 2013; Michod, 2007; West et al., 2015), specialization is only achieved in the fully connected case for $α>1$.

Figure 1

Next, we consider a simple sparse network in which each cell within a group is connected to only two other cells, forming a complete ring (Figure 1b); we refer to this as the neighbor network. Surprisingly, preventing trade between most cells encourages division of labor. We find that specialization evolves even when $α<1.0$, that?is, when the returns on investment are saturating or concave. In our simulations, this topology leads to alternating specialists in viability and fecundity (Figure 1b). Analytically, we find that this topology always favors at least some degree of specialization whenever $α>34?β$.

We next study a network with cells that can be separated into two disjoint sub-groups, where every edge of the network connects a cell in one sub-group to a cell in the other sub-group and no within sub-group connections exist, that?is, a bipartite graph (Figure 1c). We refer to the specific network structure in Figure 1c as the ‘balanced bipartite’ network. We find that specialization evolves even when $α<1.0$, similar to the neighbor network. However, we find that specialization evolves for a wider range of α and β values for the balanced bipartite network than for the neighbor network.

We can analytically determine under what conditions complete generalization is optimal. The complete generalist investment strategy is where every cell in the group invests equally into viability and fecundity, defined as: $vi*=12$ for all i. For these simple topologies, the complete generalist strategy is either a maximum or a saddle point, depending on the values of α and β. Complete generalization is only favored when the Hessian evaluated at the generalist investment strategy $?2W?vk?v?|v→?=H?$ is negative definite, that?is, all of its eigenvalues are negative. The largest eigenvalues of the Hessian for the complete, neighbor, and balanced bipartite networks are $α?(12)2?α-3?(-1+α?β)$, $α?(12)2?α-3?(-1+43?α?β)$, and $α?(12)2?α-3?(-1+2?NN+2?α?β)$, respectively. When α and β are chosen so that the largest eigenvalue becomes non-negative, complete generalization cannot maximize group fitness.

While we have not analytically shown where the fitness maximum occurs in cases where the generalist strategy becomes a saddle point, evolutionary simulations (Figure 1) suggest that when complete generalization is not a fitness maximum, a high degree of (or even complete) specialization typically does maximize fitness.

In all cases in which complete specialization is achieved in evolutionary simulations, $v~?b~$ terms for viability specialists go to zero, as they cannot reproduce on their own. Furthermore, the fecundity specialists are entirely reliant on the viability specialists for their survival; if viability sharing were suddenly prevented, their $v~?b~$ terms would also be zero. This amounts to complete reproductive specialization (Cooper and West, 2018; Kirk, 2005; Michod, 2006).

Evolving resource sharing

Until now, sharing has been included in every intercellular interaction within groups. Here, we consider the case in which there is initially no sharing, and sharing must evolve along with specialization. These simulations begin with no resource sharing (i.e. $β=0$); during every round, each group in the population has a 2% chance that a mutation will impact its developmental program, and the β value for one of its cells will change. The new β value is chosen from a truncated Gaussian with standard deviation of 10% of the mean, centered on the current value. Whatever is not retained is shared equally across all interactions, including the self term.

Evolutionary simulation results are similar to those from the fixed-sharing model (Appendix 1—figure 1). Saturating specialization (i.e. specialization despite a concave return function) still occurs for the neighbor and balanced bipartite topologies. Thus, for both fixed and evolved resource sharing, we observe specialization for the largest range of parameters (including $α<1$) not when the group is maximally connected, but rather when connections are fairly sparse. Therefore, a sparse group topology constitutes a cooperation-prone physical substrate that can favor the evolution of cellular.

As an example of the benefit of evolving sharing, consider that the maximum fitness according to Equation 1 for a group of N disconnected cells scales as $N?(12)2?α$. On the other hand, for the balanced bipartite network with a complete specialization strategy (i.e. $v→=?0,1,0,1,…?$), the fitness scales as $(N2?β2?N+2)$. The ratio of these fitnesses is $(N2?β2?N2+2?N)?22?α≈β?22?α-1$, where the approximation is for large N. So for larger groups and when $α>12-log?β2?log?2$, if a group can evolve resource sharing (i.e. letting $β→1$ and adopting the specialist investment strategy) its maximum fitness will increase.

Benefit of specialization

We now consider a simple example to highlight why specialization can be adaptive despite saturating (i.e., concave) returns from trade. Consider groups of four cells, connected via the nearest-neighbor topology (i.e. in a ring). We directly calculate the group-level fitness of generalists and specialists for two scenarios: $α=0.9$ and $α=1$ by summing the contributions of each cell within these groups (Figure 2). In this simple scenario, reproductive specialization strongly increases group fitness (33% for $α=1$ and 16% for $α=0.9$).

Figure 2

The benefit of specialization in neighbor networks increases with group size. For a ring of size N, fitness under the specialist strategy $v→=?0,1,0,1?…?$ is $W=β3?N$. For a ring of generalists the fitness is $W=N?(12)2?α$. Therefore, whenever $α>log?3-log?β2?log?2$, the ring of complete specialists enjoys a greater fitness than the ring of complete generalists. Again, note that complete generalization becomes disfavored when $α>34?β$, so there is a narrow regime where $34?β<α during which neither complete generalization nor complete specialization is optimal. Numerical optimization and evolutionary simulations suggest that even in this region, however, the specialization score of the optimal strategy is large (Figure 1).

Effect of sparsity

Surprisingly, saturating specialization appears to be the rule, rather than the exception, for sparsely connected graphs. We investigated Erd?s-Rényi random graphs with varying degrees of connectivity to systematically examine the relationship between sparsity and the value of α at which specialization is favored. We find that many randomly assembled graphs obtain maximum fitness through complete reproductive specialization even when α is below 1 (Figure 3b,c). It is only at the extremes of sparsity and connectivity (near the fully connected or fully unconnected points) that generalists maintain superior fitness for all values of $α<1$. We further show that this general trend is independent of the size of a group; saturating specialization is favorable for groups of size $N=10$, $N=100$, and $N=1000$. When network connectivity is at its minimum, the group consists solely of isolated cells that cannot interact. Under these conditions generalists are favored. Similarly, at maximum connectivity every cell interacts with every other cell. Under these conditions generalists are favored unless $α?β>1$. However, when connectivity is small but not zero, specialization arises most readily. We conjecture that the troughs in Figure 3b, where specialization occurs for the lowest values of α, occur when connectivity is just large enough so that the existence of a spanning tree is more likely than not.

Figure 3

Filaments and trees

Sparse topologies like the neighbor network configuration have significant biological relevance, and direct ties to early multicellularity. The first step in the evolution of multicellularity is the formation of groups of cells (Szathmáry and Smith, 1995; Kirk, 2005; Willensdorfer, 2008; Bonner, 1998; Fairclough et al., 2010). Simple groups readily arise through incomplete cell division, forming either simple filaments (Figure 4a) or tree-like morphologies (Figure 4b; Bengtson et al., 2017b; Droser and Gehling, 2008; Berman-Frank et al., 2007; Ratcliff et al., 2012). Filament topologies have been widely observed in independently-evolved simple multicellular organisms, from ancient fossils of early red algae (Butterfield, 2000; Figure 4a) to extant multicellular bacteria (Claessen et al., 2014) and algae (Umen, 2014). Branching multicellular phenotypes have also been observed to readily evolve from baker’s yeast (Ratcliff et al., 2015; Figure 4b), and are reminiscent of ancient fungus-like structures (Bengtson et al., 2017a) and early multicellular fossils of unknown phylogenetic position from the early Ediacaran (Droser and Gehling, 2008).

Figure 4

Simulations of populations of groups with filamentous and branched topologies reveal that specialization is indeed favored in the sub-linear regime (Figure 4a and b) ; conversely, sub-linear specialization is never observed for fully connected topologies (Figure 4c). While the generalist strategy is never a critical point for these networks (which have $??≠??T$, see Materials and methods), we conjecture that there is a nearby critical point which maximizes fitness at small values of α and becomes unstable at larger values of α. We introduce a new metric, $α*$, defined as the value of α such that the largest (least negative) eigenvalue of the Hessian evaluated at the complete generalist strategy is zero when $β=1$. For topologies in which each member has the same number of neighbors, $α*$ is a critical value at which generalization is no longer an optimal strategy. However, even for groups where the number of neighbors for each cell varies, we can still use $α*$ as a proxy for how amenable a topology is to saturating specialization. The smaller $α*$, the more specialization is likely to be favored. We plot vertical lines where $α=α*$ (solid?lines?in ?Figure 4(a) Figure 4(b)), and dotted lines to indicate roughly where the simulation curves cross specialization of 0.5. These results show that, for these topologies, $α*$ acts as an effective metric for how amenable a network is to saturating specialization. This metric $α*$ only depends on topology and can in principle be calculated analytically given any network. We examined the value of $α*$ as filaments and a variety of tree-like structures grow larger, and find that specialization becomes more strongly favored (Figure 4D ).?While group size has no effect on specialization for some topologies, like the neighbor network, filaments and trees all see a decrease in $α*$ as group size increases; $α*$ eventually plateaus once groups are larger than a few tens of cells. Simple and easily accessible routes to multicellular group formation can readily evolve in response to selection for organismal size (Ratcliff et al., 2012), and this process may also strongly favor the evolution of cellular differentiation (McCarthy and Enquist, 2005; Heim et al., 2017; McClain and Boyer, 2009; Bonner, 1998).

Mean field model

Finally, to capture some general principles underlying this phenomenon, we consider a mean-field model with N cells?(N?>>?1), each of which is connected to z other cells. For simplicity we consider the case in which $β=1$ and $α=1$. We pick $α=1$ as at this point, if the fitness of specialists is greater than that of generalists, specialization will be favored for at least some values of $α<1$. If the fitness of generalists is greater than or equal to that of specialists, specialization will only be favored if $α>1$.

For generalists, the fitness is simply $WG=N/4$, as each cell has $v=1/2$ and $b=1/2$ (before and after sharing). Viability specialists produce $v=1$ and $b=0$, while fecundity specialists produce $v=0$ and $b=1$. Viability specialists then share $v=1/(z+1)$ with each of their z neighbors. After sharing, fecundity specialists receive $v=1/(z+1)$ from each of their viability specialist neighbors. But how many of their neighbors are viability specialists? We label the fraction of cells connected to fecundity specialists that are viability specialists f, that?is, f is the mean number of viability specialists connected to each fecundity specialist divided by z, averaged over all fecundity specialists. For a bipartite graph, $f=1$; for a randomly connected graph on which half of cells are viability specialists and half of cells are fecundity specialists, $f=1/2$. Group fitness is thus:

(2) ${W}_{S}=\frac{z?f?N}{2?\left(z+1\right)}.$

Here, $z?f/(z+1)$ is the average viability returns each fecundity specialist has received after sharing, which is multiplied by the amount of fecundity each fecundity specialist has (1) and the number of fecundity specialists ($N/2$). Writing $WS$ in terms of $WG$:

(3) ${W}_{S}=\frac{2?z?f?{W}_{G}}{z+1}.$

Specialists will be favored if the ratio $WS/WG>1$. This will be true if:?

(4) $f>\frac{z+1}{2?z},$

which reduces to:

(5) $f>\frac{1}{2}+\frac{1}{2?z}.$

This inequality implies that specialization will only be favored if fecundity specialists are preferentially connected to viability specialists, that?is, if $f>1/2$. Further, for a fully connected network $f=1/2$, so this inequality is never satisfied, that?is, specialists cannot have larger fitness than generalists for $α=1$ and fully connected topologies, as classically predicted.

Further, f cannot be more than 1, so if the threshold from the inequality in Equation 5 is greater than or equal to 1, specialization cannot be favored for $α<1$. Thus, specialization for $α<1$ is only possible if:

(6) $\frac{1}{2}+\frac{1}{2?z}<1,$

which reduces to: $z>1$. This again reproduces a classic result: specialization for $α<1$ is not possible for disconnected cells.

This analysis allows us to interrogate specific cases. For example, if $z=3$, f must be greater than 2/3, while if $z=4$, f must only be greater than 5/8. Can such networks be constructed? The answer will depend on both the number of cells and how they are connected. Ultimately, the question of if a graph can be made with particular values of f and z is a graph coloring problem, and beyond the scope of this manuscript. However, this inequality presents a useful heuristic which can be used to determine if specialization is favored by measuring just a few properties of the graph.

Effect of varying ratios of specialists

We now allow the fraction of fecundity specialists to be X (rather than forcing $X=1/2$). For generalists, the group fitness is unchanged, $WG=N/4$, while for specialists the group fitness is:

(7) ${W}_{S}=\frac{z?f?X?N}{z+1}.$

Writing $WS$ in terms of $WG$ gives:

(8) ${W}_{S}=\frac{4?z?f?X?{W}_{G}}{z+1}.$

Specialists will be favored if the ratio $WS/WG>1$. This will be true if:?

(9) $f>\frac{z+1}{4?X?z}=\frac{1}{4?X}+\frac{1}{4?X?z}.$

Compared to the threshold value of f when $X=1/2$, if $X>1/2$, that?is, more than half of cells are fecundity specialists, the value of f necessary for specialization to be favored is lower. If $X<1/2$, the threshold value of f is higher than if $X=1/2$. In other words, 1:2 is different from 2:1, and they both are different from 1:1. Once again, the question of if a particular configuration can be created–and how–is a graph coloring problem beyond the scope of this manuscript. However, this mean field heuristic gives us some information about how to expect graphs with different ratios of specialists to generalists to behave.

We again ask what must be true for f to be less than 1 (if $f>1$, specialization will not be favored). Thus, specialization is only possible if:

(10) $\frac{1}{4?X}+\frac{1}{4?X?z}<1,$

which reduces to:

(11) $X>\frac{1}{4}+\frac{1}{4?z}.$

For a mean field model, specialization with $α<1$ is impossible if fewer than one fourth of cells are fecundity specialists. We stress here that this is a mean field model, and does not apply to scenarios in which cells have a wide range of values of z. If such networks do or do not favor specialization for $α<1$ will again be a graph coloring problem.

Discussion

During the evolution of multicellularity, formerly autonomous unicellular organisms evolve into functionally-integrated parts of a new higher?level organism (West et al., 2015; Michod and Nedelcu, 2003). Evolutionary game theory (Corning and Szathmáry, 2015; Nash, 1950; Smith, 1988) argues that functional specialization should only evolve when increased investment in trade increases reproductive output. Conventionally, this requires returns from specialization to be accelerating, that?is, convex or super-linear (Szathmáry and Smith, 1995; Smith and Szathmáry, 1997; Goldsby et al., 2012; Corning and Szathmáry, 2015; Boza et al., 2014; Taborsky et al., 2016; Page et al., 2006; Rueffler et al., 2012; Szekely et al., 2013). While this idea is intuitive, it is, in the case of fixed group topology, also overly restrictive. In this paper, we explore how social interactions within groups, measured by their network topology, affect the evolution of reproductive specialization. Indeed, when all cells within groups interact (with equal interaction strength), returns on investment must be an accelerating, that?is, convex, function of investment for specialization to evolve (Figure 1a; Szathmáry and Smith, 1995; Smith and Szathmáry, 1997; Corning and Szathmáry, 2015; Cooper and West, 2018). Yet for a broad class of sparsely?connected networks, complete specialization can evolve even when the viability and fecundity return on investment curves are saturating, that?is, concave (Figure 3).

To understand how specialization can be favored despite concave return on investment (ROI) curves, consider Jensen's inequality. Jensen's inequality states that for a convex function $F?(x)$, $?F?(x)?>F?(?x?)$, that?is, the average value of $F?(x)$, $?F?(x)?$, is larger than $F?(?x?)$, where $?x?$ is the average value of x. A corollary of Jensen's inequality is that the opposite is true for concave functions, that?is, for a concave function $G?(x)$, $?G?(x)?. Jensen’s inequality guarantees that for concave ROI functions generalists produce more total viability and fecundity than specialists, and that for convex ROI functions specialists produce more total viability and fecundity than generalists.

Crucially, however, Jensen's inequality does not connect ROI convexity/concavity to group fitness. Jensen’s inequality relates the degree of specialization to the average viability and average fecundity produced, but does not itself say anything about group fitness, which is the product of viability and fecundity averaged across all cells. For fully connected topologies (i.e. Figure 4c), greater absolute productivity proportionally increases group fitness, and differentiation can only evolve with accelerating benefits of specialization. This is not the case for topologically structured organisms, where fitness also depends on how complementary specialist cells are connected. Natural selection acts on realized productivity, that?is, average $v?b$; mutations that increase average v or average b without increasing average $v?b$ are not adaptive. The importance of connecting complementary specialists has long been appreciated in other contexts, such as metabolic cross-feeding, for which it has been shown that the spatial arrangement of unlike specialists plays a key role in determining their productivity (and thus fitness) (Co et al., 2020). Indeed, While Jensen's inequality ensures that generalists will produce more viability and fecundity than specialists given a concave ROI function, specialization can still increase the fitness of topologically structured groups by increasing realized productivity.

Rather than being unusual, networks favoring specialization readily arise as a consequence of physical processes structuring simple cellular groups (Allen et al., 2017). For example, septin defects during cell division create multicellular groups with simple graph structures (Figure 4a and b), where cells are connected only to parents and offspring (Bengtson et al., 2017b; Droser and Gehling, 2008; Ratcliff et al., 2012; Ratcliff et al., 2013). If cells share resources only with physically-attached neighbors, then the physical topology of the group describes its interaction topology, and these sparse networks strongly favor reproductive specialization. Finally, we note that the primary benefit of sparsity is that sparse networks are likely to be at least somewhat bipartite. The more bipartite-like a network is, the less effort is wasted, and the easier it is for specialization to be favored.

Disentangling the evolutionary underpinnings of ancient events is notoriously difficult. Still, it is worth examining the independent origins of complex multicellularity, which are independent runs of parallel natural experiments in extreme sociality. Complex multicellularity (large multicellular organisms with considerable cellular differentiation) has evolved in at least five eukaryotic lineages, once each in the animals (King, 2004), land plants (Kenrick and Crane, 1997), and brown algae (Silberfeld et al., 2010), two or three times in the red algae (Cock and Collén, 2015; Yoon et al., 2006), and 8–11 times in fungi (Nagy et al., 2018). In all cases other than animals, these organisms form multicellular bodies via permanent cell-cell bonds, creating long-lasting highly structured cellular networks. Both fossil and phylogenetic evidence suggests that early multicellular organisms in these lineages were considerably less complex, growing as relatively simple graph structures. For example, 1.2 billion year old red algae formed linear filaments of cells (Butterfield, 2000), basal multicellular charophyte algae formed circular sheets of cells radiating from a common center (Kenrick and Crane, 1997), the ancestor of the brown algae likely formed a branched haplostichous thallus that was either filamentous or pseudoparenchymatous (Silberfeld et al., 2010), and hyphal fungi are primarily composed of linear chains of cells. Much less is known about the topology of animals prior to the evolution of cellular specialization. One hypothesis is that early metazoans resembled extant colonial choanoflagellates (Fairclough et al., 2013), the closest-living protistan relatives of the animals (Fairclough et al., 2010). Extant colony-forming choanoflagellates have evolved a variety of multicellular structures with sparse cellular topologies and permanent cell-cell bonds. For example, many species form branched, tree-like structures (Leadbeater, 2015), Choanoeca flexa grows as a sheet of cells (Brunet et al., 2019), and Salpingoceca rosetta can form either linear chains or rosettes in which the cells are connected via cytoplasmic bridges formed through incomplete cytokinesis (Dayel et al., 2011). While these growth forms are quite diverse, they all share characteristics (i.e. permanent cellular bonds and sparse topologies) that promote the evolution of cellular differentiation.

The main differences between our work and previous investigations of the effect of group topology on specialization is that we consider the productivity of groups as a whole, not the cells within them, and we consider situations of highly asymmetric sharing. Our approach is general, and can be applied to other systems of trade and specialization, so long as (1) only the aggregate productivity of the group (and not the particles within it) is maximized, (2) the productivity of each particle within the group is a multiplicative function of returns on investment into two (or more) tasks, and (3) there is an asymmetry in how products of those investments are shared. While in this work we have focused on reproductive division of labor, a process in which fecundity returns are not shared at all, we show in the supplement that as long as sharing of two goods is sufficiently asymmetric, specialization with saturating returns on investment can still be adaptive (Appendix 1—figure 2).

Finally, we note that alternative paths to specialization likely exist. For example, cells at different positions in a group may experience different local environments, which may produce cells with varied fecundity-viability trade-offs. A previous paper demonstrated that the evolution of specialization is favored if these ‘positional effects’ result in an initially heterogeneous population of cell types (Tverskoi et al., 2018). However, these positional effects were considered for the case of well-mixed groups (i.e. completely connected network topologies). We thus anticipate that future work examining the relationship between cellular interaction topology and cellular heterogeneity (as well as a wide range of complex and varied relationships between viability, fecundity, and multicellular fitness) will provide unique insight into the origin and diversity of multicellular forms.

Conclusion

We explored the evolution of reproductive specialization in multicellular groups with various cellular interaction topologies. Our results demonstrate that group topological structure can play a key role in the evolution of reproductive division of labor. Indeed, within a broad class of sparsely connected networks, specialization is favored even when the returns from cooperation are saturating (i.e. concave); this result is in direct contrast to the prevailing view that accelerating (i.e. convex), returns are required for natural selection to favor increased specialization (Cooper and West, 2018; Michod et al., 2006; Ispolatov et al., 2012; Solari et al., 2013; Michod, 2007; West et al., 2015). Our results underscore the central importance of life history trade-offs in the origin of reproductive specialization (Michod et al., 2006; Michod, 2007; Hammerschmidt et al., 2014; van Gestel and Tarnita, 2017; Noh et al., 2018), and support the emerging consensus that evolutionary transitions in individuality are not necessarily highly constrained (Ratcliff et al., 2012; Ratcliff et al., 2017; Fairclough et al., 2010; Brunet and King, 2017; Pennisi, 2018; Black et al., 2019; Rose et al., 2020; van Gestel and Tarnita, 2017; Black et al., 2019; Staps et al., 2019; Grosberg and Strathmann, 2007).

Materials and methods

Analysis

The gradient of the fitness with respect to the group investment strategy $v→$, is

(12) $\frac{\mathrm{?}W}{\mathrm{?}\stackrel{\to }{v}}=\sum _{k=1}^{N}{\stackrel{^}{e}}_{k}\alpha \left({v}_{k}^{\alpha ?1}\sum _{j=1}^{N}{c}_{kj}\left(1?{v}_{j}{\right)}^{\alpha }?\left(1?{v}_{k}{\right)}^{\alpha ?1}\sum _{j=1}^{N}{c}_{jk}{v}_{j}^{\alpha }\right)$

where $e^k$ is a unit vector in the $kth$ direction. First notice that if $??=??T$, and $v→=121→$ where $1→$ is a vector of ones, then the gradient is zero. This strategy, $v→=121→$, corresponds to the ‘generalist’ strategy, where every cell invests equally into both tasks. Second, notice that if $??≠??T$ then the gradient is not zero under the generalist strategy, so at least some degree of specialization must be necessary to maximize fitness. To determine the stability of this solution we examine $??*$, the Hessian (see SI Equation 3) evaluated at the generalist critical point. If $??*$ is negative definite, then the generalist strategy is a fitness maximum and is therefore an optimal strategy. If, on the other hand, $??*$ has both positive and negative eigenvalues then the generalist strategy lies at a saddle point within the fitness landscape, and therefore the optimal strategy must be somewhere else in (or on the boundary of) the domain (i.e. $vi∈[0,1]$ for all $i∈1,2,…?N$). Finally, note that $??*$ is never positive definite since $1→$ is always an eigenvector with negative eigenvalue (when $??=??T$).

We also use the zero crossing of the largest eigenvalue of $??*$ evaluated at $v→=121→$ and $β=1$ as an overall measure of how amenable a network is to specialization, even when $??≠??T$.

Evolutionary simulations

Request a detailed protocol

Our evolutionary simulations maintain the same overall structure as the Wright-Fisher model: a discrete-time Markov chain framework with fitness-weighted multinomial sampling between generations and constant population size. Therefore we refer to them as Wright-Fisher evolutionary simulations. We initialize a population of $??=1000$ groups, each of group size $N=10$, with uniform random investment strategies. We then let them evolve for 1000 generations, selecting offspring according to the relative fitness of each group (see Equation 1). At each generation, there is a 2% chance for a mutation to a given group’s investment strategy $v→$. If a mutation occurs, a new investment strategy is selected from a truncated multivariate gaussian distribution centered at the current (pre-mutation) investment strategy and with standard deviation equal to $110?v→$. After mutations each group’s fitness is calculated according to Equation 1, and the population is ranked according to fitness. Finally, $??$ groups are selected (with replacement) to populate the next generation, according to a multinomial distribution weighted by the groups’ fitness ranks.

Measuring specialization

Request a detailed protocol

To quantify the degree of specialization associated with a given group’s optimal investment strategy—the one which maximizes the fitness—we introduce the following metric, which we refer to simply as ‘Specialization’:

(13) $\mathrm{??}=\frac{2}{N}?\sum _{i=1}^{N}\left(\mathrm{max}?\left({v}_{i},1-{v}_{i}\right)-\frac{1}{2}\right).$

Specialization ranges from 0 (for groups consisting of cells investing equally in functions v and b) to 1 for groups consisting of cells investing exclusively in either function.

Code availability

Request a detailed protocol

All evolutionary simulations and other computations associated with this work are available at github.com/dyanni3/topologicalConstraintsSpecialization?(Yanni, 2020;?copy archived at https://github.com/elifesciences-publications/topologicalConstraintsSpecialization).

Appendix 1

Analysis

As described in the main text, the fitness for a group of $N$ individuals is defined as

(1) $W=\sum _{i=1}^{N}\sum _{j=1}^{N}{b}_{i}^{\alpha }?{c}_{j?i}?{v}_{j}^{\alpha }$

and the gradient of the fitness with respect to the group investment strategy $v→$, is

(2) $\frac{\mathrm{?}W}{\mathrm{?}\stackrel{\to }{v}}=\sum _{k=1}^{N}{\stackrel{^}{e}}_{k}\alpha \left({v}_{k}^{\alpha ?1}\sum _{j=1}^{N}{c}_{kj}\left(1?{v}_{j}{\right)}^{\alpha }?\left(1?{v}_{k}{\right)}^{\alpha ?1}\sum _{j=1}^{N}{c}_{jk}{v}_{j}^{\alpha }\right)$

where $e^k$ is a unit vector in the $kth$ direction.

Hessian

The Hessian $?2?W??vk???v?$ is

(3) ${H}_{kl}=\left\{\begin{array}{ll}?{\alpha }^{2}\left({v}_{k}^{\alpha ?1}{c}_{k?}\left(1?{v}_{?}{\right)}^{\alpha ?1}+\left(1?{v}_{k}{\right)}^{\alpha ?1}{c}_{?k}{v}_{l}^{\alpha ?1}\right)& k\ne ?\\ \\ ?2{\alpha }^{2}\left({v}_{k}^{\alpha ?1}{c}_{kk}\left(1?{v}_{k}{\right)}^{\alpha ?1}\right)\\ +\left(\left(\alpha \right)\left(\alpha ?1\right){v}_{k}^{\alpha ?2}\sum _{j}{c}_{kj}\left(1?{v}_{j}{\right)}^{\alpha }\right)\\ +\left(\left(\alpha \right)\left(\alpha ?1\right)\left(1?{v}_{k}{\right)}^{\alpha ?2}\sum _{j}{c}_{jk}{v}_{j}^{\alpha }\right)& k=?\end{array}$

Of particular interest for us is the value of the Hessian at the generalist strategy when $??=??T$. In that case

(4) ${\mathbf{??}}^{*}=\alpha ?{\left(\frac{1}{2}\right)}^{2?\alpha -3}?\left[-\alpha ?\beta ?\mathbf{??}+\left(\alpha ?\beta -1\right)?\mathbf{??}\right].$

where $??$ is the row-normalized adjacency matrix of the network. If $??$ is the network’s adjacency matrix then?

${a}_{i?j}=\frac{{A}_{i?j}}{{\sum }_{j=1}^{N}{A}_{i?j}}.$

The case when $??=??T$

As noted above, when $??=??T$, the generalist strategy is always a critical point where $??W??v→=0$. To determine the stability of this solution we examine $??*$ (Equation 4). If $??*$ is negative definite, then the generalist strategy is a fitness maximum and is therefore an optimal strategy. If, on the other hand, $??*$ has both positive and negative eigenvalues then the generalist strategy lies at a saddle point within the fitness landscape, and therefore the optimal strategy must be somewhere else in (or on the boundary of) the domain (i.e. $vi∈[0,1]$ for all $i∈1,2,…?N$). Finally, note that $??*$ is never positive definite (when $??=??T$). Consider $H?1→$:

$\begin{array}{lc}& {\mathbf{H}}^{?}\stackrel{\to }{1}=\alpha {\left(\frac{1}{2}\right)}^{2\alpha ?3}\left[?\alpha \beta \mathbf{a}\stackrel{\to }{1}+\left(\alpha \beta ?1\right)\mathbf{I}\stackrel{\to }{1}\right]\\ & {\mathbf{H}}^{?}\stackrel{\to }{1}=\alpha {\left(\frac{1}{2}\right)}^{2\alpha ?3}\left[?\alpha \beta \stackrel{\to }{1}+\left(\alpha \beta ?1\right)\stackrel{\to }{1}\right]\\ & {\mathbf{H}}^{?}\stackrel{\to }{1}=?\alpha {\left(\frac{1}{2}\right)}^{2\alpha ?3}\stackrel{\to }{1}.\end{array}$

Note $a1→=1→$ since $??$ is row-normalized. Furthermore, $α>0$, so $1→$ is always an eigenvector of $??*$ with a negative eigenvalue.

We can next ask, under what conditions is $??*$ negative definite? This will depend on the group topology, the nonlinear returns on investment $α$, and the interaction strength $β$. We examine three cases: the neighbor graph, the balanced bipartite graph, and the complete graph.

Appendix 1—table 1

When $??=??T$, the matrix $??*$ is a special type of matrix called a circulant matrix, with well known properties. Its eigenvalues are given by the discrete Fourier transform of its first row. The $kth$ eigenvalue is

${\lambda }_{k}=\sum _{j=0}^{N?1}{H}_{1j}^{?}{e}^{\frac{2\pi i}{N}jk}.$

For the ring topology with $N=10$, for example

${\lambda }_{k}=\alpha {\left(\frac{1}{2}\right)}^{2\alpha ?3}\left(\left(?1+\frac{2\alpha \beta }{3}\right)?\frac{\alpha \beta }{3}{e}^{\frac{\pi i}{5}k}?\frac{\alpha \beta }{3}{e}^{\frac{9\pi i}{5}k}\right),$

which has its maximum when $k=5$,?

$\underset{k}{\mathrm{max}}?{\lambda }_{k}=\alpha ?{\left(\frac{1}{2}\right)}^{2?\alpha -3}?\left(-1+\frac{4?\alpha ?\beta }{3}\right).$

The maximum eigenvalue for the balanced bipartite and complete graphs can be computed similarly.

Evolution of resource sharing

Here we model the co-evolution of sharing and specialization. We start with generalists that do not share at all. We then allow the amount of sharing and the degree of specialization to evolve. As described in the main text, during every round, each group in the population has a 2% chance that one if its cells will mutate and change how much ‘viability’ it shares. When this occurs, the fraction of its output to retain is chosen from a Gaussian with standard deviation of 10% centered on the current value. Whatever is not retained is shared equally across its interactions. The degree of specialization evolves as in simulations described in the main text.

Results are shown in Appendix 1—figure 1, for neighbor topologies, balanced bipartite topologies, and for a complete network.

Appendix 1—figure 1

General case of sharing two resources

We have so far focused on reproductive specialization, wherein the returns from one type of task (reproduction) are completely unshared while returns from another task (viability) are shared according to some functional interaction strength $β$. Here, we generalize somewhat to consider the returns from two arbitrary tasks which may each be shared to some extent, given by functional interaction strengths $(β1,β2)$. For notational continuity we will continue to refer to the investment in those tasks as $b$ and $v$, and for tractability we will continue to assume that $α1=α2$ and that there is a single topology governing who can trade with whom within the group. Of course, further generalizations could be made — e.g. each task could experience different returns on investment, there could be an arbitrary number of tasks, the availability of trading partners could differ between tasks, etc. However, we hope to show by this relatively modest generalization that there is nothing unique to reproductive tasks whose fruits are totally unshared that leads to specialization under regimes of sublinear return on investment.

The fitness function is modified so that

(5) $W=\left(\mathbf{??}_{??}{}^{T}?{\stackrel{\to }{b}}^{\alpha }\right)?\left(\mathbf{??}_{??}{}^{T}?{\stackrel{\to }{v}}^{\alpha }\right)$

which yields the following Hessian at the generalist critical point (for the neighbor, balanced bipartite, and complete networks)

${\mathbf{H}}_{ab}^{?}=\alpha {\left(\frac{1}{2}\right)}^{2\alpha ?2}\left(?2\alpha \left({\mathbf{c}}_{\mathbf{1}}?{{\mathbf{c}}_{\mathbf{2}}}^{\text{T}}{\right)}_{ab}+\left(\alpha ?1\right)\left[\left({\mathbf{c}}_{\mathbf{1}}?{{\mathbf{c}}_{\mathbf{2}}}^{\text{T}}?\stackrel{\to }{1}{\right)}_{a}+\left({\mathbf{c}}_{\mathbf{1}}?{{\mathbf{c}}_{\mathbf{2}}}^{\text{T}}?\stackrel{\to }{1}{\right)}_{b}\right]{\delta }_{ab}\right),$

where

(6) ${\mathbf{c}}_{\mathbf{1}}={\beta }_{1}\mathbf{a}+\left(1?{\beta }_{1}\right)\mathbf{I}$
(7) ${\mathbf{c}}_{\mathbf{2}}={\beta }_{2}\mathbf{a}+\left(1?{\beta }_{2}\right)\mathbf{I}$

and,

${a}_{i?j}=\frac{{A}_{i?j}}{{\sum }_{j=1}^{N}{A}_{i?j}},$

where, as above, $??$ is the graph’s adjacency matrix (including self loops).

We see that for a given topology the adjacency matrix is fixed, so that $????$ and $????$ differ only in their functional interaction strengths $β1$ and $β2$. Therefore the maximum fitness strategy, specified by the vector $v→*$, for a given group will depend under our model on the following parameters:

$\begin{array}{ll}& \mathbf{A}\text{?}\to \text{?}\mathrm{A}\mathrm{d}\mathrm{j}\mathrm{a}\mathrm{c}\mathrm{e}\mathrm{n}\mathrm{c}\mathrm{y}\text{?}\mathrm{m}\mathrm{a}\mathrm{t}\mathrm{r}\mathrm{i}\mathrm{x},\text{?}\mathrm{s}\mathrm{p}\mathrm{e}\mathrm{c}\mathrm{i}\mathrm{f}\mathrm{i}\mathrm{e}\mathrm{s}\text{?}\mathrm{t}\mathrm{o}\mathrm{p}\mathrm{o}\mathrm{l}\mathrm{o}\mathrm{g}\mathrm{y}\\ & \beta {}_{1}\text{?}\to \text{?}\mathrm{F}\mathrm{u}\mathrm{n}\mathrm{c}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}\mathrm{a}\mathrm{l}\text{?}\mathrm{i}\mathrm{n}\mathrm{t}\mathrm{e}\mathrm{r}\mathrm{a}\mathrm{c}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}\text{?}\mathrm{s}\mathrm{t}\mathrm{r}\mathrm{e}\mathrm{n}\mathrm{g}\mathrm{t}\mathrm{h}\text{?}\mathrm{o}\mathrm{f}\text{?}\mathrm{r}\mathrm{e}\mathrm{s}\mathrm{o}\mathrm{u}\mathrm{r}\mathrm{c}\mathrm{e}\phantom{\rule{thinmathspace}{0ex}}1\\ & \beta {}_{2}\text{?}\to \text{?}\mathrm{F}\mathrm{u}\mathrm{n}\mathrm{c}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}\mathrm{a}\mathrm{l}\text{?}\mathrm{i}\mathrm{n}\mathrm{t}\mathrm{e}\mathrm{r}\mathrm{a}\mathrm{c}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}\text{?}\mathrm{s}\mathrm{t}\mathrm{r}\mathrm{e}\mathrm{n}\mathrm{g}\mathrm{t}\mathrm{h}\text{?}\mathrm{o}\mathrm{f}\text{?}\mathrm{r}\mathrm{e}\mathrm{s}\mathrm{o}\mathrm{u}\mathrm{r}\mathrm{c}\mathrm{e}\text{?}2\\ & \alpha \text{?}\to \text{?}\mathrm{S}\mathrm{p}\mathrm{e}\mathrm{c}\mathrm{i}\mathrm{a}\mathrm{l}\mathrm{i}\mathrm{z}\mathrm{a}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}\text{?}\mathrm{p}\mathrm{o}\mathrm{w}\mathrm{e}\mathrm{r},\text{?}\mathrm{a}\mathrm{s}\mathrm{s}\mathrm{u}\mathrm{m}\mathrm{e}\mathrm{d}\text{?}\mathrm{t}\mathrm{o}\text{?}\mathrm{b}\mathrm{e}\text{?}\mathrm{e}\mathrm{q}\mathrm{u}\mathrm{a}\mathrm{l}\text{?}\mathrm{f}\mathrm{o}\mathrm{r}\text{?}\mathrm{r}\mathrm{e}\mathrm{s}\mathrm{o}\mathrm{u}\mathrm{r}\mathrm{c}\mathrm{e}\text{?}1\text{?}\mathrm{a}\mathrm{n}\mathrm{d}\text{?}2\end{array}$

We demonstrate the effect of these parameters on the optimal strategy by finding the minimum value of $α$ for which specialization becomes favored, which we denote $α*$, for a given pair $(β1,β2)$ and given topology. The results are shown in Appendix 1—figure 2.

Appendix 1—figure 2

Jensen’s inequality and sparse topologies and asymmetric sharing

To understand how average fitness decouples from average $v$ and average $b$ for sparse topologies and asymmetric sharing, consider a ring of four cells in three different configurations: one that alternates between viability and fecundity specialists, one in which like-specialists are connected to each other, and one in which all cells are generalists (pictured below). For simplicity, we will set $β=1$, and we will initially consider the case when $α=1$. When $α=1$, Jensen's inequality tells us that generalists and specialists will be equally productive. Classically, this would suggest that specialists and generalists should have the same fitness.

And, indeed, all three cases have the same average v and the same average b (?? for each). However, the average fitnesses are all different.

Next, we consider the same three configurations, but with $α=0.9$. Jensen's inequality tells us that for this value of $α$, generalists should have a higher average v and average b. Indeed, the average v and b is higher for generalists than for specialists: 0.536 versus 0.5. However, the average fitness of generalists, 0.287, is still lower than the average fitness of alternating specialists, 0.333.

These examples show that Jensen's inequality still holds, and still correctly tells us which configuration has the highest average v and average b. However, average v and average b are no longer directly proportional to average fitness. Therefore Jensen's inequality does not directly inform average fitness, and we should not expect convex ROI functions to be required for specialists to be favored.

Star graphs

Let $Wg$ be the fitness for the star shaped network group of generalists ($v=0.5$, $b=0.5$) and $Ws$ be the fitness of specialists (all of the points of the star get $v=1$, $b=0$ and the central point gets $v=0$, $b=1$).

Next assume there are $N$ cells on the points of the star and 1 cell in the center of the star. We then have:

(8) ${W}_{s}=\frac{N?\beta }{2}$

as the only individual with nonzero fitness is the central individual (all others have $b=0$). The central individual’s fecundity returns are $1α=1$, and it’s own viability returns are?0. However, the central individual gets shared $β2$ of each of the $N$ other individuals’ viability returns (which are $1α=1$ each).

Next, for generalists, we have

(9) ${W}_{g}=N?{\left(\frac{1}{2}\right)}^{2?\alpha }?\left(1-\beta +\frac{\beta }{2}+\frac{\beta }{N}\right)+{\left(\frac{1}{2}\right)}^{2?\alpha }?\left(\frac{\beta }{N}+1-\beta +N?\frac{\beta }{2}\right).$

The term on the left of Equation 9 comes from the fact that there are $N$ individuals each sharing $β2$ of their viability returns (which is $(12)α$ each) with themselves, and are getting $βN$ of the central individual’s $(12)α$ viability returns shared with them. Additionally, they are getting $1-β$ of their own viability returns (withheld from sharing). Finally, each of their fecundity returns is $(12)α$.

The term on the right of Equation 9 represents the contribution to the group fitness of the single central individual. That individual gets $β2*(12)α$ of viability returns shared to it $N$ times, and it also shares with itself and keeps a portion of its returns for itself. And it has a fecundity return of $(12)α$.

Star topologies in the limit of large N

We first examine Equation 9 in the limit where $N?1$:

(10) ${W}_{g}\approx N?{\left(\frac{1}{2}\right)}^{2?\alpha }?\left(1-\beta +\frac{\beta }{2}\right)+{\left(\frac{1}{2}\right)}^{2?\alpha }?\left(N?\frac{\beta }{2}\right)$

which reduces to

(11) ${W}_{g}\approx N?{\left(\frac{1}{2}\right)}^{2?\alpha }?\left(1-\beta +\frac{\beta }{2}+\frac{\beta }{2}\right)$

and finally

(12) ${W}_{g}\approx N?{\left(\frac{1}{2}\right)}^{2?\alpha }.$

To understand if generalists or specialists are favored we examine the ratio of generalist to specialist fitness $WgWs$.

(13) $\frac{{W}_{g}}{{W}_{s}}=\frac{\beta }{{2}^{\alpha +1}}.$

This means $Wg>Ws$ if $β>2α+1$. Since $β$ and $α$ are both bounded between?0?and 1, this is never achievable. Therefore, at large $N$, specialists are always favored.

Star topologies in general

Alternately, if do not assume $N$ is large, but set $β=1$ for simplicity, we can solve for the largest $α$ which would yield equal fitness to generalists and specialists:

(14) ${\alpha }_{m?a?x}=-\frac{1}{2}?{\mathrm{log}}_{2}?\left(\frac{2?{N}^{2}}{2?{N}^{2}+2?N+8}\right).$

Rings with even and odd numbers of cells

In the main text we plot $α*$ for ring graphs with even numbers of cells. We made this choice as specialization is slightly frustrated when there are odd numbers of cells. Rings with odd numbers of cells must have at least one location at which like specialists are connected, thus slightly increasing the value of $α*$ compared to a ring with an even number of cells (Appendix 1—figure 3). However, as the size of the graph increases, a single frustrated pairing matters less and less, and the value of $α*$ for rings with odd numbers and large N appears to approach the value of $α*$ for rings with even numbers of cells.

Appendix 1—figure 3

References

1. 1
2. 2
3. 3
4. 4
5. 5
6. 6
7. 7
8. 8
9. 9
10. 10
11. 11
12. 12
13. 13
14. 14
15. 15
16. 16
Independent emergence of complex multicellularity in the brown and red algae (2015)
In: I Ruiz-Trillo, A Nedelcu, editors. Evolutionary Transitions to Multicellular Life. Springer. pp. 335–361.
https://doi.org/10.1007/978-94-017-9642-2_16
17. 17
18. 18
19. 19
20. 20
21. 21
22. 22
23. 23
The New Palgrave Dictionary of Economics 1–8:924–929.
https://doi.org/10.1057/978-1-349-95121-5
24. 24
25. 25
26. 26
27. 27
28. 28
Hierarchical complexity and the size limits of life (2017)
Proceedings of the Royal Society B: Biological Sciences 284:20171039.
https://doi.org/10.1098/rspb.2017.1039
29. 29
30. 30
Division of labour and the evolution of multicellularity (2012)
Proceedings of the Royal Society B: Biological Sciences 279:1768–1776.
https://doi.org/10.1098/rspb.2011.1999
31. 31
32. 32
33. 33
34. 34
The multiple origins of complex multicellularity (2011)
Annual Review of Earth and Planetary Sciences 39:217–239.
https://doi.org/10.1146/annurev.earth.031208.100209
35. 35
36. 36
37. 37
38. 38
Organismal size, metabolism and the evolution of complexity in metazoans
(2005)
Evolutionary Ecology Research 7:681–696.
39. 39
Biodiversity and body size are linked across metazoans (2009)
Proceedings of the Royal Society B: Biological Sciences 276:2209–2215.
https://doi.org/10.1098/rspb.2009.0245
40. 40
41. 41
42. 42
43. 43
44. 44
45. 45
46. 46
Transitions in individuality (1997)
Proceedings of the Royal Society of London. Series B: Biological Sciences 264:853–857.
https://doi.org/10.1098/rspb.1997.0119
47. 47
48. 48
49. 49
50. 50
51. 51
52. 52
53. 53
Beyond society: the evolution of organismality (2009)
Philosophical Transactions of the Royal Society B: Biological Sciences 364:3143–3155.
https://doi.org/10.1098/rstb.2009.0095
54. 54
55. 55
56. 56
57. 57
Nascent life cycles and the emergence of higher-level individuality (2017)
Philosophical Transactions of the Royal Society B: Biological Sciences 372:20160420.
https://doi.org/10.1098/rstb.2016.0420
58. 58
59. 59
60. 60
61. 61
62. 62
63. 63
64. 64
65. 65
66. 66
67. 67
Correlated pay-offs are key to cooperation (2016)
Philosophical Transactions of the Royal Society B: Biological Sciences 371:20150084.
https://doi.org/10.1098/rstb.2015.0084
68. 68
69. 69
70. 70
71. 71
72. 72
73. 73
74. 74
75. 75

Decision letter

1. Raymond E Goldstein
Reviewing Editor; University of Cambridge, United Kingdom
2. Diethard Tautz
Senior Editor; Max-Planck Institute for Evolutionary Biology, Germany
3. Pierrick Bourrat
Reviewer; MacQuarie University, Australia

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The evolution of germ-soma differentiation is one of the most fundamental questions in evolutionary biology, and the present paper investigates the consequences of altering one of the most basic assumptions: the traditional (symmetric) division of labor that has been studied from biology to economics. The authors consider a diversity of network structures and fitness functions and they find that sparser networks lead to higher levels of specialization.

Decision letter after peer review:

Thank you for submitting your article "Topological constraints in early multicellularity favor reproductive division of labor" for consideration by eLife. Your article has been reviewed by two peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Diethard Tautz as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Pierrick Bourrat (Reviewer #2).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

This is a very interesting paper on the evolution of germ-soma differentiation in which the authors consider the topology of interactions between the cells that make up the whole. They find that classical considerations about the convexity or concavity of certain functions characterizing the advantages of specialization no longer hold when the network topology is nontrivial. And such topologies are indeed found in nature. The reviewers were generally very supportive of the work but raised a number of points that need to be addressed in a revised manuscript.

Essential revisions:

1) The authors use a notion of fitness in which clonal cells can have different fitnesses, or more accurately, clonal groups can have different fitness. We know that there are some precedents in the literature, but this notion of fitness does not correspond to the notion of fitness one can associate with natural selection. To illustrate why, consider the following plant example. Take a single genet with two ramets in two environmental patches, one rich and one poor. Each ramet might adopt a very different developmental strategy from the other, considering the ecological constraints it is subjected to. These two strategies would nevertheless not be heritable in the sense that two offspring ramets put in the same environmental patch would develop the same developmental strategies (excluding noise). Thus, the differential success of each ramet is not an evolutionary success that can be associated with natural selection. This is a case analogous to the one presented by the authors. The notion of fitness they refer to seems to be rather the notion of realized fitness. This has no implications for the author's results per se but instead leads to an interpretation in which natural selection is not at work for explaining the division of labor in situations of concavity.

2) Related to the previous point, there seems to be a tension between, on the one hand, the claim that a concave function can lead to an increase in reproductive specialization, and on the other hand, claims that it has something to do with fitness. Fitness is about expected values, and in a situation of concave function, two or more cells specializing would yield a lower collective fitness than when not specializing. From a purely analytical point of view (i.e., Fisher's fundamental theorem), this seems impossible. So my question to the authors is whether there is not hidden somewhere a convex function, which is the relevant one for the evolutionary dynamics observed. Otherwise, what is the ecological explanation of such a result? There must be some ecological constraints that give rise to this phenomenon, and it would be good to know what the authors think they are.

3) There are well-known cases presented in the population genetics literature in which Fisher's fundamental theorem seems violated, but this is because of the environment (including the social environment) changes over time, such as frequency-dependent effects on an individual's success. We wonder if the results of the authors could not related to this literature in some way.

4) The model description is a bit abstract and occasionally hard to follow. It would be great to have fecundity and viability defined, and even better to have some real biological example of what returns on viability might mean and how they might be shared (I don't find the filamentous fungi example informative, at least not in the way it is written). That would also help the reader understand why there are returns on viability but not on fecundity. That the vi vector is the "group investment strategy" also comes as a surprise and takes a bit for the reader to put it all together. Similarly, the existence of both a general adjancecy matrix and of a special case one that uses the β, is somewhat confusing the way it's described. If the authors anyway only work with the special case of equal sharing with the non-self neighbors then why not define the 1-β+β/ni quantities as cij when they appear in the text, and then write a fourth eqn for W in [1] that explicitly uses the β. That would certainly help the reader a lot.

5) Results subsection “Fixed resource sharing” first paragraph, we may be getting confused, but how can you vary β in the case when, as is now written, individual i "shares equally among interaction and self terms"? Doesn't this mean that β = 1?

6) “We conjecture that the troughs in Figure 3C, where specialization occurs for the lowest values of, occur when connectivity is just large enough so that a spanning tree is more likely to connect all individuals in the group than not”, we don't fully understand that conjecture: do the authors simply mean that the troughs occur when the random graph becomes connected with probability > 50%? (A spanning tree connects all individuals by definition.)

7) The authors suggest sparsity is the main determinant of whether a network supports reproductive specialization. But, their examples in 1B and 1C (where a ring is sparser than the bipartite network) to us suggest that it is not so much about "sparsity" as it is about "bipartiteness" – or how easy it is to subdivide the nodes into two classes such that most edges go between these two classes (that's what you'd want for specialization anyway, we guess), and that sparse graphs simply have a tendency to be close to bipartite. We suspect that a ring graph with an odd number of vertices will be less conducive to specialization (although you could still alternate germ/soma cells except at one point), and that a star graph where there is one node of degree n-1 and all the others have degree 1 may be an example of a sparse graph where evolving specialization is not so easy (because for this graph it's not clear how to divide the vertices into germ and soma).

8) Related to the previous point: we would be interested if the authors have considered what happens when the optimal strategy is not 1:1 but, say, 1:2. Does that make specialization more difficult? Here we think that, with a few additional simulations, the authors could add a lot to the paper in terms of the ability to connect properties of the graph (beyond comparing some explicit topologies and random graphs of varying sparsity) to its ability to support the evolution of reproductive specialization.

9) Finally, it would be nice to see how the different specialists are distributed on these networks (at least when the specialization is equal to 1). One can infer it, but we think it would visually help the reader to get the gist of how the model works very quickly.

https://doi.org/10.7554/eLife.54348.sa1

Author response

Essential revisions:

1) The authors use a notion of fitness in which clonal cells can have different fitnesses, or more accurately, clonal groups can have different fitness. We know that there are some precedents in the literature, but this notion of fitness does not correspond to the notion of fitness one can associate with natural selection. To illustrate why, consider the following plant example. Take a single genet with two ramets in two environmental patches, one rich and one poor. Each ramet might adopt a very different developmental strategy from the other, considering the ecological constraints it is subjected to. These two strategies would nevertheless not be heritable in the sense that two offspring ramets put in the same environmental patch would develop the same developmental strategies (excluding noise). Thus, the differential success of each ramet is not an evolutionary success that can be associated with natural selection. This is a case analogous to the one presented by the authors. The notion of fitness they refer to seems to be rather the notion of realized fitness. This has no implications for the author's results per se but instead leads to an interpretation in which natural selection is not at work for explaining the division of labor in situations of concavity.

Thank you for the thought-provoking comment. We are not completely sure if we understand your point correctly (and if not, we are happy to continue the discussion), but we are using the concept of fitness in the standard sense. While cells within groups can have different numbers of surviving offspring, in our simplified modeling world, selection does not act on these differences. Instead, we assume that the cells within each group are clonal, and that there can be genetic differences between groups. These genetic differences are responsible for cellular behavior, namely the extent to which they specialize in viability or fecundity tasks. Selection acts at the group level, in a way that is simply proportional to cellular productivity within these groups. Thus, selection acts between groups on differences in group fitness, which is caused by heritable variation in cellular behaviors underlying differentiation that vary between the groups.

However, we appreciate the reviewer’s point. We previously described mutations as affecting individual cells, which would of course mean that groups are no longer clonal. It is more accurate to say that in our simulation model, mutations change the pattern of specialization and sharing at different positions in the group, which we think of as being driven by a heritable developmental program. As with our analytical results, selection only acts on group-level fitness. When model parameters favor specialization, we see initially uniform groups in which every cell is a generalist gradually evolve developmental programs featuring complete reproductive specialization, providing a simulation test of our analytical results. We have thus revised and clarified the description of these simulations. We also stopped referring to the “fitness” of individual cells in the paper, since it is potentially confusing, instead describing the direct consequences of cellular specialization on their productivity (each cell’s ?? ? ??).

In addition to the changes noted in the paragraph above, we modified the model description to read:

“We consider a model of multicellular groups composed of clonal cells that each invest resources into viability and fecundity. Because there is no within-group genetic variation, within-group evolution is not possible, though selection can act on group-level fitness differences. Specifically, we consider the pattern of cellular investment in fecundity and viability, and their sharing of these resources with neighboring cells within the group, to be the result of a heritable developmental program. Thus, selection is able to act on the multicellular fitness consequences of different patterns of cellular behavior within the group.”

We modified the simulation description to read:

“These simulations begin with no resource sharing (i.e., ?? = 0); during every round, each group in the population has a 2% chance that a mutation will impact its developmental program, and the ?? value for one of its cells will change.”

2) Related to the previous point, there seems to be a tension between, on the one hand, the claim that a concave function can lead to an increase in reproductive specialization, and on the other hand, claims that it has something to do with fitness. Fitness is about expected values, and in a situation of concave function, two or more cells specializing would yield a lower collective fitness than when not specializing. From a purely analytical point of view (i.e., Fisher's fundamental theorem), this seems impossible. So my question to the authors is whether there is not hidden somewhere a convex function, which is the relevant one for the evolutionary dynamics observed. Otherwise, what is the ecological explanation of such a result? There must be some ecological constraints that give rise to this phenomenon, and it would be good to know what the authors think they are.

Thank you again for this question. We believe the reviewers have found a very important point that lacked clarity in the previous version of our manuscript. We appreciate the deep question regarding the presence of a hidden convexity. To be frank, we were surprised by this result at first as well. However, this surprise stems from the fact that the mathematical rule connecting specialization and the second derivative of the return on investment (ROI) function does not apply to asymmetric trade on sparse networks.

The following response explains the logic underlying this result and shows why there are no hidden convexities in our model. We apologize for its length, but since it is central to the paper we wanted to make the math absolutely clear.

In our model we make the simple assumption that group fitness is directly proportional to the sum of cellular productivity (each cell’s ?? ? ??) within the group. We see this as having the simplest biological interpretation, groups that generate a larger number of cells can make more progeny (i.e., more groups), all else equal. As a result, while holding the number of cells per group constant, the group with the highest fitness will be composed of cells with the highest average fitness.

To understand why a hidden convexity need not be present here, we will first discuss the mathematical rule that typically connects specialization and the second derivative of the ROI function: Jensen’s inequality. Jensen’s inequality states that for a convex function F(x), the average value of ??(??), ???(??)?, is larger than ??(????), where ???? is the average value of ??. In other words, ???(??)? > ??(????). A corollary of Jensen’s inequality is that the opposite is true for concave functions, i.e., for a concave function ??(??), ???(??)? < ??(????). We assume this is the rule the reviewers refer to when they say “in a situation of concave function, two or more cells specializing would yield a lower collective fitness than when not specializing.”

For the traditional case of fully connected topologies and symmetric sharing, Jensen’s inequality provides a mathematical connection between a group’s fitness and its degree of specialization. With fully connected topologies, each cell shares its returns on ?? and ?? equally with all cells. As a result, all cells end up with ?? equal to the average ?? and ?? equal to the average ??. As a result, each cell’s ?? ? ?? is thus the average of ?? multiplied by the average of ??. Thus, group fitness is directly proportional to average ?? and average ??. Jensen’s inequality guarantees that for convex ROI functions, the average v produced by specialists (???(??)?) is higher than the average ?? produced by generalists (??(????)). The same is true for ??. Since specialists must have larger average ?? and ?? than generalists, group fitness must be higher as well. The same argument holds in reverse for concave functions, i.e., if the ROI function is concave, the average ?? and average ?? are lower for specialists than for generalists so group fitness is lower, too. Thus, for fully connected topologies and symmetric sharing, Jensen’s inequality allows one to connect group fitness to average ?? and average ??.

Crucially, however, the connection between ROI convexity/concavity and fitness is indirect. Jensen’s inequality directly relates the degree of specialization to the average ?? and average ??; Jensen’s inequality does not itself say anything about group fitness. Group fitness is proportional to the average of the product ?? ? ??, which does not have to be directly proportional to average ?? or average ?? (even though for fully connected topologies and symmetric sharing ?????? is directly proportional to ???? and ????).

That Jensen’s inequality connects the degree of specialization to average ?? and average ??, but not group fitness is a crucial distinction. Jensen’s inequality is a mathematical truism, and cannot be violated. Indeed, for any concave ROI function, generalists will produce more ?? and ?? than specialists. This fact is just as true for sparse topologies and asymmetric sharing as it is for fully connected topologies and symmetric sharing. However, for sparse topologies and asymmetric sharing, group fitness is not directly proportional to average ?? and average ??; instead, group fitness strongly depends on network structure as well.

To understand how group fitness decouples from average ?? and average ?? for sparse topologies and asymmetric sharing, consider a ring of four cells in three different configurations: one that alternates between viability and fecundity specialists, one in which like-specialists are connected to each other, and one in which all cells are generalists (pictured in Author response image 1; red cells are viability specialists, blue cells are fecundity specialists, and purple cells are generalists). For simplicity, we will set ?? = 1, and we will initially consider the case when ?? = 1. When ?? = 1, Jensen’s inequality tells us that generalists and specialists will be equally productive. Classically, this would suggest that specialists and generalists should have the same fitness.

Author response image 1

And, indeed, all three cases have the same average ?? and the same average ?? (? for each). However, the group fitnesses are all different. Note, depending on the configuration, ???????? can be greater than, less than, or equal to ??????.Next, we consider the same three configurations, but with ?? = 0.9. Jensen’s inequality tells us that for this value of ??, generalists should have a higher average ?? and average ??. Indeed, the average ?? and ?? is higher for generalists than for specialists: 0.536 versus 0.5. However, the group fitness of generalists, 1.15, is still lower than the group fitness of alternating specialists (i.e., leftmost configuration in the schematic in Author response image 1), 1.333.

These examples show that Jensen’s inequality still holds, and still correctly tells us which configuration has the highest average ?? and average ??. However, average ?? and average ?? are no longer directly proportional to group fitness. Therefore, Jensen’s inequality does not directly inform group fitness, and we should not expect convex ROI functions to be required for specialists to be favored.

Ultimately, the difference between the classic fully connected topologies and the asymmetric sharing / sparse topologies we consider is the ability of viability specialists to preferentially share viability to fecundity specialists. In fully connected topologies, some of the potential benefit from specialization is wasted. Consider the case where viability specialists share viability with other viability specialists: this is entirely unhelpful, two viability specialists (with 0 fecundity) that help each other survive still both have zero fitness. When the return on investment function is linear (and the graph is fully connected), the effect of sharing with the same cell type (as opposed to a complementary cell type) exactly cancels out the benefits of specializing; trade is only beneficial once the return on investment function becomes convex. In this case, specialists make enough extra v and b that groups of specialists do better than groups of generalists.

Again, these results were surprising. To help establish intuition for the role of preferentially connecting unlike specialists, we developed a mean field model. Based on the average number of connections per cell, this model determines the fraction of connections that must be between unlike specialists for the network to support specialization with concave ROI. We find that if unlike specialists are preferentially connected, specialization despite concave ROI should be expected for a wide range of networks. These results are similar to our simulations of randomly generated graphs (see Figure 3 in the main text), in which we also observed specialization for a wide range of parameters. Combined, these observations suggest that specialization, despite concave ROI, does not require precisely designed topologies, but is a general principle applicable to many different network structures. We agree that this was not clear in the original manuscript, and have added this model to the main text as described below.

We added Discussion paragraphs on Jensen’s inequality:

“To understand how specialization can be favored despite concave return on investment (ROI) curves, consider Jensen’s inequality. […] With asymmetric sharing and sparse topologies, Jensen’s inequality still informs the average viability and fecundity produced, but does not directly inform the group fitness.”

We also added a mean field model to help the reader develop intuition for these results:

“Mean field model

Finally, to capture some general principles underlying this phenomenon, we consider a mean-field model with cells, each of which is connected to other cells. […]However, this inequality presents a useful heuristic which can be used to determine if specialization is favored by measuring just a few properties of the graph.”

We also added a Discussion paragraph on directed/wasted effort:

“Finally, we note that the primary benefit of sparsity is that sparse networks are likely to be at least somewhat bipartite. The more bipartite-like a network is, the less effort is wasted, and the easier it is for specialization to be favored.”

Finally we added a section on the four cell network:

“Jensen's inequality and sparse topologies and asymmetric sharing

To understand how average fitness decouples from average ?? and average ?? for sparse topologies and asymmetric sharing, consider a ring of four cells in three different configurations: one that alternates between viability and fecundity specialists, one in which like-specialists are connected to each other, and one in which all cells are generalists. […] Therefore, Jensen’s inequality does not directly inform average fitness, and we should not expect convex ROI functions to be required for specialists to be favored.”

3) There are well-known cases presented in the population genetics literature in which Fisher's fundamental theorem seems violated, but this is because of the environment (including the social environment) changes over time, such as frequency-dependent effects on an individual's success. We wonder if the results of the authors could not related to this literature in some way.

This question seems to be related to the central issue raised above – is there a hidden feature in the model that makes specialization beneficial despite a concave investment function (i.e., by adding convexity somewhere or introducing some frequency-dependent or environmental effect)? Above, we explained the logic of why specialization can be favored despite concave investment functions.

Also, to be clear, we don’t see our work as violating Fisher’s fundamental theorem, which states that selection will increase the mean fitness of a population at a rate proportional to the genetic variation in fitness in the population. Because our groups are clonal, all the genetic variation in our population occurs between groups, not within groups, and selection is acting only on group-level fitness. Thus, Fisher’s fundamental theorem is at a limit within groups (as there is no within-group genetic variation, there is no within-group evolution).

4) The model description is a bit abstract and occasionally hard to follow. It would be great to have fecundity and viability defined, and even better to have some real biological example of what returns on viability might mean and how they might be shared (I don't find the filamentous fungi example informative, at least not in the way it is written). That would also help the reader understand why there are returns on viability but not on fecundity. That the vi vector is the "group investment strategy" also comes as a surprise and takes a bit for the reader to put it all together. Similarly, the existence of both a general adjancecy matrix and of a special case one that uses the β, is somewhat confusing the way it's described. If the authors anyway only work with the special case of equal sharing with the non-self neighbors then why not define the 1-β+β/ni quantities as cij when they appear in the text, and then write a fourth eqn for W in [1] that explicitly uses the β. That would certainly help the reader a lot.

These are great points, and we agree that it is crucial that the model and its biological meaning are clear to the reader.

The dichotomy of viability vs. fecundity was originally used by Michod to partition components of cellular fitness into actions that contribute to keeping a cell alive (viability), and actions that directly contribute to reproduction (fecundity). The intuition underlying this is that multicellular organisms often have evolved to divide labor along these two lines (i.e., reproduction of the organism by germ cells and survival provided by somatic cells), while their unicellular ancestors had to do both. We define viability as an activity (Michod uses the term “effort”) that keeps the cell alive (e.g., investing in cellular homeostasis or behaviors that improve survival), and fecundity more narrowly as effort involved in cellular reproduction itself. At the cellular level, there appears to be a fundamental asymmetry in how viability effort and fecundity effort can be shared among cells: while multicellular organisms readily evolve differentiated cells that are completely reliant on helper cells for survival (i.e., glial cells that support neurons in animals or companion cells that support sieve tube cells in plants), no cell can directly share its ability to reproduce. To better understand the intuition behind this, consider a cell that elongates prior to fission. This cell must grow to approximately twice its original length. Two cells cannot elongate by 50% and then combine their efforts; elongation is an intrinsically single cell effort.

While we believe this was also Michod’s interpretation of “reproductive effort”, this concept, like many in biology, is subject to interpretation. While it is clear the cellular behaviors underlying replication cannot be shared, resources required for reproduction could have been provided by another cell. Fortunately, our main conclusions are general and can accommodate various definitions of reproductive effort, which include sharing. While we present the simplest case (no sharing of reproductive effort) to help explain our paper’s central idea as clearly as possible, we do not need to forbid sharing of reproductive effort in order for specialization to evolve with concave ROI functions. As long as there is a significant asymmetry in how much ?? and ?? are shared, specialization with concave ROI functions will evolve. We explore this generalization in Figure 2. Because much of the effort involved in reproduction is, by its nature, unshareable, we believe that asymmetry in sharing between ?? and ?? is a general feature of multicellular systems. Of course, our results are far more general than the evolution of multicellularity, and should apply to many systems in which entities trade along networks (biological, economic, etc.). We have updated the description in the text of how cells can share viability, illustrating the general point with additional examples.

Finally, we have updated our description of the model to clarify the roles of ?? and ??????.

We modified the “Model” section to read:

“Reproductive specialization can be modeled as the separation of two key fitness parameters, those related to either viability or fecundity, into separate cells within the multicellular organism (13,35). […]Thus, selection is able to act on the multicellular fitness consequences of different patterns of cellular behavior within the group.”

“In other words, ?????? = 1 ? ??, ?????? = ??/???? if cells ?? and ?? are connected, and ?????? = 0 if cells ?? and ?? are not connected.”

5) Results subsection “Fixed resource sharing” first paragraph, we may be getting confused, but how can you vary β in the case when, as is now written, individual i "shares equally among interaction and self terms"? Doesn't this mean that β = 1?

We do not allow cells to give all of their viability returns away, as this would result in cells that are not viable. Cells keep (1 ? ??) of their viability returns, and designate β of their viability returns for sharing. They then split their viability returns designated for sharing into ?? + 1 equally sized portions, give one portion to each of their ?? neighbors, and keep one portion for themselves. In other words, the β portion that is designated “for sharing” is not all given away, but is split equally among the connected cells and the cell itself. We have updated the explanation accordingly.

We removed the sentence “Individual ?? shares ?????? equally among interaction and self terms.” that was on line 146. The model was explained in the previous section, so this statement was ultimately redundant and unclear. Instead, we modified the model explanation to read:

“Cells cannot give away all of their viability returns, as they would no longer be viable; mathematically, we count a cell among its neighbors and thus ensure that they always “share” a positive portion of viability returns with themselves, so that ?????? > 0.”

And

“…and when ?? = 1 cells share everything equally among all connections and themselves.”

6) “We conjecture that the troughs in Figure 3C, where specialization occurs for the lowest values of, occur when connectivity is just large enough so that a spanning tree is more likely to connect all individuals in the group than not”, we don't fully understand that conjecture: do the authors simply mean that the troughs occur when the random graph becomes connected with probability > 50%? (A spanning tree connects all individuals by definition.)

The reviewers are correct that a spanning tree connects all individuals. We were attempting to state this fact using technical language that would be clear to experts (50% probability that a spanning tree was present) as well as with more general language accessible to non-experts (50% probability that all individuals connected). We have updated this statement to be clear to all.

We modified this statement to read:

“We conjecture that the troughs in Figure 3B, where specialization occurs for the lowest values of ??, occur when connectivity is just large enough so that the existence of a spanning tree is more likely than not.”

7) The authors suggest sparsity is the main determinant of whether a network supports reproductive specialization. But, their examples in 1B and 1C (where a ring is sparser than the bipartite network) to us suggest that it is not so much about "sparsity" as it is about "bipartiteness" – or how easy it is to subdivide the nodes into two classes such that most edges go between these two classes (that's what you'd want for specialization anyway, we guess), and that sparse graphs simply have a tendency to be close to bipartite. We suspect that a ring graph with an odd number of vertices will be less conducive to specialization (although you could still alternate germ/soma cells except at one point), and that a star graph where there is one node of degree n-1 and all the others have degree 1 may be an example of a sparse graph where evolving specialization is not so easy (because for this graph it's not clear how to divide the vertices into germ and soma).

Thank you for this point, with which we largely agree; “bipartiteness” is more important than sparsity. Sparsity is important for two reasons. First, as the reviewers suggest, sparse graphs tend to be close to bipartite. Second, sparse topologies are common in nature. Thus, sparsity may be a common natural route to being somewhat close to bipartite. We have modified the text to make this point clear. However, as will be highlighted by our discussion below, additional features of the graph, such as connectivity, can play a role in determining if specialization is favored as well.

We also appreciate the star graph suggestion, which is a very interesting topology. This graph is also relevant to understanding the evolution of asymmetric investment in germ and somatic cells, as is common in among independently-evolved multicellular organisms, like animals, plants, and volvocine green algae. Hopefully after clarifying our model in response to previous questions it is now clear that the star graph can strongly favor specialization. We now work this example in detail in the supplement. Briefly, we will discuss a five cell star here.

Author response image 2

If the central cell were a viability specialist, specialists would have a lower fitness than generalists. However, if the central cell is a fecundity specialist, then the four surrounding viability specialists provide the central cell with enough viability to make this configuration favored. In fact, for large groups (?? > > 1), specialists are always favored – for any ?? and any ??. For the five cell examples shown above, specialists are favored for α values as low as 0.222; a heat map relating degree of specialization to ?? and ?? is shown in Author response image 3.

Finally, the reviewers are correct about rings with even and odd numbers of cells. The effect is small, but it is true that specialization is frustrated in rings with odd numbers of cells. In fact, we cut a discussion of this phenomenon from our manuscript as we worried that it would be a distraction. However, it is now clear that its absence in fact raises questions, so we have restored it.

Author response image 3

Here, ???, the lowest value of ?? for which specialization is favored, is plotted versus the number of cells in the ring, for rings with even and odd numbers of cells.As discussed above, we added a Discussion paragraph that reads:

“Finally, we note that the primary benefit of sparsity is that sparse networks are likely to be at least somewhat bipartite. The more bipartite-like a network is, the less effort is wasted, and the easier it is for specialization to be favored.”

We added a section analytically working out when specialists and generalists are favored for star topologies with various N, ??, and ??.

We added a section discussing even and odd numbered rings.

8) Related to the previous point: we would be interested if the authors have considered what happens when the optimal strategy is not 1:1 but, say, 1:2. Does that make specialization more difficult? Here we think that, with a few additional simulations, the authors could add a lot to the paper in terms of the ability to connect properties of the graph (beyond comparing some explicit topologies and random graphs of varying sparsity) to its ability to support the evolution of reproductive specialization.

Thank you for this interesting suggestion. First, we return to the mean field model introduced in comment 2, but now allow the fraction of fecundity specialists to be ?? (rather than forcing ?? = ?). The model itself is presented in detail in the text changes below; here, we summarize the results.

Our mean field model suggests that a larger proportion of fecundity specialists makes concave specialization easier to achieve. Further, we find that concave specialization is only possible if more than one fourth of cells are fecundity specialists.

We stress here that this is a mean field model, and does not apply to scenarios like the star network, in which cells have very different individual values of ??. If such networks do or do not favor specialization for ?? < 1 will again be a graph coloring problem.

We added a section on varying ratios of specialists:

“Effect of varying ratios of specialists

We now allow the fraction of fecundity specialists to be ?? (rather than forcing ?? = 1/2). […] If such networks do or do not favor specialization for ?? < 1 will again be a graph coloring problem.”

9) Finally, it would be nice to see how the different specialists are distributed on these networks (at least when the specialization is equal to 1). One can infer it, but we think it would visually help the reader to get the gist of how the model works very quickly.

We have visualized investment in different tasks on fully connected, ring, and bipartite graphs. We agree that these images are instructive for the reader.

In Author response image 4 is an image of specialists in a nearest neighbor topology:

Author response image 4

In Author response image 5 for the bipartite network, complete specialization happens readily:

Author response image 5

Author response image 6 for the complete network, generalists dominate with some fluctuations:

Author response image 6

We modified Figure 1 to incorporate these images:

Author response image 7
https://doi.org/10.7554/eLife.54348.sa2

Article and author information

Author details

1. David Yanni

School of Physics, Georgia Institute of Technology, Atlanta, United States
Contribution
Data curation, Software, Formal analysis, Validation, Investigation, Methodology, Writing - review and editing
Contributed equally with
Shane Jacobeen
Competing interests
No competing interests declared
2. Shane Jacobeen

School of Physics, Georgia Institute of Technology, Atlanta, United States
Contribution
Conceptualization, Data curation, Software, Formal analysis, Validation, Investigation, Methodology, Writing - original draft
David Yanni
Competing interests
No competing interests declared
3. Pedro Márquez-Zacarías

1. Interdisciplinary Graduate Program in Quantitative Biosciences, Georgia Institute of Technology, Atlanta, United States
2. School of Biological Sciences, Georgia Institute of Technology, Atlanta, United States
Contribution
Visualization, Methodology, Writing - review and editing
Competing interests
No competing interests declared
4. Joshua S Weitz

1. School of Physics, Georgia Institute of Technology, Atlanta, United States
2. School of Biological Sciences, Georgia Institute of Technology, Atlanta, United States
Contribution
Supervision, Methodology, Writing - review and editing
Competing interests
No competing interests declared
5. William C Ratcliff

School of Biological Sciences, Georgia Institute of Technology, Atlanta, United States
Contribution
Conceptualization, Resources, Supervision, Writing - original draft, Project administration, Writing - review and editing
Contributed equally with
Peter J Yunker
For correspondence
william.ratcliff@gatech.edu
Competing interests
No competing interests declared
6. Peter J Yunker

School of Physics, Georgia Institute of Technology, Atlanta, United States
Contribution
Conceptualization, Resources, Supervision, Funding acquisition, Methodology, Writing - original draft, Project administration, Writing - review and editing
Contributed equally with
William C Ratcliff
For correspondence
peter.yunker@gatech.edu
Competing interests
No competing interests declared

Funding

National Science Foundation (IOS-1656549)

• William C Ratcliff
• Peter J Yunker

National Institutes of Health (GM138030)

• William C Ratcliff

National Science Foundation (BMAT-2003721)

• Peter J Yunker

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Senior Editor

1. Diethard Tautz, Max-Planck Institute for Evolutionary Biology, Germany

Reviewing Editor

1. Raymond E Goldstein, University of Cambridge, United Kingdom

Reviewer

1. Pierrick Bourrat, MacQuarie University, Australia

Publication history

2. Accepted: September 17, 2020
3. Accepted Manuscript published: September 17, 2020 (version 1)
4. Version of Record published: November 3, 2020 (version 2)

? 2020, Yanni et al.

Metrics

• 1,546
Page views
• 223
• 1
Citations

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

A two-part list of links to download the article, or parts of the article, in various formats.

1. Evolutionary Biology
2. Immunology and Inflammation

In silico analysis of myeloid cells across animal kingdom reveal neutrophil evolution by colony stimulating factors

Damilola Pinheiro et al.
Research Article

Neutrophils constitute the largest population of phagocytic granulocytes in the blood of mammals. The development and function of neutrophils and monocytes is primarily governed by the granulocyte colony-stimulating factor receptor family (CSF3R/CSF3) and macrophage colony-stimulating factor receptor family (CSF1R/IL34/CSF1) respectively. Using various techniques this study considered how the emergence of receptor:ligand pairings shaped the distribution of blood myeloid cell populations. Comparative gene analysis supported the ancestral pairings of CSF1R/IL34 and CSF3R/CSF3, and the emergence of CSF1 later in lineages after the advent of Jawed/Jawless fish. Further analysis suggested that the emergence of CSF3 lead to reorganisation of granulocyte distribution between amphibian and early reptiles. However, the advent of endothermy likely contributed to the dominance of the neutrophil/heterophil in modern-day mammals and birds. In summary, we show that the emergence of CSF3R/CSF3 was a key factor in the subsequent evolution of the modern-day mammalian neutrophil.

1. Evolutionary Biology
2. Genetics and Genomics

Recurrent evolution of high virulence in isolated populations of a DNA virus

Tom Hill, Robert L Unckless
Research Article Updated

Hosts and viruses are constantly evolving in response to each other: as a host attempts to suppress a virus, the virus attempts to evade and suppress the host’s immune system. Here, we describe the recurrent evolution of a virulent strain of a DNA virus, which infects multiple Drosophila species. Specifically, we identified two distinct viral types that differ 100-fold in viral titer in infected individuals, with similar differences observed in multiple species. Our analysis suggests that one of the viral types recurrently evolved at least four times in the past ~30,000 years, three times in Arizona and once in another geographically distinct species. This recurrent evolution may be facilitated by an effective mutation rate which increases as each prior mutation increases viral titer and effective population size. The higher titer viral type suppresses the host-immune system and an increased virulence compared to the low viral titer type.