Skip to content
Opens in a new window Opens an external site Opens an external site in a new window
Ecology & Society
  • Current Issue
  • About the Journal
    • Our Editors
    • Policies
    • Submissions
    • Contact
  • Open Access Policy
  • Submit an Article
  • Sign In
Icons/Search
Icons/Close
Icons/Search
Home > VOLUME 31 > ISSUE 1 > Article 12 Research

Divide and explain: novel metrics and procedures for archetype analysis in case-based sustainability research

Eisenack, K., and R. Wang. 2026. Divide and explain: novel metrics and procedures for archetype analysis in case-based sustainability research. Ecology and Society 31(1):12. https://doi.org/10.5751/ES-16627-310112
Download PDF Download icon Download Citation Download icon Submit a Response Arrow-Forward icon
Share
  • Twitter logo
  • LinkedIn logo
  • Facebook logo
  • Email Icon
  • Link Icon
  • Klaus EisenackORCIDcontact author, Klaus Eisenack
    Resource Economics Group, Humboldt Universität zu Berlin, Germany
  • Rongyu WangRongyu Wang
    School of Public Affairs, Zhejiang University, Hangzhou, China

The following is the established format for referencing this article:

Eisenack, K., and R. Wang. 2026. Divide and explain: novel metrics and procedures for archetype analysis in case-based sustainability research. Ecology and Society 31(1):12.

https://doi.org/10.5751/ES-16627-310112

  • Introduction
  • Developing Methodology of Formal Concept-based Archetype Analysis
  • Metrics and New Procedures
  • Re-identifying Archetypes from Two Previous Studies
  • Conclusion
  • Responses to this Article
  • Author Contributions
  • Acknowledgments
  • Use of Artificial Intelligence (AI) and AI-assisted Tools
  • Data Availability
  • Literature Cited
  • archetype analysis; classification; climate change adaptation; configurational methods; formal concept analysis; land-use change
    Divide and explain: novel metrics and procedures for archetype analysis in case-based sustainability research
    Copyright © by the author(s). Published here under license by The Resilience Alliance. This article is under a Creative Commons Attribution 4.0 International License. You may share and adapt the work provided the original author and source are credited, you indicate whether any changes were made, and you include a link to the license. ES-2025-16627.pdf
    Research

    ABSTRACT

    Sustainability research often seeks to transfer insights across cases, but the heterogeneity of contexts and outcomes presents significant challenges. What works in one case may fail in another, requiring an approach that combines classification with explanation. Classifying cases into several types, each with a particular explanation, however, involves a trade-off between too broad and too fine-grained classes. Existing studies often address this trade-off in ways that are difficult to reproduce, highlighting the need for more systematic and replicable methods. To address this gap, this study develops quantitative metrics and standard procedures that are replicable across contexts. They enable identifying archetypes from binary data sets using formal concept analysis (FCA). The novel procedures are demonstrated by replicating two previously published archetype analyses on land use and climate change adaptation. We propose three core steps (formal concept analysis, concept filter, theoretical analysis) alongside two optional steps (grouping, optimal concept selection). Key metrics, including consistency, coverage, richness, size, and lift, guide these steps. We show that the procedures enhance reproducibility and speed up analysis compared with previous approaches, and help determine the appropriate number of archetypes to provide more parsimonious research findings. This study thus contributes to methodological rigor in case-based sustainability research by balancing generality and particularity of archetype analysis.

    INTRODUCTION

    In current sustainability research, evaluations of sustainability interventions in land use governance, natural resource management, and other related fields are hampered by the considerable incommensurability of many interventions and heterogeneity of the cases where they are attempted (Vatn 2005, Ostrom 2010, Meyfroidt et al. 2022). For the same reasons, it is also challenging to come up with a general explanation of crucial outcomes, e.g., successful collective action (Ostrom 2007). What works in certain ways in one case might work completely differently in other cases. However, sometimes seemingly different cases still produce similar outcomes.

    Thus, different ways have been explored to address the challenge of dealing with heterogeneity among cases when producing generalized knowledge in sustainability research (e.g., Warren 2002, Ostrom 2007, Rudel 2008, Magliocca et al. 2018, Meyfroidt et al. 2018, Young 2019, Villamayor-Tomas et al. 2020, Castille et al. 2023, Eisenack 2024). If a single valid explanation is not feasible for the full set of cases one is interested in, we can group the cases into different classes (Young et al. 2006). If the cases are accurately and precisely classified (e.g., with a cluster analysis), each class might have its individual explanation. Class-wise explanations can then be tested by conventional methodological standards, but only applied to the separate classes (Ostrom and Cox 2010, Ratajczyk et al. 2016). From a general viewpoint, such approaches combine classification with explanation, as already discussed by Capecchi (1973). This is also implicit in qualitative comparative analysis (QCA; Schneider and Wagemann 2012), as it determines multiple sufficient conditions that are linked by logical disjunction (“OR”). It is also implicit in various studies in sustainability research that employ qualitative differential equations, fuzzy set indicators, or other methods (e.g., Lüdeke et al. 2004, Eisenack et al. 2006, Manuel-Navarrete et al. 2007, United Nations Environment Programme (UNEP) 2007, Eisenack and Roggero 2022).

    Our methodological contribution aims at further improving approaches that combine explanation with classification. This task is also highly relevant for archetype analysis, an innovative approach that is increasingly used in case-based sustainability research (Neudert et al. 2019, Oberlack et al. 2023). Applications include research on land use (e.g., Gong and Tan 2021, Buchadas et al. 2022, Piemontese et al. 2022a), urban climate action (e.g., Roggero et al. 2025a, b), social-ecological systems and biodiversity (e.g., Nagel et al. 2024, Partelow et al. 2024, Harmáčková et al. 2025), or food systems (e.g., Sietz and Neudert 2022, Václavík et al. 2024). The basic idea of archetype analysis is to classify cases into archetypes (which do not need to be mutually exclusive). Each archetype is characterized by particular mechanisms that lead to an outcome (Eisenack et al. 2021). The approach can be operationalized with various methods like QCA (e.g., Sietz and van Dijk 2015, Kimmich and Villamayor Tomas 2019, Kurtmollaiev et al. 2023), cluster analysis (e.g., Levers et al. 2018, Orozco et al. 2024), or formal concept analysis (FCA; Oberlack et al. 2016, Oberlack and Eisenack 2018, Wang et al. 2019, Gotgelf et al. 2020). Yet, good criteria for the number of archetypes (i.e., the number of classes) are still acknowledged as a research gap (Eisenack et al. 2019).

    Choosing the right number of classes is an overarching challenge when combining explanation with classification. A small number of classes has the advantage of a broader generality of the results and can be summarized in a shorter list. However, a small number of large classes might only lead to coarse-grained explanations or even to no explanations at all, e.g., in the extreme case of classifying a complete sample of cases into merely one single class (also called the “nomothetic trap” (Beltz et al. 2016, Lyon et al. 2017)). Small classes might be fine-grained enough to obtain rich explanations, but may lead to confusion in a long list of classes. In the extreme, every case would be its own class, so that there is no generalization at all, also called the “idiographic trap” (Beltz et al. 2016, Lyon et al. 2017). So, what is the best trade-off here? This trade-off is well known in cluster analysis, where various metrics are used to determine the appropriate number of clusters (e.g., Sietz et al. 2017), or in FCA which orders concepts by cardinality (Ganter and Wille 1999). For archetype analysis, a lack of good criteria for the number of archetypes is acknowledged as a crucial gap, as the approach is explicitly made to navigate between the idiographic and the nomothetic trap (Eisenack et al. 2019, Tan and Wang 2025).

    To fill this methodological gap, this paper develops and tests operational and orderly procedures, along with rigorous and replicable metrics, to determine the (number of) classes. The procedures are also more efficient than previous methods that rely on extensive steps requiring potentially arbitrary human judgment. The new procedures can also engender more parsimonious findings. Specifically, we develop the procedures for archetype analyses using FCA (Ganter and Wille 1999), an increasingly used data-analytic approach (e.g., Oberlack and Eisenack 2018, Gotgelf et al. 2020, Harmáčková et al. 2025, Tan and Wang 2025). With FCA, the novel metrics are directly applicable to binary data, as frequently used in case-based sustainability studies. Some steps of the proposed procedure are computational and can be done with conventional software packages, whereas other steps are qualitative. We illustrate the metrics and procedure by replicating the data analysis part of previously published archetype analyses. We demonstrate the particularities of each step, show how to circumvent some possible pitfalls, and assess the procedures’ strengths in improving analytical efficiency and obtaining more parsimonious results. The conclusions further discuss extensions of the procedure, e.g., for other kinds of data or data-analytic methods.

    DEVELOPING METHODOLOGY OF FORMAL CONCEPT-BASED ARCHETYPE ANALYSIS

    As an innovative method for large-sample case study comparison, archetype analysis is increasingly used in sustainability research, and FCA serves as one major data-processing approach to identify archetypes. However, formal concept-based archetype analysis faces several methodological challenges—such as objectivity, replicability, and efficiency—in terms of combining explanation with classification.

    Archetype analysis

    In sustainability research, archetypes are recurrent patterns of interplay among elements of social-ecological systems (SES) that recur in multiple cases (Oberlack et al. 2019). An archetype is an analytical unit that features a combination of various attributes, their interactive mechanisms and outcomes, found in a set of cases. Archetype analysis aims to enable an analysis of various attributes that jointly affect the processes and outcomes of SES to reveal structural relations among these attributes (Tan and Wang 2025). To balance between the nomothetic and the idiographic trap, archetype analysis works on an intermediate level of abstraction: it neither intends to depict all details of cases (no abstraction), nor considers the details of cases as a black box (too abstract). In contrast, it intends to identify the recurrent configurations of attributes following coherent theories, realized in multiple but not necessarily all cases—which is a non-trivial challenge (Eisenack et al. 2021).

    In this regard, archetype analysis combines classification with explanation by determining “recurrent patterns [...] that explain the phenomenon under particular conditions” (Oberlack et al. 2019). Each archetype in a set of identified archetypes (called “suite of archetypes”) is individually characterized by the set of cases that it covers, by a configuration of attributes that is common to the cases, and a mechanism that explains how the configuration leads to specified outcomes (Eisenack et al. 2021). For example, one archetype’s attribute configuration can express the conditions for government-led modes to consolidate rural construction land; this archetype cannot explain the cases where rural households self-organize. Otherwise, the configuration of attributes and outcomes reflected by this archetype may be too general; it might not even be identified as an archetype. Although one archetype should be present in a set of multiple (yet not all) cases, they can also function as building blocks: one case might be in the case set of multiple archetypes. Different aspects of the case might be explained by a combination of multiple archetypes (Eisenack et al. 2025). Archetype analysis thus admits a non-disjoined classification of cases. Importantly in this context, this makes FCA one highly relevant data-analytic method for archetype analysis.

    Formal concept analysis (FCA)

    Formal concept analysis, as developed by Ganter and Wille (1999), is a computational method that classifies data based on set theory and mathematical lattice theory. Here, we present some basics of FCA that are needed in the remainder of the paper.

    In its simplest form, the input to an FCA is a table, with a set of cases (called “objects” in FCA jargon) in rows, and attributes in columns. For instance, cases can be a sample of municipalities in China, and the attributes express whether municipalities have a large area or whether there is a rising rural population (according to criteria defined prior to the assessment of the attributes). The table contains binary entries, which indicate which attributes are present (“hold”) in which cases.

    With these data, the algorithm computes all possible classifications of the cases by their attributes. It does so by determining “concepts.” Each concept is described by its extent and its intent. Its extent is a set of cases that are covered by the concept, and its intent is a set of attributes. The challenge when determining the concepts is that extent and intent need to fit to each other: the extent of a concept contains exactly all cases that have all the intent’s attributes (but not more cases). For instance, a concept might contain all municipalities that are both large and have a rising rural population. In turn, the intent needs to contain all attributes that are common to all cases in the extent. So, if all large and rising-rural municipalities in the sample have a further shared feature, e.g., located in coastal areas, this feature should also be part of the concept’s intent. Thus, concepts are maximal classifications in the sense that they contain as many cases and attributes as possible, if they at least fit to each other in the above sense. Note that those concepts do not need to be mutually exclusive. Their extents can intersect if cases are contained in multiple concepts.

    In addition to the concepts, FCA determines all sub-concept relations between them. A concept S is a sub-concept of concept C if S’s extent is a subset of C’s extent. This is equivalent to C’s intent being a subset of S’ intent. The concept of municipalities that are large, rising-rural, and coastal is a sub-concept of the large and rising-rural municipalities (as not all are coastal). One might say that the sub-concept C is less abstract/more concrete than S.

    When the FCA is done, one can proceed with an archetype analysis by selecting several concepts at the same time to obtain a classification of the cases: each case is included in one of the selected concepts’ extents (or not in any extent). It is important to note that a case can also be in the extent of multiple concepts (the classification does not need to be disjoined). Every selection of multiple concepts is thus a candidate for a suite of archetypes. However, one can only speak about candidates here, because we still need to find a classification that can be combined with an explanation, so is consistent with theory. Explanation is not part of FCA and is addressed further below.

    Methodological challenges

    This paper addresses three main challenges, some of which are common to all research paradigms that combine explanation with classification, and some are particular to formal concept-based archetype analysis.

    First, one needs to properly handle the level of abstraction to avoid nomothetic and idiographic traps. Increasing the number of archetypes depicts the characteristics of data sets and the actual phenomena more completely; however, each archetype may then contain too many attributes and draw too specific implications. Decreasing the number of archetypes (so that they, on average, cover more cases at the expense of comprising fewer attributes) can draw rather replicable and generalizable implications, but tends to neglect certain details deserving further investigation. For example, Wang and Tan (2020) identify eight archetypes of rural collective action, which capture the vital role of local leadership in fostering a self-organized mode to consolidate rural construction land. Although local leadership can be conceptualized in diverse aspects, such as village leaders, village elders, entrepreneurs, or local officials, it is not necessary to specify local leadership separately through more archetypes, and the current number of archetypes in this study identifies a recurrent and stable relation between local leadership and self-organized construction land consolidation. Yet, systematic ways for identifying the appropriate number of archetypes—and thus navigating the levels of abstraction in light of this trade-off—have been considered as one desideratum of archetype analysis (Eisenack et al. 2019).

    Second, the previous literature has already proposed procedures and metrics to identify the appropriate number of archetypes in terms of attribute structures, reappearance of formal concepts, theoretical saliency and coherence, external and internal validity (e.g., Oberlack et al. 2016, Levers et al. 2018, Piemontese et al. 2022b, Partelow et al. 2024, Harmáčková et al. 2025). However, they still face controversial challenges. Specifically, the metrics are either too general or too study specific. On the general side, Eisenack et al. (2019) suggest general quality criteria to guide the design of archetype analyses, suggesting that an archetype analysis should specify the domain of validity for each archetype, ensure multiple archetypes to be combined in different ways to characterize single cases, explicitly navigate through different levels of abstraction, and obtain a fit between its configuration of attributes, theory, and empirical evidence. On the more specific side, Wang and Tan (2020) accomplish the archetype analysis based on a first-hand case set, presenting the process of case collection, coding, categorizing, and archetype identification with rather context-specific criteria. They require that an archetype appears in at least two cases, along with a configuration of at least one diagnostic attribute and one design attribute. Diagnostic attributes are considered contextual and reflect case-dependent conditions (e.g., biophysical conditions of rural construction land or characteristics of rural households), whereas design attributes are considered contextual and reflect case-dependent conditions (information or revenue distribution rules). Furthermore, an archetype must be illustrated by a coherent theory of collective action that provides a rationale for the configuration to occur. Also, sub-archetypes are identified to provide more detailed but not too specific or entirely repetitive information about the possibility of rural collective action. Except for the FCA, other steps for identifying the appropriate number of archetypes are rather qualitative and semi-qualitative, such as inspecting the extents and intents of formal concepts in detail.

    Third, formal concept-based archetype analysis usually generates a large number of formal concepts in the first step, as the scale of a data set enlarges. For instance, Wang et al. (2019) obtained 131 formal concepts from 27 cases, whereas Oberlack and Eisenack (2018) got 216 formal concepts from 114 cases. In practice, researchers scrutinize hundreds of formal concepts by hand to identify the archetypes, impeding the process efficiency (the full data of Harmáčková et al. (2025) with 460 cases, e.g., obtains 982 concepts). We thus aim to propose procedures to speed up this approach.

    In summary, the current procedures show the notable shortages in objectivity, replicability, and efficiency in identifying archetypes and the appropriate number of archetypes.

    METRICS AND NEW PROCEDURES

    To address the above methodological challenges, we introduce several quantitative metrics, and show how they can be used in the procedural steps we propose.

    Quantitative metrics

    The following metrics can be used in various ways to distill concepts and to later identify those that qualify as archetypes. Generally, we denote the number of cases in the data set by n, each described by a subset of the total number of attributes m (more precisely, m is the number of attributes except those that characterize outcomes). Recall that each concept is characterized by its intent (i.e., attributes) and extent (i.e., cases). To combine classification with explanation, we distinguish the outcome attribute(s) we are interested in (those we want to explain) from the remaining attributes. The number of cases that have a chosen outcome attribute “out” is denoted by nout.

    The size of a concept is the number of cases contained in the concept (i.e., the cardinality of its extent). If concepts contain more cases, it might be more reasonable to consider them as important archetypes. Yet, if they are too large, they might miss nuances to represent different kinds of cases. It can be convenient to normalize the size: by borrowing from qualitative comparative analysis (QCA; Ragin 1987, Schneider and Wagemann 2012), we define a concept’s coverage = size/nout (which is below unity if size ≤ nout).

    The richness ρ of a concept is the number of its attributes (i.e., the cardinality of its intent). Concepts with low richness (e.g., ρ = 1 for a single attribute) express little nuance and tend to have a larger size. Usually, archetype analysis is interested in determining configurations of multiple attributes (so, richness is 2 or larger).

    The lift of a concept (borrowed from association rule mining, e.g., McNicholas et al. 2008) combines data on attributes and cases. Let P(ai) = #(ai) / n denote the relative frequency of cases with attribute ai appearing among all cases (possibly together with further attributes). For a concept with more attributes, say an intent set {a1, a2, ..., ai, ..., ak}, it is:

    Equation 1 (1)

    The numerator expresses the relative frequency of all the concept’s attributes appearing together in the cases. The denominator is the expected frequency that the concept’s attributes jointly co-occur exactly as frequently as expected under the assumption that the single attributes are stochastically independent. A lift = 1.0 thus means that the concept appears as frequently as expected. A greater value, above 1.0, means that the concept’s attributes co-occur more frequently than expected. Concepts with a high lift might be considered more surprising, so they warrant more attention when they explain an outcome.

    To get a measure on whether a concept has potential to explain an outcome attribute, one can compute its consistency (again, borrowing from QCA; Schneider and Wagemann 2012, Roggero 2019). Consistency refers to the share of cases in a concept’s extent that also have the outcome attribute out. When considering a concept with intent set {a1, a2, ..., ak}, some but possibly not all cases with these attributes also have the outcome out. We thus define the consistency = #(out AND a1 AND a2 AND ... AND ak)/size. Here, “AND” denotes logical conjunction, so that the consistency is the number of all cases that have the outcome attribute together with all attributes of the concept’s intent, normalized to the concept’s size. So, consistency depends on the outcome attribute because the outcome can appear with or without the other attributes. A consistency of 100% means that all cases in a given concept have the outcome of interest. Such a concept is a good candidate for a strong explanation. With a lower consistency (more common in larger data sets), there are more false positives when we use the concept to explain the outcome.

    The above metrics express quantitative features of single concepts. However, archetype analysis aims to determine a whole suite of multiple archetypes that explain a specific outcome. Some archetypes might cover cases that other archetypes do not speak about. Recall that concepts do not need to be disjoined (their extents can intersect): two concepts taken together (the union of their extents) might contain fewer cases than simply adding both concepts’ sizes. For any selection of concepts, we thus define their joint size, v, by the number of cases that are in the extent of at least one of those concepts (i.e., the cardinality of the union of their extents). When normalized, we have their joint coverage = v/nout. The lower the joint coverage, the more cases are not covered by any concept in the selection. We have now introduced the main metrics. They can be calculated, e.g., by using the fcaR package in R (Lopez Rodriguez et al. 2020; see available scripts on https://github.com/ArchetypeAnalysis/Archetype-Identification).

    Standard procedures

    We propose the following standard procedures to determine the optimal number of archetypes, with core steps and optional steps (Table 1). Core steps should always be taken, whereas the optional steps can be adopted discretionally according to the research purposes, theoretical basis, and structure in the data. The novel metrics for archetype identification are summarized more technically in Fig. 1.

    The first core step (CS1) conducts the FCA. It takes as input the data of all n cases. As attributes, it takes only those that are “not” considered as outcomes to be explained. The FCA results in a list of all concepts. Each selection from this list can be considered as a classification of the data. However, there are usually many ways to classify. The concepts yet differ a lot, for instance by size. More importantly, many will not have explanatory power. Thus, CS1 continues with computing, for every concept, its size, richness, lift, coverage, and consistency. Recall that the value of the last two metrics depends on the chosen outcome attribute.

    These metrics are used in the concept filter in the second core step (CS2). The analyst sets (and reports) thresholds for all metrics (or, depending on the research, only some of these metrics). Only those concepts from CS1 that respect all thresholds are retained, whereas the remainder are discarded. This raises the question of how to set the thresholds. Although this generally depends on the research, some general considerations can be made. Most important for explanatory purposes is consistency, so ideally set to 100%. Then, all cases in the extent of the perfectly consistent concepts also have the outcome attribute we are interested in. Classifying a case with such a concept is sufficient for the outcome. For larger data sets, it might not be possible to find perfectly consistent concepts, possibly because there is so much variation in unobserved variables, or much noise. By referring to conventions from QCA, we propose a minimal consistency threshold of 80% (Ragin 2008).

    For size and coverage, archetype analysis requires each archetype to appear repeatedly. The most relaxed quantification is to only accept concepts with size ≥ 2 or coverage ≥ 2 / nout as archetypes. In larger data sets, however, we might set a higher threshold so that we require concepts to cover more cases, e.g., coverage ≥ 1%. A higher threshold implies that we ignore more unconventional cases.

    For richness, one would require at least ρ ≥ 2 for a conjunctional causation approach (Schneider and Wagemann 2012), as is common in archetype analysis (e.g., Eisenack and Roggero 2022). If the explanatory power of single attributes is worth being checked, ρ ≥ 1 might also be reasonable. A higher threshold has the advantage that we retain only concepts with a richer characterization or which admit more complex theorizing. On the other hand, explanations are stronger if much follows from little. Richer concepts tend to become more difficult to comprehend qualitatively or to explain by theoretical considerations. Thus, an upper limit is recommended for richness, which yet depends on the research. We recommend ρ ≤ m / 2 to avoid certain attributes appearing in very many concepts.

    For lift, a natural lower threshold is 100%. One might yet choose a higher threshold if the research aims for explanations that can be applied more frequently than expected from the single attribute frequency. Depending on the research, lift ≥ 80% can also be acceptable.

    Next comes the theoretical analysis step (CS3), which is most difficult to formalize (partially because it strongly depends on disciplinary standards). The concepts that passed the previous steps only describe which attributes are present in the configurations and not how they are interlinked (although Eisenack and colleagues (2025) discuss ways to consider this in an FCA), and not even why they are interlinked (possibly even in a dynamic way). Thus, the aim of CS3 is to provide theoretical explanations. All the remaining concepts are inspected according to whether they illustrate established theories, corroborate theoretical inferences, are supported by case-based knowledge, and align with or extend the relevant literature. We obtain the final suite of archetypes by only keeping those concepts that pass this test. Such a test might be more or less easy. However, the advantage of our proposed procedure is that the previous steps have considerably reduced the list of concepts that warrant such a theorizing. So, one can concentrate this effort on those concepts that are repeatedly sufficient for the outcome. Theorizing can be further supported by findings from the previous steps. For instance, we can focus on less rich concepts first and later check whether sub-concepts of those qualify as sub-archetypes. Although we accept overlap between concepts, we might justify retaining those that are less repetitive in terms of other concepts. Sometimes, it can be useful to sort the m attributes into different kinds (e.g., referring to bio-physical or to socio-economic features) and only accept concepts as archetypes if they contain attributes of two kinds (e.g., in Gotgelf et al. 2020).

    It may still be possible that too many concepts pass CS2 and CS3 or that too many concepts pass CS2 to accomplish CS3. We then propose two optional steps that can be done before or after some of the core steps.

    Grouping (OS1) is a semi-qualitative optional step. It divides cases into groups defined by the analyst, according to their outcomes, critical attributes of interest, or research questions. The data are split into subsets by group. The other steps are then done for each group separately. The fewer cases in each group, the smaller the number of concepts determined with FCA. However, the number of groups should be small enough to be manageable in the subsequent steps. Grouping is possible before CS1 or between CS2 and CS3.

    Optimal concept selection (OS2) optimizes the joint size v (ρ, N) for a given richness ρ and N. This can now build on tested algorithms (Harmáčková et al. 2025, Partelow et al. 2024). As input, it takes a set of concepts that have passed previous tests, and then determines optimal subsets (selections of a given number of N concepts). In other words, we want to cover as many cases as possible within the union of a small selection of concepts’ extents. If only N random concepts were selected, their extents might considerably intersect, so that their joint size is not much greater than the individual concepts’ sizes. The algorithm computes all selections that optimize the joint size for a given (ρ, N). Still, the analyst needs to justify which ranges for (ρ, N) to compute and then needs to choose among them. The following criteria can help.

    For an archetype analysis, we should require N ≥ 2 (if N = 1, a nomothetic approach would be better). Furthermore, N < m is reasonable as otherwise each single attribute is admitted as a (not very interesting) concept. Pragmatically, there might be a maximum number of concepts one is able to cope with in terms of available resources, publication space, and so on (e.g., N ≤ 10 or N ≤ 40). The joint size is then optimized for this range for N, and the range for ρ is chosen as in previous steps. The more important task is to choose N among all computed optimal selections. This number is actually the size of the suite of archetypes. Generally, we want to cover many cases, so we could only accept an optimal selection of concepts with a high joint coverage. However, it is not reasonable to require 100% joint coverage because there might be (i) quite idiosyncratic cases that only appear in concepts of size 1, or (ii) cases that are only described by one attribute. Depending on the research, there might be a number of cases we can afford to leave uncovered. All this requires a lower threshold, e.g., joint coverage ≥ 80%. On the other hand, there might be many optimal selections of N concepts above the threshold. It is recommended to choose the lowest N that achieves the threshold of joint coverage (by parsimony).

    Parsimony can also be operationalized in another way. Suppose we have, for a given (ρ, N), the optimal joint size v (ρ, N). Then, admitting one more concept can never decrease the optimal joint size: v(ρ, N+1) ≥ v(ρ, N) is always true. But what is the point of admitting one more concept if the joint size would be raised by just one more case? We can thus require that a further concept is only admitted to the suite of archetypes if it sufficiently improves coverage. With some preset threshold, e.g., ΔN = 2 or ΔN = 0.01 nout, we can choose N so that it fulfills both v(ρ, N) ≥ v(ρ, N-1) + ΔN and v(ρ, N+1) <v(ρ, N) + ΔN. With these criteria, OS2 ends with (several) optimal selections of concepts. Only those are retained in subsequent steps. We recommend applying this step only after CS2 (although it also works after CS1).

    RE-IDENTIFYING ARCHETYPES FROM TWO PREVIOUS STUDIES

    To illustrate the proposed metrics and steps, we now attempt to replicate two previously published archetype analyses, one with a smaller and one with a larger data set. The comparison between the original and the novel procedure and findings also highlights the practical effectiveness of the metrics and steps. Further examples are available from the authors upon request.

    Sustainable rural renewal in China

    Wang and colleagues (2019) study archetypes of rural renewal in contemporary China involving activities that replan, consolidate, and redevelop the extant and idle rural construction land and then convert such land for alternative uses, including new rural settlement construction and rural industry development. The governance of rural renewal and its performance show great diversity in facilitating rural sustainability. The analysis is based on primary data from n = 27 cases from the eastern, central, and western parts of China. Each case is characterized by the presence/absence of m = 47 attributes, including three outcome attributes (unsustainable, semi-sustainable, or sustainable rural renewal). In total, eight archetypes and 17 further sub-archetypes were identified, the latter capturing more fine-grained but not repetitive information on the cases. These archetypes and sub-archetypes were originally obtained by FCA with four criteria concerning the configuration, recurrence, theoretical coherence, and generality/particularity. This procedure involved the manual inspection of 134 concepts, many of them with more than 10 attributes.

    For the replication, we first used one optional step (OS1), and then the three core steps, with the quantitative metrics of consistency, coverage, and richness (Fig. 2).

    The first step (OS1) followed the original study by grouping the cases by the three outcome attributes, including unstainable rural renewal (group 1), semi-sustainable rural renewal (group 2), and sustainable rural renewal (group 3) (Wang et al. 2019). In addition, the cases in group 3 were decomposed into three sub-groups according to further attributes for the scales of rural land and rural households involved in rural renewal (Wang et al. 2019).

    Subsequently, CS1 was performed separately for each group (and sub-group), yielding between eight and 66 concepts per group (and 134 in total). The consistency, coverage, and richness of each concept was calculated. Considering the concepts in all the main groups and sub-groups, coverage varies from 22% to 100%, and richness from 2 to 21.

    We then proceed with the concept filter (CS2) to filter out all concepts with consistency < 100%, i.e., those that do not always lead to the respective outcome (unsustainable, semi-sustainable, or sustainable rural renewal). The chosen thresholds for coverage differ between the groups, as they contain a quite different number of cases. Thus, in groups 1 and 2, and sub-groups 3.1-3.3, formal concepts were filtered out if their coverage was below certain thresholds, varying from 22% to 66%. The original study only accepted concepts as (sub-)archetype with size = 2 or more. In terms of richness, we only retained concepts with 2 ≤ ρ ≤ 21. The minimal richness ensures a configuration of attributes rather than an individual attribute. Concepts with 21 attributes are already quite complicated and specific, being nearly half of the total number of attributes (m = 47), and can contain multiple interactions in several analytic stages. In total, the step reduced the number of “surviving” concepts to 36 for the main groups, and to 67 for the sub-groups.

    In CS3, all remaining concepts were inspected for theoretical saliency, i.e., whether these concepts can be explained by or develop the theories of transaction cost economics or/and rural collective action. The assessment of theoretical saliency was conducted by the authors of the original study. The remaining concepts were ultimately reduced to a suite of eight archetypes and 17 sub-archetypes. These are identical to those from the original study. The novel procedure (up to CS3) completely reproduced previous findings, yet with much less effort.

    Barriers to water governance under climate change

    The archetype analysis by Oberlack and Eisenack (2018) conducted a systematic literature review on barriers to climate change adaptation in water governance of river basins. Every “case” in this study is a causal statement made in an empirical publication (the study is a model-centered meta-analysis; Rudel 2008). Some publications made just one such claim, whereas others made several of them. The outcome is always the presence of a barrier (as other statements were not coded). In total, n = 114 statements about adaptation barriers provided by these peer-reviewed empirical publications were used in this study. The statements were coded with binary attributes based on the SES framework (m = 59) and performed with an FCA that led to 216 formal concepts. Only those 87 formal concepts that contained at least two different causal statements from two different publications were retained. In the original study, all of them were inspected by hand for theoretical saliency and led to a suite of 31 archetypes with richness 2, and a joint size of 93 cases. The study determined a further 10 archetypes with richness = 3, but those are all sub-archetypes of the previous.

    The original data were re-analyzed through the three core steps and one optional step (OS3) with the quantitative metrics of coverage, richness, size, and lift (Fig. 3).

    In CS1, the FCA again computed 216 formal concepts. The coverage, richness, size, and lift of each concept was calculated. We did not compute consistency as all concepts are consistent by definition in this study.

    In CS2, in aligning with the original study, we only retained concepts with size ≥ 2 (which converts to a coverage of 1.75%) and that appear in at least two different papers. For richness, we imposed 2 ≤ ρ ≤ 3 to also align with the original study. We further required lift ≥ 120%. After the concept filter, 72 concepts were kept.

    We then continued with optimal concept selection (OS2). We computed all selections of N concepts with the richness given in the previous step and that maximize the joint size. More specifically, we considered all optimal selections with N between 2 and 35 to have the original study’s N = 31 in the range, and to check whether there should be more archetypes than in the original study. We required the same joint coverage as in the original study (81.6%). The algorithm yields that this is already achieved for ρ = 2 and N = 25. So, the original coverage could have also been reached with considerably fewer archetypes if the proposed new procedures would have been available. If we additionally impose ΔN = 2, we cannot obtain a higher joint coverage than 79%. For ρ = 3, the threshold cannot be achieved (as in the original study). For re-identification, we thus required both joint coverage above 75% and ΔN = 2. The optimum (with a joint size of 90) is achieved with 10 different optimal selections, each with 22 concepts. As those selections are not so different, they contain 26 concepts in total.

    All these are fed into the theoretical analysis (CS3). As our main aim is to explore the advantage of the new steps by comparing with the original study, we concentrated this step on scrutinizing the relevance of the concepts that disappeared in comparison to the original study, and that are new (2 for the most similar optimum, see Append. 1 for more details). They do not change the main insights from the original study.

    Thus, 22 archetypes are obtained, fewer than the original study (i.e., 31 archetypes). The original study covers three more statements, but needs nine more archetypes to achieve this. In this regard, the novel procedures and metrics, while being more reproducible and faster, reproduce the essence of the original findings in a more parsimonious way.

    CONCLUSION

    Like other methodologies for combining explanation with classification, archetype analysis faces the trade-off between the number of classes and the strength of explanations. This requires purposefully navigating different levels of abstraction. If formal concept-based archetype analysis is used, this amounts to identifying the appropriate (number of) archetypes from the formal concepts. Our paper proposes standardized procedures with quantitative metrics to reduce unnecessary subjectivity and bias, to increase the process efficiency, and to obtain more parsimonious findings when selecting the archetypes. We propose using the metrics of consistency, coverage, richness, size, and lift. They can be computed and used in some of the three main steps: formal concept analysis, concept filtering, and theoretical analysis. Depending on the research question and data, two optional steps are suggested: grouping and optimal concept selection. The effectiveness of the novel procedures and metrics are assessed by reanalyzing two previously published studies. We demonstrate that the more quantitative steps lead to a manageable number of insightful formal concepts for further qualitative investigation—through a rather rigorous, objective, and much less resource-intensive procedure than previous studies. The novel procedure completely replicates the original archetypes or the original findings in a more parsimonious way. Our paper thus contributes to the further methodological development of archetype analysis in case-based sustainability research by suggesting a more objective and efficient way toward identifying archetypes with the appropriate granularity. In other words, an intermediate level of abstraction is achieved that avoids the “nomothetic trap” and the “idiographic trap” when combining classification with explanation.

    Obviously, the quantitative metrics and standard procedures for formal concept-based archetype analysis remain to be further tested and improved in the broader field of sustainability research. In particular, reasonable ranges for the thresholds for the metrics can be further tested in a heuristic manner that accumulates experience from individual studies. The lift metric has limits in exploiting more information on how attributes are configured by assuming stochastic independence, and when comparing among concepts with different richness. Technical advances are still possible to refine the standard procedures and improve their efficiency, especially on the stages of grouping and optimal concept selection.

    Our proposal is tailored to binary data, analyzed with FCA. Natural extensions would address non-binary or fuzzy data. Extending the procedures and metrics for methods other than FCA might be promising, e.g., for cluster analysis (one might compute correlation coefficients with an outcome variable for each level in a hierarchical cluster analysis to identify the best level). In addition, machine learning algorithms such as artificial neural networks or self-organizing maps offer entry points (Frey and Rusch 2013, Levers et al. 2018). Unsupervised classification realized by machine learning algorithms can further mitigate the subjective and biased identification of the appropriate number of archetypes, while likely involving trade-offs with theoretical saliency or consistency. In this regard, we argue that the methodological advances suggested in this paper show promising avenues for improved archetype analysis in case-based sustainability research, particularly by increasing analytical efficiency and obtaining more parsimonious findings.

    RESPONSES TO THIS ARTICLE

    Responses to this article are invited. If accepted for publication, your response will be hyperlinked to the article. To submit a response, follow this link. To read responses already accepted, follow this link.

    AUTHOR CONTRIBUTIONS

    Both authors equally contributed to the manuscript.

    ACKNOWLEDGMENTS

    We are grateful for the inspiring discussions with participants at the Workshop on Archetype Analysis in Sustainability Research in Washington, D.C. (www.archetype-analysis.net), the Workshop on the Ostrom Workshop 2024 (wow.indiana.edu), and the seminars of the Resource Economics Group. We want to thank Zuzana Harmáčková for the helpful exchange on testing the procedures, and two anonymous reviewers for their constructive suggestions. All remaining errors are ours. Rongyu Wang received support from the Natural Science Foundation of China through project No. 72103175. The article processing charge was funded by the Open Access Publication Fund of Humboldt-Universität zu Berlin.

    Use of Artificial Intelligence (AI) and AI-assisted Tools

    AI tools were used to improve the language of the paper and to speed up the coding of the R scripts.

    DATA AVAILABILITY

    The data and code that support the findings of this study are openly available in GitHub at https://github.com/ArchetypeAnalysis/Archetype-Identification. Ethical approval for this research study was not required because it only uses data that were already published in peer-reviewed papers.

    LITERATURE CITED

    Beltz, A. M., A. G. C. Wright, B. N. Sprague, and P. C. M. Molenaar. 2016. Bridging the nomothetic and idiographic approaches to the analysis of clinical data. Assessment 23(4):447-458. https://doi.org/10.1177/1073191116648209

    Buchadas, A., M. Baumann, P. Meyfroidt, and T. Kuemmerle. 2022. Uncovering major types of deforestation frontiers across the world’s tropical dry woodlands. Nature Sustainability 5(7):619-627. https://doi.org/10.1038/s41893-022-00886-9

    Capecchi, V. 1973. Some examples of classification and causal analysis. Quality and Quantity 7(1):131-56. https://doi.org/10.1007/BF00172372

    Castille, E. L., M. A. Janssen, and C. T. Solomon. 2023. Polycentric governance systems’ perceived impact on learning in north-central US lake and watershed organizations. Regional Environmental Change 23(3):109. https://doi.org/10.1007/s10113-023-02100-8

    Eisenack, K. 2024. Why local governments set climate targets: effects of city size and political costs. Environmental and Resource Economics 87: 2935-2965. https://doi.org/10.1007/s10640-024-00919-1

    Eisenack, K., G. Epstein, L. Finzel, E. Kellner, B. Nagel, S. Partelow, M. Roggero, and S. Villamayor-Tomas. 2025. Synthesizing archetypes of social-ecological systems: identifying common building blocks. Environmental Policy and Governance 35(5):882-897. https://doi.org/10.1002/eet.70006

    Eisenack, K., C. Oberlack, and D. Sietz. 2021. Avenues of archetype analysis: roots, achievements, and next steps in sustainability research. Ecology and Society 26(2):31 https://doi.org/10.5751/ES-12484-260231

    Eisenack, K., and M. Roggero. 2022. Many roads to Paris: explaining urban climate action in 885 European cities. Global Environmental Change 72:102439. https://doi.org/10.1016/j.gloenvcha.2021.102439

    Eisenack, K., S. Villamayor-Tomas, G. Epstein, C. Kimmich, N. Magliocca, D. Manuel-Navarrete, C. Oberlack, M. Roggero, and D. Sietz. 2019. Design and quality criteria for archetype analysis. Ecology and Society 24(3):6. https://doi.org/10.5751/ES-10855-240306

    Eisenack, K., H. Welsch, and J. P. Kropp. 2006. A qualitative dynamical modelling approach to capital accumulation in unregulated fisheries. Journal of Economic Dynamics and Control 30(12):2613-2636. https://doi.org/10.1016/j.jedc.2005.08.004

    Frey, U., and H. Rusch. 2013. Using artificial neural networks for the analysis of social-ecological systems. Ecology and Society 18(2):40. https://doi.org/10.5751/ES-05202-180240

    Ganter, B., and R. Wille. 1999. Formal concept analysis: mathematical foundations. Springer Nature, Cham, Switzerland. https://doi.org/10.1007/978-3-031-63422-2

    Gong, Y., and R. Tan. 2021. Emergence of local collective action for land adjustment in land consolidation in China: an archetype analysis. Landscape and Urban Planning 214:104160. https://doi.org/10.1016/j.landurbplan.2021.104160

    Gotgelf, A., M. Roggero, and K. Eisenack. 2020. Archetypical opportunities for water governance adaptation to climate change. Ecology and Society 25(4):6. https://doi.org/10.5751/ES-11768-250406

    Harmáčková, Z., K. Eisenack, Y. Yoshida, N. Sitas, and P. O’Farrell. 2025. Value archetypes in future scenarios: the role of scenario co-designers. Ecology and Society 30(3):4. https://doi.org/10.5751/ES-15884-300304

    Kimmich, C., and S. Villamayor Tomas. 2019. Assessing action situation networks: a configurational perspective on water and energy governance in irrigation systems. Water Economics and Policy 5(1):1850005. https://doi.org/10.1142/S2382624X18500054

    Kurtmollaiev, S., P. E. Pedersen, and T. Lie. 2023. A bird in the hand: empirically grounded archetypes of collaborative innovation in the public sector. Public Management Review 26(5):1266-1298. https://doi.org/10.1080/14719037.2023.2171092

    Levers, C., D. Müller, K. Erb, H. Haberl, M. R. Jepsen, M. Metzger, P. Meyfroidt, T. Plieninger, C. Plutzar, J. Stürck, P. Verburg, P. Verkerk, and T. Kuemmerle. 2018. Archetypical patterns and trajectories of land systems in Europe. Regional Environmental Change 18(3):715-732. https://doi.org/10.1007/s10113-015-0907-x

    Lopez Rodriguez, D., A. Mora, J. Dominguez, A. Villalon, and I. Johnson. 2020. Package fcaR. Github Inc., San Francisco, California, USA. https://github.com/neuroimaginador/fcaR

    Lüdeke, M. K. B., G. Petschel-Held, and H.-J. Schellnhuber. 2004. Syndromes of global change: the first panoramic view. GAIA 13(1):42-49. https://doi.org/10.14512/gaia.13.1.10

    Lyon, A. R., E. Connors, A. Jensen-Doss, S. J. Landes, C. C. Lewis, B. D. McLeod, C. Rutt, C. Stanick, and B. J. Weiner. 2017. Intentional research design in implementation science: implications for the use of nomothetic and idiographic assessment. Translational Behavioral Medicine 7(3):567-580. https://doi.org/10.1007/s13142-017-0464-6

    Magliocca, N. R., E. C. Ellis, G. R. Allington, A. de Bremond, J. Dell’Angelo, O. Mertz, P. Messerli, P. Meyfroidt, R. Seppelt, and P. H. Verburg. 2018. Closing global knowledge gaps: producing generalized knowledge from case studies of social-ecological systems. Global Environmental Change 50:1-14. https://doi.org/10.1016/j.gloenvcha.2018.03.003

    Manuel-Navarrete, D., J. J. Gomez, and G. Gallopin. 2007. Syndromes of sustainability of development for assessing the vulnerability of coupled human–environmental systems. The case of hydrometeorological disasters in Central America and the Caribbean. Global Environmental Change 17(2):207-17. https://doi.org/10.1016/j.gloenvcha.2006.07.002

    Meyfroidt, P., R. R. Chowdhury, A. de Bremond, E. Ellis, K. H. Erb, T. Filatova, and C. A. Kull. 2018. Middle-range theories of land system change. Global Environmental Change 53:52-67. https://doi.org/10.1016/j.gloenvcha.2018.08.006

    Meyfroidt, P., A. de Bremond, C. M. Ryan, E. Archer, R. Aspinall, A. Chhabra, G. Camara, E. Corbera, R. DeFries, S. Diaz, J. Dong, E. C. Ellis, K. H. Erb, J. A. Fisher, R. D. Garrett, N. E. Golubiewski, H. R. Grau, J. M. Grove, H. Haberl, A. Heinimann, P. Hostert, E. G. Jobbagy, S. Kerr, T. Kuemmerle, E. F. Lambin, S. Lavorel, S. Lele, O. Mertz, P. Messerli, G. Metternicht, D. K. Munroe, H. Nagendra, J. O. Nielsen, D. S. Ojima, D. C. Parker, U. Pascual, J. R. Porter, N. Ramankutty, A. Reenberg, A. Roy, R. Chowdhury, K. C. Seto, V. Seufert, H. Shibata, A. Thomson, B. L. Turner, J. Urabe, T. Veldkamp, P. H. Verburg, G. Zeleke, and E. zu Ermgassen. 2022. Ten facts about land systems for sustainability. Proceedings of the National Academy of Sciences 119(7):e2109217118. https://doi.org/10.1073/pnas.2109217118

    McNicholas, P. D., T. B. Murphy, and M. O’Regan. 2008. Standardising the lift of an association rule. Computational Statistics and Data Analysis 52:4712-4721. https://doi.org/10.1016/j.csda.2008.03.013

    Nagel, B., N. Buhari, and S. Partelow. 2024. Archetypes of community-based pond aquaculture in Indonesia: applying the social-ecological systems framework to examine sustainability tradeoffs. Environmental Research Letters 19(4):044026 https://doi.org/10.1088/1748-9326/ad2e71

    Neudert, R., A. Salzer, N. Allahverdiyeva, J. Etzold, and V. Beckmann. 2019. Archetypes of common village pasture problems in the South Caucasus: insights from comparative case studies in Georgia and Azerbaijan. Ecology and Society 24(3):5. https://doi.org/10.5751/ES-10921-240305

    Oberlack, C., and K. Eisenack. 2018. Archetypical barriers to adapting water governance in river basins to climate change. Journal of Institutional Economics 14(3):527-555. https://doi.org/10.1017/S1744137417000509

    Oberlack, C., S. Pedde, L. Piemontese, T. Václavík, and D. Sietz. 2023. Archetypes in support of tailoring land-use policies. Environmental Research Letters 18(6):060202. https://doi.org/10.1088/1748-9326/acd802

    Oberlack, C., D. Sietz, F. Bürgi Bonanomi, A. de Bremond, J. Dell’Angelo, K. Eisenack, E. C. Ellis, G. Epstein, M. Giger, A. Heinimann, C. Kimmich, M. T. J. Kok, D. Manuel-Navarrete, P. Messerli, P. Meyfroidt, T. Václavík, and S. Villamayor-Tomas. 2019. Archetype analysis in sustainability research: meanings, motivations, and evidence-based policy making. Ecology and Society 24(2):26. https://doi.org/10.5751/ES-10747-240226

    Oberlack, C., L. Tejada, P. Messerli, S. Rist, and M. Giger. 2016. Sustainable livelihoods in the global land rush? Archetypes of livelihood vulnerability and sustainability potentials. Global Environmental Change 41:153-71. https://doi.org/10.1016/j.gloenvcha.2016.10.001

    Orozco, R., M. Meyer-Jürshof, K. Vergara-Rodríguez, T. Václavík, and D. Sietz. 2024. Dynamic archetypes of agricultural land systems in Germany from 1992 to 2019. Land Use Policy 145:107281. https://doi.org/10.1016/j.landusepol.2024.107281

    Ostrom, E. 2007. A diagnostic approach for going beyond panaceas. Proceedings of the National Academy of Sciences 104(39):15181-15187. https://doi.org/10.1073/pnas.0702288104

    Ostrom, E. 2010. Polycentric systems for coping with collective action and global environmental change. Global Environmental Change 20(4):550-557. https://doi.org/10.4324/9781315254210-18

    Ostrom, E., and M. Cox. 2010. Moving beyond panaceas: a multi-tiered diagnostic approach for social-ecological analysis. Environmental Conservation 37(4):451-463. https://doi.org/10.1017/S0376892910000834

    Partelow, S., S. Villamayor Tomás, K. Eisenack, G. B. Epstein, E. Kellner, M. Roggero, and M. Tschopp. 2024. A meta-analysis of SES framework case studies: identifying dyad and triad archetypes. People and Nature 6:1229-1247. https://doi.org/10.1002/pan3.10630

    Piemontese, L., R. N. Kamugisha, J. Barron, J. M. B. Tukahirwa, N. Harari, and F. Jaramillo. 2022a. Investing in sustainable intensification for smallholders: quantifying large-scale costs and benefits in Uganda. Environmental Research Letters 17(4):045010. https://doi.org/10.1088/1748-9326/ac5ae0

    Piemontese, L., R. Neudert, C. Oberlack, S. Pedde, M. Roggero, A. Buchadas, D. A. Martin, R. Orozco, K. Pellowe, A. C. Segnon, L. Zarbá, and D. Sietz. 2022b. Validity and validation in archetype analysis: practical assessment framework and guidelines. Environmental Research Letters 17(2):025010. https://doi.org/10.1088/1748-9326/ac4f12

    Ragin, C. 1987. The comparative method. Moving beyond qualitative and quantitative strategies. University of California Press, Oakland, California, USA.

    Ragin, C. 2008. Redesigning social inquiry: fuzzy sets and beyond. University of Chicago Press, Chicago, Illinois, USA. https://doi.org/10.7208/chicago/9780226702797.001.0001

    Ratajczyk, E., U. Brady, J. A. Baggio, A. J. Barnett, I. Perez-Ibarra, N. Rollins, C. Rubiños, H. C. Shin, D. J. Yu, R. Aggarwal, J. M. Anderies, and M. A. Janssen. 2016. Challenges and opportunities in coding the commons: problems, procedures, and potential solutions in large-N comparative case studies. International Journal of the Commons 10(2):440-466. https://doi.org/10.18352/ijc.652

    Roggero, M. 2019. Social dilemmas, policy instruments, and climate adaptation measures: the case of green roofs. Mitigation and Adaptation Strategies for Global Change 25:625-42. https://doi.org/10.1007/s11027-019-09883-4

    Roggero, M., J. K. Fjornes, and K. Eisenack. 2025a. Ambitious climate targets and emission reductions in cities: a configurational analysis. Climate Policy 25(1):29-43. https://doi.org/10.1080/14693062.2023.2282488

    Roggero, M., J. K. Fjornes, and K. Eisenack. 2025b. Explaining emission reductions in cities: configurations of socioeconomic and institutional factors. Earth System Governance 24:100252. https://doi.org/10.1016/j.esg.2025.100252

    Rudel, T. K. 2008. Meta-analyses of case studies: a method for studying regional and global environmental change. Global Environmental Change 18(1):18-25. https://doi.org/10.1016/j.gloenvcha.2007.06.001

    Schneider, C. Q., and C. Wagemann. 2012. Set-theoretic methods for the social sciences--a guide to qualitative comparative analysis. Cambridge University Press, Cambridge, UK. https://doi.org/10.1017/CBO9781139004244

    Sietz, D., and R. Neudert. 2022. Taking stock of and advancing knowledge on interaction archetypes at the nexus between land, biodiversity, food and climate. Environmental Research Letters 17(11):113004. https://doi.org/10.1088/1748-9326/ac9a5c

    Sietz, D., J. C. Ordoñez, M. T. K. J. Kok, P. Janssen, H. B. M. Hilderink, P. Tittonell, and H. van Dijk. 2017. Nested archetypes of vulnerability in African drylands: where lies potential for sustainable agricultural intensification? Environmental Research Letters 12(9):095006. https://doi.org/10.1088/1748-9326/aa768b

    Sietz, D., and H. van Dijk. 2015. Land-based adaptation to global change: what drives soil and water conservation in western Africa? Global Environmental Change 33:131-141. https://doi.org/10.1016/j.gloenvcha.2015.05.001

    Tan, R., and R. Wang, R. 2025. Archetype analysis of land governance: a systemism research paradigm. Land Use Policy 150:107466. https://doi.org/10.1016/j.landusepol.2024.107466

    United Nations Environment Programme (UNEP). 2007. Global environmental outlook 4. Progress Press Ltd., Valetta, Malta.

    Václavík, T., M. Beckmann, M. Bednář, S. Brdar, G. Breckenridge, A. F. Cord, C. Domingo-Marimon, A. Gosal, F. Langerwisch, A. Paulus, S. Roilo, B. Šarapatka, G. Ziv, and T. Čejka. 2024. Farming system archetypes help explain the uptake of agri-environment practices in Europe. Environmental Research Letters 19(7):074004. https://doi.org/10.1088/1748-9326/ad4efa

    Vatn, A. 2005. Institutions and the environment. Edward Elgar Publishing, Northampton, UK.

    Villamayor-Tomas, S., C. Oberlack, G. Epstein, S. Partelow, M. Roggero, E. Kellner, M. Tschopp, and M. Cox. 2020. Using case study data to understand SES interactions: a model-centered meta-analysis of SES framework applications. Current Opinion in Environmental Sustainability 44:48-57. https://doi.org/10.1016/j.cosust.2020.05.002

    Wang, R., K. Eisenack, and R. Tan. 2019. Sustainable rural renewal in China: archetypical patterns. Ecology and Society 24(3):32. https://doi.org/10.5751/ES-11069-240332

    Wang, R., and R. Tan. 2020. Patterns of rural collective action in contemporary China: an archetype analysis of rural construction land consolidation. Journal of Rural Studies 79:286-301. https://doi.org/10.1016/j.jrurstud.2020.08.019

    Warren, A. 2002. Land degradation is contextual. Land Degradation and Development 13:449-459. https://doi.org/10.1002/ldr.532

    Young, O. R. 2019. Constructing diagnostic trees: a stepwise approach to institutional design. Earth System Governance 1:100002. https://doi.org/10.1016/j.esg.2019.02.001

    Young, O. R., E. F. Lambin, F. Alcock, H. Haberl, S. I. Karlsson, W. J. McConnell, T. Myint, C. Pahl-Wostl, C. Polsky, P. S. Ramakrishnan, H. Schroeder, M. Scouvart, and P. H. Verburg. 2006. A portfolio approach to analyzing complex human–environment interactions: institutions and land change. Ecology and Society 11(2):31. https://doi.org/10.5751/ES-01799-110231

    Corresponding author:
    Klaus Eisenack
    klaus.eisenack@hu-berlin.de
    Appendix 1
    Fig. 1
    Fig. 1. Overview of metrics (suggested thresholds may be adjusted according to the research).

    Fig. 1. Overview of metrics (suggested thresholds may be adjusted according to the research).

    Fig. 1
    Fig. 2
    Fig. 2. Re-identifying archetypical patterns of sustainable rural renewal in China.

    Fig. 2. Re-identifying archetypical patterns of sustainable rural renewal in China.

    Fig. 2
    Fig. 3
    Fig. 3. Re-identifying archetypical barriers to water governance under climate change.

    Fig. 3. Re-identifying archetypical barriers to water governance under climate change.

    Fig. 3
    Table 1
    Table 1. Steps for archetype identification (further explanations in the text).

    Table 1. Steps for archetype identification (further explanations in the text).

    Step Type Name Description
    CS1 Core, Quantitative Formal Concept Analysis (FCA) - Choose one outcome attribute
    - Run FCA on non-outcome attributes
    - For each concept, compute size, coverage, richness, lift, consistency
    CS2 Core, Quantitative Concept Filter - Choose thresholds, e.g., size ≥ 2, coverage ≥ 1%, m/2 ≥ richness ≥ 2, lift ≥ 100%, consistency ≥ 80%
    - Only retain concepts that respect these thresholds
    CS3 Core, Qualitative Theoretical Analysis - Inspect all remaining concepts by theoretical saliency, coherence with research purpose, and possibly case-based knowledge
    OS1 Optional, Semi-qualitative Grouping - Split cases into different groups based on outcomes or critical attributes of interest
    OS2 Optional, Quantitative Optimal Concept Selection - Compute selections of concepts that maximize joint size
    - Choose N concepts, e.g., with joint coverage ≥ 80% or ΔN = 2
    Click and hold to drag window
    ×
    Download PDF Download icon Download Citation Download icon Submit a Response Arrow-Forward icon
    Share
    • Twitter logo
    • LinkedIn logo
    • Facebook logo
    • Email Icon
    • Link Icon

    Keywords

    Click on a keyword to view more articles on that topic.

    archetype analysis; classification; climate change adaptation; configurational methods; formal concept analysis; land-use change

    Submit a response to this article

    Learn More
    See Issue Table of Contents

    Subscribe for updates

    * indicates required
    • Submit an Article
    • Submission Guidelines
    • Current Issue
    • Journal Policies
    • Find Back Issues
    • Open Access Policy
    • Find Features
    • Contact

    Resilience Alliance is a registered 501 (c)(3) non-profit organization

    Permissions and Copyright Information

    Online and Open Access since 1997

    Ecology and Society is now licensing all its articles under the Creative Commons Attribution 4.0 International License

    Ecology and Society ISSN: 1708-3087