Anarchy Reigns A quantitative analysis of Agent-Based Modelling publication practices in JASSS , 2001-2012

Agent Based Modelling (ABM), a promising scientific toolset, has received criticism from some, in part, due to a claimed lack of scientific rigour, especially in the communication of its methods and results. To test the veracity of these claims, we conduct a structured analysis of over 900 scientific objects (figures, tables, or equations) that arose from 128 ABM papers published in the Journal of Artificial Societies and Social Simulation (JASSS), during the period 2001 to 2012 inclusive. Regrettably, we find considerable evidence in support of the detractors of ABM as a scientific enterprise: elementary plotting attributes are left off more often than not; basic information such as the number of replicates or the basis behind a particular statistic are not included; and few, if any, established methodological communication standards are apparent. In short, 'anarchy reigns'. Whilst the study was confined only to ABM papers of JASSS, we conclude that if the ABM community wishes its approach to be accepted further afield, authors, reviewers, and editors should take the results of our work as a wake-up call.


Introduction
"While the theoretical and experimental foundations of agent-based systems are becoming increasingly well understood, comparatively little effort has been devoted to understanding the pragmatics of (multi-)agent systems development -the everyday reality of carrying out an agent-based development project. As a result, agent system developers are needlessly repeating the same mistakes, with the result that, at best, resources are wasted -at worst, projects fail." "It is widely acknowledged that … agent-based models, can play an important role in fostering understanding of the dynamics of complex systems. … However, current [agent-based] modelling practice has two substantial shortcomings: (1) The reasoning behind the choice of a certain human decision model is often not well documented; insufficient empirical or theoretical foundations are given; or the decision model is only assumed on an ad-hoc basis … (2) Often the model is not described in a transparent manner (clear and complete) that would allow for reproducibility and facilitate the communication of the model and its results."

1.1
The two quotations above concisely describe a tragedy in the storied history of Agent-Based Modelling (ABM). The tragedy being that, whilst describing essentially the same paradox -the promise of ABM approaches in the social sciences juxtaposed against the lack of well developed practices in ABM science -the first quote (Wooldridge & Jennings 1998) predates the second (Müller et al. 2013) by 15 years. [1] 1.2 Of course, during this period -one which has seen a new global financial crisis (GFC), the emergence of several potentially pandemic infectious diseases, and the rise and rise of network-based social media platforms -the champions of ABM methodologies from all relevant fields have re-emphasised, in prominent publishing platforms, the need for, and renewed relevance of, ABM methods: Doyne 2009 GFC to call for ABMs in Economics at Nature; whilst Joshua Epstein, also writing in Nature, pointed to the network complexity inherit in the emerging H1N1 outbreak of 2009 to point out how suited ABMs are to modelling infectious diseases (Epstein 2009). In short, the promise of the methodology seems, during this period, alive and well.

1.3
Meanwhile timely and compelling articles of the generic form "Field X, meet ABM …," were written for, amongst others, the 'human systems' modelling community by Eric Bonabeau in the US flagship, Proceedings of the National Academy of Sciences (Bonabeau 2002), the mind-action socio-economic community by Nigel Gilbert and Pietro Terna (2000) in the (then brand new) Mind and Society journal, and for the sociology community by Michael Macy and Robert Willer in the Annual Review of Sociology (2002), each arriving at the start of this crucial period.

1.4
In terms of results, a few ABM practitioners have found a receptive audience for their insightful works. Creative, unashamedly ABM, papers have found their way into top field (e.g. in the Economic sciences: American Economic Review (albeit, Papers & Proceedings of the AEA), Journal of Economic Dynamics and Control , Journal of Economic Behaviour and Organisation) and generalist (e.g. Science) journals during this period (Geanakoplos et al. 2012;Howitt & Clower 2000;Dosi et al. 2010;Lim et al. 2007).
1.5 However, outside of these few high-points of the ABM social sciences, ABM studies in general have fared rather less well. Leombruni et al.'s ((2005 ) survey of the top 20 Economic and 10 Social sciences journals found only a handful of published studies (7 in the former, 11 in the latter) up to that point in time which used an ABM methodology. Recent publishing practices seem little different.

1.6
Various explanations are given for ABM's difficult reception in field journals. Chief amongst these seem to be a perceived lack of what might be called 'intuitive transparency' relative to the closed-form, deductive, toolsets available. Joshua M Epstein (2006), in his classic contribution to the Handbook of Computational Economics (vol 2), writes (ch 34), "The real reason some mathematical social scientists don't like computational agent-based modelling is not that the approach is empirically weak (in notable areas, it's empirically stronger than the neoclassical approach). It's that it isn't beautiful."

1.7
In a similar direction, Leombruni and Richiardi (2005) consider the two key perceived problems of 'Economists' with ABM studies (lack of generalisability, identification or estimation problems) stand behind the headline comment that ABMs, 'don't prove anything'. (Case in point, Leombruni and Richiardi's paper was published not by a progressive, forward-looking, Economics field journal, but rather by Physica A: statistical mechanics and its applicationsnot, one assumes, a commonly read journal by most economists.) 1.8 Whilst the the 'proof' (or 'beauty') problem is rather over-stated by field editors and reviewers (and is handled very well by the authors just mentioned), a lingering problem remains: the communication of ABM methods and resultsthis is an ABM science practice issue.

1.9
The simple fact is that ABM studies often draw their heritage apart from the deductive mathematical sciences, instead, they build on the sciences of software design and numerical simulation. And here, ABM practitioners in the social sciences seem to have suffered heavily from a lack of well-established communication tools and standards. Richiardi et al. (2006) capture this point well (paragraph 1.5), "Agent-based models have solid methodological foundations. However, the greater freedom they have granted to researchers (in terms of model design) has often degenerated in a sort of anarchy (in terms of design, analysis and presentation)." [emphasis added] 1.10 They go on to elaborate this anarchy as follows (paragraph 1.5), "a) There is no clear classification of the different ways in which agents can exchange and communicate: every model proposes its own interaction structure.
"b) There is not a standard way to treat the artificial data stemming from the simulation runs, in order to provide a description of the dynamics of the system, and many articles seem to ignore the basics of experimental design. Often, the comparison between artificial and real data is overly naïf, and the parameters' values are chosen without proper discussion; and "c) Too often, it is not possible to understand the details of the implementation of an agent-based simulation. This makes replication a difficult, sometimes impossible task, thus violating the basic principle of scientific practice and confining the knowledge generated by agent-based simulations to no more than anecdotal evidence." [enumeration added] 1.11 … Anarchy indeed. Sadly, and bringing us back to the opening quotation from Wooldridge and Jenning's (1998) article, this exact problem was identified (understandably then) for the 'new field' of ABM eight years prior (section 8.2), "In a field as new as agent systems, there are few established standards that a developer can make use of when building the agent specific components of an application." 1.12 Since this time, there have, of course, been real attempts to order the 'anarchy' of ABM development and communication. As early as 2000 the powerful Unified Modeling Language (UML) was articulated to include agents (Odell et al. 2000) and so offered a potential standardisation candidate (at least of one aspect of ABM development), whilst the Overview, Design concepts, and Details (ODD) (Grimm et al. 2006) (Heath et al. 2009) and 2-Dimensional, spatial visualisation ( Kornhauser et al. 2009). Whilst these approaches are all highly relevant and important (and in some focussed areas of the literature appear to be having an impact on the quality of ABM practice, Grimm et al. 2010), we wish to provide the community with a slightly more quantitative perspective on the state of affairs.
1.13 Specifically, in this study, we focus on the third (c) aspect of Richiardi et al.'s (2006) 'anarchy' enumeration -that of the basic communication of methodologies and results via a study's visual short-hand -the tables, figures and equations that it employs to describe its science (what we shall call a paper's 'objects'). We consider that this simple aspect has received little attention in the standardisation efforts to date. Where previous authors have spent their time on the development and design aspects of ABMs, it would appear that the community has largely assumed that the quality of basic components of ABM communication was in good order. We contend that even if a model is described accurately and helpfully, perhaps using the ODD or ODD+D protocol, and even if the model has been well validated, it will still fail as a piece of science if its key methodological and results objects are of poor quality. This distinction matters perhaps moreso in our visually-orientated, short-attention-span, era of scientific publishing. Indeed, the so-called 'mega-journal' PLOS-One, exemplifies this trend, in asking authors to nominate a 'striking image' (for our study: 'object') during submission. They presumably know well the value of such an image for their social media and marketing platform integrations.
1.14 We conduct a structured analysis of over 900 scientific objects (figures, tables, or equations) that arose from 128 ABM papers published in the JASSS, during the period 2001 to 2012 inclusive. By focussing on a single outlet, we reduce any inter-journal variance on publishing and editorial standards and allow authors' own practices to come to the fore.
1.15 The journal JASSS was chosen for the study for two principle reasons: first, it is well-regarded as a general social sciences ABM study 'clearing-house' with an active and engaged readership, so stands well for cross-disciplinary social sciences ABM publishing trends; and second, JASSS, being an open-access, online, HTML-based, journal lends itself to facile multi-year study of that which we propose -papers could be identified, harvested and analysed with ease. We note also that JASSS acts as a crucial 'reverberation board' for the transmission of relevant ideas between the social and physical sciences (Squazzoni & Casnici 2013).
1.16 Finally, the study period 2001-2012 covers this crucial 'early decade' of social sciences ABM science. As mentioned earlier, not only were multiple, key contributions made to introduce ABM methods to various social sciences fields at the start of our study period, but also the period covers almost all of the major contributions to the standardisation program of ABM social science model development and communication which has been ongoing since at least 2000. If these efforts have had any early impacts on ABM communication practices, these should be evident in our study.

2.2
A script was written to download the .html file of each article from the JASSS site and then process the meta-data of the HTML files to obtain the paper's fields (title, authors, keywords, abstract, volume, issue, date of publication). In seven cases, the meta-data format deviated from the JASSS standard, causing the articles to be dropped from the sample (e.g. http://jasss.soc.surrey.ac.uk/11/4/3.html). This produced 480 articles with successfully harvested meta-data.

2.3
Next, a corpus of unique keywords (the 'Subject' meta-data content in the JASSS template, for an example, see Appendix A) (note, keywords can be phrases) was built, taking care of non-material keyword variations such as the presence or absence of a hyphen. In all, 1501 keywords were gathered in this step.

2.4
Finally, an article was enrolled into the primary database if it contained one of the following 'ABM' keywords chosen by the authors after reviewing the unique keyword set, either exactly, or as a keyword root (e.g. 'multiagents', or 'multiagent systems' would match with 'multiagent') (number of matches in parentheses): 'agent based' (151), 'multi agent' (39), 'social simulation' (31), 'individual based' (9), 'multiagent' (8), 'agent simulation' (2), and 'artificial agents' (1). Of the 480 papers, 220 unique papers contained a matching keyword were enrolled in the primary database. The majority of papers (202) had a single ABM keyword match, 16 matched two ABM keywords, and two matched three.

2.5
To assist with replication, a full listing of the resulting 937 objects, their home paper ID, and basic descriptors is given in a commaseparated value file online with this work.

Methods
Defining & Validating the Object Taxonomy 3.1 Since no clear taxonomy of objects exists to our knowledge in the literature, the authors set about building a useful and facile taxonomy to describe the publication practices in each ABM paper. Our methodology has been guided by experience arising from related social sciences taxonomy/encoding exercises, albeit of a textual nature (Hara et al. 2000;Rourke & Anderson 2004). Building the taxonomy organically proceeded in 6 cyclical steps: primary database (Sample A). 2. A draft taxonomy was compiled collaboratively, including hierarchical descriptions. 3. The draft taxonomy was then applied, independently by two authors (SDA, BH-M) to Sample A. 4. The same authors then met to discuss disagreements and imperfections in the draft taxonomy leading to a refined taxonomy. 5. The refined taxonomy was then applied independently by two authors (SDA, BH-M) to a further 20 paper random sub-sample from the primary database (Sample B). 6. A second meeting was then convened between the two authors to validate and further refine the taxonomy leading to the final taxonomy, and clarify its application to the pooled (Samples A + B) 40 paper sample used in previous steps.
3.2 Note, since our focus of analysis is on the decisions of authors as to how they present the methods and results of ABM studies, we skip duplicate object types found in any article. That is, the first instance of a given object is studied, with second, third, and subsequent objects having the same general attributes as the first, not included in the analysis. Typically, subsequent objects of the same kind were presented with identical features (or lack of features) as the first object, presumably since authors create figures, or tables, via 'templates'.
Summary of the Object Taxonomy  Table, or the Granularity of a Results Figure or Table, refer Table 1) were not obvious from the contents of the object, or its caption, the information was sought from the surrounding text. However, in the specific case where the simulation results figures were studied for their quality (see final Results section, 'Quality of ABM results plotting over time' below) a more stringent test was applied requiring the key information we were looking for to be present in the figure itself or figure caption only. We are of the view that a results figure should be, as much as possible, self-contained.
Here Examples of Taxonomic Objects 3.5 Note: in all cases, except where stated otherwise, example objects from papers are provided without explicit attribution. Our intention is not to single out particular authors, but rather to point to general trends in the discipline. Source JASSS article citation information for any example provided can be sought from the corresponding author.

3.6
In Figures 1 to 9 we provide examples of the top three (by incidence in the database) objects used to communicate methodologies in the database.    Table', 'Schematic Diagram', and 'Screen-shot' objects are presented. For the former, we were directed by the nomenclature of the authors, where ' Figure' was used, the ' Figure' taxonomy object classes were employed, whilst ' Table' induced the ' Table' taxonomy classes. 'Schematic Diagram', as in the example (Figure 2), was used to code any figure which conveyed the relationship between multiple aspects of the model (agents, procedures) but did not conform to either a flowdiagram or UML-formalism. We have not sought to classify these figures further, owing to the rich variety of symbolic and relational elements employed by authors.     Table' class objects, being, unsurprisingly, focussed on parameter initialisation, rule definitions, and (numerical simulation) experimental conditions. Figure 7 (a, b and c) provide examples of the prominent equation types used by authors in the methods section.
Training & Encoding the entire Primary Database 3.9 Next, a research assistant, familiar with ABM science, was trained in applying the final taxonomy, before applying it to Sample A. A review and training meeting was held with one of the authors (BH-M) to provide guidance and clarification to the research assistant, before they applied the final taxonomy to Sample B. A final training meeting was held with one of the authors (BH-M) to complete the training and ensure strong coherence with the encoding approach of the authors.
3.10 Finally, the research assistant went on to encode the entire Primary Database, recording decisions with an online form to facilitate data entry. Validation of the research assistant's application of the final taxonomy to the Primary Database included: on-going, ad-hoc, conferring with one of the authors (BH-M); random checking by the same author of the encoding (around 10% of the coded objects in all); and then author checking of any residual 'flagged' objects. A 'flag' was used wherever the research assistant was hesitant for any reason in the correct encoding to use. Finally, a different research assistant reviewed all 'flagged' objects to confirm that the correct taxonomy had been applied.
3.11 During coding, a number of papers were found not to be ABM in nature and were dropped from the database following a simple rule: if the paper did not convey the methods and results of a scientific enquiry using an ABM model, it was dropped. For example, some papers were found to discuss ABM theory or practice (such as Deichsel and Pyka's (2009)

Results
Summary of the data   Table 3 demonstrates the tendency to convey methodological aspects of a study predominantly with the use of a balance of equations, figures and tables, whilst results are seldom communicated (or analysed) in equation form (a point we return to later), and largely find expression in graphical (figure) presentation.

4.2
To analyse publishing practices over time, we study the incidence of papers which have at least one object of a given type. As presented in Tables 2 and 3, it is obvious that some years have a small (< 5) number of papers or given object type. Thus, to draw meaningful conclusions, we aggregate over four, three year, periods: 2001-2003, 2004-2006, 2007-2009, and 2010-2012 (inclusive).
Temporal patterns in methodological presentation

4.3
In Table 4, and visualised in Figure 8, trends in publishing practices for methodological presentation are studied. We use the incidence-rate of an object, defined as the fraction of relevant papers (i.e. those including methodological objects) in the given period in the Study Database which utilised the object at least once, as the summary measure the practices undertaken by authors. We note that percentages do not have to sum within a column, as one paper may exhibit more than one object type.

4.4
For the sake of the analysis, we pool Figure and Table object types which together gives 25 unique methods objects. Further, we focus on only those objects which obtain an incidence-rate of at least 10% in one of the four periods. We find 12 such object types (Table 4) which fit this criterion, covering between 80% and 89% of all relevant papers in that period. Prominent object types included in 'Misc (all others)' include 'Algorithm', '3D model of environment', 'Agent type histogram', and 'Video' (recall, JASSS is wholly published online and multi-media is encouraged, though apparently used sparingly).

4.5
As can be seen in Figure 8, the top object choices: 'Look-up-table', 'Schematic diagram' and 'Screen-shot' are very stable over the study period. What is perhaps surprising is that the more formal, structured, model description tool of UML appears to have all but disappeared as a form of communication over time. Alternatively, and perhaps more encouragingly, raw-code, and pseudo-code (arguably the most informative and detailed description of the mechanics of a model) have retained, or slightly increased, their incidence-rate over time.
Temporal patterns in results presentation  Figure 9. Patterns in ABM results object use over time. Each bubble represents the percentage of papers in the dataset published during the given three year band which included at least one of the objects indicated. Note: sizing of the bubbles is non-linear.

4.6
In Table 5, and visualised in Figure 9, we present a similar analysis for results objects, again, pooling Figure and Table object types. In this analysis, we focus on the basis of the results presentation -whether the results are drawn from simulation results only (i.e. quantities drawn from artificial datasets), empirical results only (I.e., quantities drawn from survey, or measured datasets), theoretical outcomes (i.e. numerical calculations of parametric systems, typically without stochastic sources), or some combination of the above.

4.7
What the data in the table and the figure show is that the predominant practice, by approximately nine to one is the use of simulation data only from which to draw results. That is, authors of identified ABM studies, in JASSS, almost exclusively support their key results (either as tables or figures) with the use of simulation data only. Authors are not, therefore, often found to be presenting (for verification, or comparison) other quantity types along with their simulation data. There is perhaps some evidence of a trend towards comparison of simulation quantities to empirical quantities ('Mixed: empirical -simulation') coupled with a tendency away from theoretical comparison, but the sample is too small to assert these as significant trends.    4.10 In Figures 10, 11 and 12, we provide (anonymous) examples, taken from the Study Database, of a low (score: 1/5), medium (score: 3/5) and high (score: 4/5) quality results figure as assessed by our attribute method. 4.11 In all, 229 result plots, drawn from 106 unique papers, were analysed using the quality metric. These objects all claimed, or appeared, to be simulation-based in nature -the predominant form of results presentation choice. By a clear margin (Table 6), the most prevalent attributes displayed were the XY2 and XY1 attributes with over 91% and 77% of objects exhibiting X and Y scales, and X and Y axes labelling respectively. To aid the reader's understanding, we provide in Figure 13 a rare 'negative' example in which a simulation results object failed the 'XY2' (X and Y scale) attribute. However, perhaps alarmingly, over 40% of objects failed to demonstrate one or more of the other three attributes.  4.12 If one assumes that authors are ultimately responsible for the presentation of result figures, then one can take an average score across all qualifying results objects by paper (author). In Figure 14, we present the percentiles, by aggregate time period, of average paper scores, where each paper is represented by the average of the scores of the results objects within it, coded in our study.
4.13 Again, stressing that we consider the five results figure attributes as basic scientific requirements, the results are alarming, both in terms of the average quality, and the tend over time. For instance, the median paper score in 2001-2003 of 2.50 is only marginally improved upon (3.00) by 2004-2006, and then not again in the subsequent years. This indicates that around 50% of papers present results figures which miss two or more of the basic five attributes we identify. At the top end, a high point paper score of 4.62 is obtained for the 95 th percentile in [2004][2005][2006], but this slides to 4.00 by 2010-2012. 4.14 These data indicate that for whatever reason, the quality of simulation results presented by ABM authors in JASSS over the last decade is generally poor, and the standards are not improving. We discuss potential reasons for these problems below.
Discussion & Conclusions

5.1
This study set out to study an under-reported aspect of ABM science: the communication practices and quality of methodological and results objects in ABM studies over the last decade. As noted in the introduction, whilst other aspects of ABM development and communication have received academic attention and proposals, we are not aware of any such review of the kind we carry out here.

5.2
Against a background and history of early 'anarchy' on the one hand, and attempts at formalisation on the other, we were interested to see if there were any trends towards order or disorder in ABM publishing practices, and in the general quality of these practices for scientific ends.

5.3
Below, we draw together three key conclusions from our work, and provide some tentative reflections on their possible causes.
Conclusion 1 ABM science's methodological 'anarchy' shows no signs of submission to formalism

5.4
We are indebted to Richiardi et al.'s (2006) 'anarchy' descriptor. Looking at the practices revealed in this study, we see no evidence that the state of affairs is changing. Three dominant modes of visual communication were in the ascendency in 2001-2003 (Look-up-tables, 'Schematic' diagrams', andScreen-shots), and the same three sat in the same position a decade later (Figure 8). Whilst look-up-tables and screen-shots could perhaps be left aside from this analysis, as they serve their own, specific, communicative purpose, the use of relational diagrams not fitting any particular formalism (which we call 'schematics') is interesting. Indeed, when one notes that the incidence-rate of UML and Flow-chart diagrams has declined during the decade, we can only conclude that the social sciences community has turned its back on, or perhaps, has never properly engaged with, the use of more formal visual languages for conveying to the reader the core relationships amongst model components and agents.

5.5
A potential explanation for this lack of engagement is offered by Heath et al.'s (2009) 'many fields of study' conclusion to their survey of 297 ABM papers, "ABM is connecting diverse fields. The fields of biology, business, ecology, economics, the military, public policy, social science and traffic, among others, all use ABM. These diverse fields are trying to understand complex systems and are using ABM as a one common tool. … after reviewing the surveyed articles it is clear that each field has developed their own ABM terminology to describe techniques, applications and results, have their own ABM standards and their own ABM philosophies." (paragraph 4.4)

5.6
Whilst the papers surveyed here are all drawn from the 'social sciences' community as they were published by JASSS, a similar driver could be conjectured. One of the powerful hallmarks of JASSS is its wide embrace of agent-based simulation papers from all across the social sciences. However, this diversity will bring with it a diversity of philosophies of knowledge, and a diversity of expertise with quantitative and computational methods. In particular, Wooldridge and Jennings' (1998) 'Agent-oriented Development' pitfall #4.3 comes to mind, "You forget you are developing software" [emphasis retained]. Wooldridge and Jennings write, "Unfortunately, because the process [of developing any agent system] is experimental, it encourages the developer to forget that they are actually developing software. Project plans tend to be pre-occupied with investigating agent architectures, developing cooperation protocols, and improving coordination and coherence of multi-agent activity. Mundane software engineering processes -requirements analysis, specification, design, verification, and testingbecome forgotten. The result of this neglect is a foregone conclusion: the project flounders, not because of agentspecific problems, but because basic software engineering good practice was ignored." [emphasis retained] 5.7 Does the anthropologist, writing their first ever ABM in (say) NetLogo, gripped by the notion that their theoretical hunch can, for the first time, be modelled and visualised in real-time before their eyes, think that they are now a 'software engineer'? (Do they even know what that is?) The answer is obviously 'no'. Our anthropologist gets on with using the ABM tool to support their scientific conclusion. Along the way, however, the development, validation, and communication of their methodology and results will very likely be 'home-spun'. And so, another bespoke daughter of the 'schematic' class is born.
Conclusion 2 The scientific presentation of ABM results needs immediate, remedial, attention

5.8
The second conclusion rests on the alarming patterns uncovered by our simple '5 attributes' quality survey of simulation-based results plots (Table 6, Figure 14). That 60% of these 'results' figures didn't clearly indicate the parameters used to generate them, nor the number of simulations behind them, strikes somewhat of a mortal blow to any hope of successful replication. Or, at the paper-level of observation, that the 75 th percentile average results figure quality score is less than 3.5, during the most recent period (2010-2012) in the Study Database suggests the problem of adequate results presentation practice is wide-spread and continues today.

5.9
At face value, it would seem that either the interpretation or enforcement of JASSS' own author guidelines is at fault as they appear clear on the matter of replication, "Authors are strongly encouraged to include sufficient information to enable readers to replicate reported simulation experiments." 5.10 However, the ongoing text of these guidelines focusses more on the request that full model code be made available through a thirdparty site, rather than the details needed to actually use the model to replicate the results. Indeed, point (4) of JASSS's stated 'Referee guidelines' continues in this 'algorithmic-centric' view, asking referees to comment explicitly on, "If the article describes a simulation model, is there enough detail provided for the relevant output from the model to be replicated by a reader (the description might be in the form of an algorithm, pseudo-code, or access to the simulation program itself)?" 5.11 Whilst the question is 'replication', the apparent sufficient answer, according to JASSS' guidelines, is that authors provide an indication of the algorithm used to generate the results. Of course, if the 'simulation program itself' is provided, then it is possible, with a suitably written code-base (and runfiles) to identify the exact parameters used for every result in the paper. However, anything short of this will not suffice: 'an algorithm', or 'pseudo-code', on its own, will not leap into action and produce the results in the paper.

5.12
Here, there seems a clear and simple opportunity for JASSS (at least) to tighten its guidelines around results, broadening its 'algorithmic-centric' view of replication to include all the information required to replicate the results, including for example, parameters, initialisation settings, random-number stream definitions, and perhaps even hardware architecture. Again, such considerations no doubt arise immediately to the mind of the software engineer, but not, our anthropologist (or economist, or sociologist, or …) colleague (we include ourselves here).
Conclusion 3 Parametric interpretation by estimation of ABM simulation data appears to be missing in action 5.13 Close inspection of Table 3 highlights one further pattern in the practices related to analysing ABM results: the lack of estimation of ABM simulation data. The 'smoking-gun' for this conclusion is the almost complete lack of equation objects within the results sections of the 128 papers reviewed, indeed, just one 'results' equation was identified by our methods. Whilst we readily acknowledge that there are many cases where parametric estimation of ABM outputs is unnecessary to justify scientific conclusions due to the inherent unpredictability or emergent complexity of classes of ABM models, there are many other cases, (for example, where an ABM model is to be compared to empirical economic data) where estimation is necessary. That just one paper appeared to go down this path, would seem that estimation, as an analysis technique, or as a parameter tuning tool, is effectively unknown in the social sciences ABM community.
5.14 To be fair, there is likely cultural and technical reasons at play behind Conclusion 3. Culturally, we refer to the 'many fields of study' argument proposed above in reflection on Conclusion 1. ABM social science demands, at times, a varied and reasonably technical skillset: not only must the social scientist have their own field-specific knowledge and novelty of contribution, as we have seen, they should also ideally be aware of some basic software engineering principles, and here, they would now appear to need further to be familiar with time-series analysis and estimation techniques. Supposing again that JASSS welcomes a higher proportion of 'many fields' authors who are exploring ABM as a tool to illuminate their discipline than other journals, that a large proportion of papers do not go down the path of estimation is understandable.

5.15
However, there is a second, technical reason that could be advanced. Suppose that we are wrong, and that JASSS authors predominantly would like to use estimation techniques to assess the artificial data their models produce and they are well versed in 'standard' approaches, then they would quickly discover that estimation of ABMs can be present some unique challenges. Two features of a large number of ABMs cause trouble -first, the normally large number of parameters, and second, the existence of non-linear dynamics (often the reason for choosing ABM techniques in the first place). Together, these attributes cause enormous difficulty for estimation. There is some 'hope' on the horizon, however: very recently Grazzini andRichiardi (2014, 2015) have begun to provide some credible options for the estimation of ergodic and non-ergodic ABM time-series in the presence of these attributes. Whilst Grazzini's earlier, but still relatively recent (2012) study in JASSS should be highlighted again to the ABM social scientists community as it enables an author to identify whether their ABM data exhibits stationarity and ergodicity in the first place (crucial questions for choice of estimation technique).
5.16 Hence, Conclusion 3 should not be seen as all that surprising, and should be read more as a confirmation of the state of ABM social science in general. However, with the efforts now being expended on this problem (see especially the companion work of Lee et al. 2015), if a similar paucity of equation-based estimation analysis of ABM outputs was discovered over the next decade, different conclusions (and prescriptions) would need to be made.
5.17 Before drawing final conclusions we return to the most obvious limitation of our study: the use only of JASSS ABM papers for our analysis. We acknowledge several problems here. First, selection bias: by confining ourselves to JASSS we have no vision of the quality of ABM studies published elsewhere: papers submitted to JASSS could be of higher or lower quality than the 'field median'; or papers could express an unusual mix of social science, artificial life, and ABM methodologies, impacting the way that they are presented towards a bespoke 'JASSS' style. Whilst such biases (and others like them) are important, they do not, in our opinion, diminish the responsibility of the JASSS towards building up its publication standards, JASSS can (and does) play a critical educational role in the preparation, presentation and prosecution of ABM science, a point we return to below.
5.18 Second, editorial flux: it is possible that during our study period, editorial policies weakened or tightened, or followed some other functional form, perhaps to pursue other (reasonable) motivations such as expanding the 'reach' or 'inclusiveness' of the JASSS community. Indeed, JASSS itself published current JASSS Editor Squazzoni and Casnici's (2013) study which advocated specific editorial policy prescriptions for JASSS to better serve its aims such as keeping a closer watch on JASSS's quantifiable inter-disciplinary impact and potentially targeting specific, un-tapped, domains through special issues. Whilst it is hard to obtain a measure of such policy dynamics, our study's main conclusions encourage adoption of minimum practices, generic to all ABM papers, regardless of field or specific editorial focus, hence, we again would submit that any nuanced editorial movements apply at a layer above the one to which we are studying. In any case, the Editorial oversight of JASSS during this period was as stable as one could hope for any journal: a single (foundational) editor oversaw the first 17 years of the journal's life, generously overlapping with our study period. The editor, as founder, built up the journal from the ground, and has rightly received enormous gratitude from the social simulation community for their contributions to JASSS's success (Elsenbroich & Badham 2015).

5.19
One further limitation is worth making clear. We acknowledge that the overall clarity of a paper's scientific contribution may or may not rest on the formalism of its presentation style. In this work, we have merely tabulated the trends in the ABM publishing community, expressed through JASSS. What we cannot conclude is whether the papers which adopt one or other formalism in the communication of methods and results are actually any clearer than those which do not. The answer to such a question would require an altogether different methodology, presumably involving trained human readers and/or replication attempts of the science that a given paper presents. We see such considerations as a natural extension of this work and would encourage others to imagine creative experimental designs which could identify the 'best' communication formalism of methods and results for ABM works.
5.20 In one sense, our study concludes without a particularly novel contribution: we appear merely to have found new ways to catalogue the anarchy of ABM social sciences publication practices, or alternatively, the lack of uptake of the various proposals to 'order' the realm. On this question, it would seem that the ABM social sciences community has a choice: it can go on permitting the 'anarchy', choosing to apply laissez-faire publication practice standards to the communication of methodologies and results, or, it can attempt to apply order by enforcing one or more standards.

5.21
Affinity or dissatisfaction with 'anarchy' will likely turn on one's stance towards the merits of diversity. On the one hand, proponents of anarchic publishing practices could point to the benefits of enhanced academic freedom, creative expression, and the notion of fitting the authors' personal sense of the 'right' methodological or results object presentation to the scientific claim at hand, with little reference to norms or standards. As mentioned above in Conclusion 1, given the diversity of generating fields (each with their own field-norms) who seek to publish their ABM works in JASSS, there is no need to encourage such diversity, it will arise organically due to each article's provence. Furthermore, who is to say that one author's experimentation or innovation in presenting their ABM method or results will not be self-evidently brilliant, and so receive wider adoption amongst ABM authors via imitation for the benefit of all? There are indeed potential merits to anarchy. That said, let us advance an alternative perspective on the merits of anarchy. If our reading of JASSS is right, that it is a platform of choice for many first-time ABM authors from the social sciences, then JASSS must be seen as far more than simply a 'social sciences simulation publication', JASSS is also a powerful educational tool: the articles in JASSS will implicitly form a corpus of ideas, methods and practices to learn from, extend, and ultimately imitate. This matters because if the ABM methodology is to make progress outside of simulation-specific journals like JASSS and into the 'main-stream' top field journals of our respective disciplines, then JASSS can play an important role in developing (enforcing) the best standards of the ABM discipline 'in house' before authors take their ideas to editors and referees for whom ABM will (still) be an entirely new methodological approach. If, over time, social scientists are made to develop standard approaches, then that standard will be learned and understood by field editors and referees, annulling an easy (rejection) charge that ABM papers lack intelligibility or transparency based on the communication of the methodology or results alone. In summary, there could be strong long-run benefits to finding (and continuously refining) social science ABM's formal voice on the presentation of methods and results.
5.22 But our study has done more than confirm this anarchy, it has identified a basic problem with a vast number of ABM results published in JASSS over a decade: the quality of presented results. This situation cannot continue. It is a matter of fundamental scientific practice and reproducibility -a supposedly cherished feature of ABM science. We stress that we are not suggesting that works studied in our survey do not have good scientific points to make, these works have all passed the peer-review process and as such must convey important scientific findings. However, the ABM community cannot on the one hand grumble about the slow uptake of ABM science in top field journals, whilst on the other, fail at practicing basic scientific hygiene when it comes to presenting its results. Again, viewed through an educational lens, JASSS has a real opportunity, by its instructions to referees, and its submission requirements, to set minimum standards for replication and results presentation.

5.23
We look forward to contributing to the further development and enforcement of best-practice standards, and call on the social sciences ABM community to do the same.