skip to main content
10.1145/3183440.3183475acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
demonstration

Perceval: software project data at your will

Published:27 May 2018Publication History

ABSTRACT

Software development projects, in particular open source ones, heavily rely on the use of tools to support, coordinate and promote development activities. Despite their paramount value, they contribute to fragment the project data, thus challenging practitioners and researchers willing to derive insightful analytics about software projects. In this demo we present Perceval, a loyal helper able to perform automatic and incremental data gathering from almost any tool related with contributing to open source development, among others, source code management, issue tracking systems, mailing lists, forums, and social media. Perceval is an industry strong free software tool that has been widely used in Bitergia, a company devoted to offer commercial software analytics of software projects. It hides the technical complexities related to data acquisition and eases the definition of analytics. A video showcasing the main features of Perceval can be found at https://youtu.be/eH1sYF0Hdc8.

References

  1. Christian Bird, Peter C Rigby, Earl T Barr, David J Hamilton, Daniel M German, and Prem Devanbu. 2009. The promises and perils of mining git. In MSR. 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Casey Casalnuovo, Yagnik Suchak, Baishakhi Ray, and Cindy Rubio-González. 2017. GitcProc: a tool for processing and classifying GitHub commits. In ISSTA. 396--399. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Maëlick Claes, Mika Mäntylä, Miikka Kuutila, and Bram Adams. 2017. Abnormal Working Hours: Effect of Rapid Releases and Implications to Work Content. In MSR '17. IEEE Press, 243--247. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Valerio Cosentino, Javier Luis Cánovas Izquierdo, and Jordi Cabot. 2015. Gitana: a SQL-based Git Repository Inspector. In ER. 329--343.Google ScholarGoogle Scholar
  5. Laura Dabbish, Colleen Stuart, Jason Tsay, and Jim Herbsleb. 2012. Social coding in GitHub: transparency and collaboration in an open software repository. In CSCW. 1277--1286. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Premkumar Devanbu, Pallavi Kudigrama, Cindy Rubio-González, and Bogdan Vasilescu. 2017. Timezone and Time-of-day Variance in GitHub Teams: An Empirical Method and Study. In SWAN '17. ACM, 19--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Robert Dyer, Hoan Anh Nguyen, Hridesh Rajan, and Tien N Nguyen. 2013. Boa: A language and infrastructure for analyzing ultra-large-scale software repositories. In MSR. 422--431. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Michael Fischer, Martin Pinzger, and Harald Gall. 2003. Populating a release history database from version control and bug tracking systems. In ICSM 2003. 23--32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Clinton Gormley and Zachary Tong. 2015. Elasticsearch: The Definitive Guide: A Distributed Real-Time Search and Analytics Engine. "O'Reilly Media, Inc.". Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Georgios Gousios and Diomidis Spinellis. 2012. GHTorrent: GitHub's data from a firehose. In MSR 2012. 12--21. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Dick Hardt. 2012. The OAuth 2.0 authorization framework. (2012).Google ScholarGoogle Scholar
  12. Hadi Hemmati, Sarah Nadi, Olga Baysal, Oleksii Kononenko, Wei Wang, Reid Holmes, and Michael W Godfrey. 2013. The MSR cookbook: Mining a decade of research. In MSR. 343--352. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Israel Herraiz, Daniel Izquierdo-Cortazar, and Francisco Rivas-Hernández. 2009. Flossmetrics: Free/libre/open source software metrics. In CSMR'09. 281--284. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Truong Ho-Quang, Regina Hebig, Gregorio Robles, Michel RV Chaudron, and Miguel Angel Fernandez. 2017. Practices and perceptions of UML use in open source projects. In ICSE SEIP. IEEE Press, 203--212. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. James Howison, Megan Conklin, and Kevin Crowston. 2006. FLOSSmole: A collaborative repository for FLOSS research data and analyses. Intl Journal of Information Technology and Web Engineering 1, 3 (2006), 17--26.Google ScholarGoogle ScholarCross RefCross Ref
  16. Eirini Kalliamvakou, Georgios Gousios, Kelly Blincoe, Leif Singer, Daniel M German, and Daniela Damian. 2016. An in-depth study of the promises and perils of mining GitHub. Empirical Software Engineering 21, 5 (2016), 2035--2071. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Thomas Kluyver, Benjamin Ragan-Kelley, Fernando Pérez, Brian E Granger, Matthias Bussonnier, Jonathan Frederic, Kyle Kelley, Jessica B Hamrick, Jason Grout, Sylvain Corlay, and others. 2016. Jupyter Notebooks-a publishing format for reproducible computational workflows.. In ELPUB. 87--90.Google ScholarGoogle Scholar
  18. Filippo Lanubile, Christof Ebert, Rafael Prikladnicki, and Aurora Vizcaíno. 2010. Collaboration tools for global software engineering. IEEE software 27, 2 (2010). Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Wes McKinney and others. 2010. Data structures for statistical computing in python. In SciPy, Vol. 445. 51--56.Google ScholarGoogle Scholar
  20. Gregorio Robles, Jesus M González-Barahona, Daniel Izquierdo-Cortazar, and Israel Herraiz. 2009. Tools for the study of the usual data sources found in libre software projects. Intl J of Open Source Software and Processes 1, 1 (2009), 24--45.Google ScholarGoogle ScholarCross RefCross Ref
  21. Gregorio Robles, Truong Ho-Quang, Regina Hebig, Michel RV Chaudron, and Miguel Angel Fernandez. 2017. An extensive dataset of UML models in GitHub. In MSR'17. IEEE Press, 519--522. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Gregorio Robles, Stefan Koch, and Jesús M González-Barahona. 2004. Remote analysis and measurement of libre software systems by means of the CVSAnalY tool. In 2nd Workshop Remote Analysis and Measurement of Softw Systems. 51--56.Google ScholarGoogle ScholarCross RefCross Ref
  23. Margaret-Anne Storey, Christoph Treude, Arie van Deursen, and Li-Te Cheng. 2010. The impact of social media on software engineering practices and tools. In ESE. ACM, 359--364. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. R Core Team. 2000. R language definition. (2000).Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    ICSE '18: Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings
    May 2018
    231 pages
    ISBN:9781450356633
    DOI:10.1145/3183440
    • Conference Chair:
    • Michel Chaudron,
    • General Chair:
    • Ivica Crnkovic,
    • Program Chairs:
    • Marsha Chechik,
    • Mark Harman

    Copyright © 2018 Owner/Author

    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 27 May 2018

    Check for updates

    Qualifiers

    • demonstration

    Acceptance Rates

    Overall Acceptance Rate276of1,856submissions,15%

    Upcoming Conference

    ICSE 2025

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader