1 Overview

In the first chapter, we will try to distinguish what is (and what is not) research. We will continue with a bird’s-eye view of the whole research process.

1.1 Research and Science

Let us start with a few definitions of the word research:

“a systematic investigation to find answers to a problem”

– Burns (2000)

“creative and systematic work undertaken in order to increase the stock of knowledge”

– OECD (2015)

“a detailed study of a subject, especially in order to discover (new) information or reach a (new) understanding”

– Cambridge Dictionary

Therefore, research is an activity that:

studies something worth studying (relevance),
uses a systematic approach (soundness),
and adds new information to the existing body of knowledge (novelty).

A related term, science, is usually defined more narrowly:

“any system of knowledge that is concerned with the physical world and its phenomena and that entails unbiased observations and systematic experimentation”

– Encyclopedia Britannica

“the systematic study of the structure and behavior of the physical and natural world through observation, experimentation, and the testing of theories against the evidence obtained”

– Oxford Dictionary

We can see that science concerns a natural, externally observable world. Parts of this world are systematically observed or manipulated, and, based on the results, new knowledge is produced. Such an approach is called empirical, from Greek empeiría (experience), where evidence is formed through senses, using measurement and experimentation.

According to the aforementioned definitions, mathematics is not a science since mathematicians do not observe the external world, and they manipulate only abstract entities they devised. Humanities, such as history, literature, or art, are usually not considered sciences either.

1.2 Computer Science

A question now arises: Is computer science a science? According to Denning (2005), yes – as it meets every criterion. We can devise testable hypotheses, observe the external behavior of computers or the people using them, and construct theories based on these observations.

Abelson (1986) thinks the opposite, as he noted in the opening lecture of the Structure and Interpretation of Computer Programs course:

“It might be engineering or it might be art, but we’ll actually see that computer so-called science actually has a lot in common with magic, and we’ll see that in this course. So it’s not a science. It’s also not really very much about computers. And it’s not about computers in the same sense that physics is not really about particle accelerators, and biology is not really about microscopes and Petri dishes.”

Note, however, that this view originates in the era when empirical studies, such as experiments with human participants or thorough evaluations using computer benchmarks, were rarer than today. Many research papers of that time described a new idea that the researcher got, along with some implementation details, neglecting evaluation. As one of the voices for a change, a paper advocating evidence-based software engineering as an analogy to evidence-based medicine (Kitchenham et al., 2004) was published at the beginning of this century.

1.3 Computer Science Subfields

It is difficult to talk about research methods in computer science in general. Computer science is a diverse field, encompassing a large variety of subfields (e.g., embedded systems, cybersecurity, high-performance computing, operating systems, databases, or virtual reality). There have been multiple attempts to divide computer science research into categories, including the Association for Computing Machinery (ACM) Special Interest Groups (SIGs), the ACM Computing Classification System (CCS), and the arXiv category taxonomy (expand the “Computer Science” item). Although the areas certainly overlap to some degree, a majority of researchers specialize in a few narrowly defined topics pertaining to one computer science subfield.

Each subfield has its own typical strategies of advancing knowledge. On one side, there is theoretical computer science. Here, the researcher often starts by defining a theorem, which is split into smaller lemmas. Each lemma is mathematically proved, which finally results in the proof of the main theorem. The research process is thus similar to mathematics as the researchers operate only with abstract notations. However, the proved properties of algorithms, e.g., time complexity, are sometimes empirically verified by executing the program on a computer and measuring the necessary properties.

Human-computer interaction, on the other hand, is people-centric. Traditionally, human participants are observed interacting with a specific hardware or software user interface, and selected quantities are measured.

Some subfields themselves use plenty of different research methods. For example, software engineering papers often describe a new approach designed by the authors, which is subsequently evaluated empirically using simulation, case studies, surveys and interviews, experiments, etc.

Other areas lean toward a commonly used research method. In machine learning, for example, standardized measures such as accuracy or F₁ score are often computed on a popular benchmark for the given problem. This facilitates comparison between multiple approaches but can easily divert attention from problems arising in real-world usage contexts.

The main research area of the author of this text is software engineering, occasionally intersected with human-computer interaction. Despite trying to be impartial, this text may sometimes be a bit biased toward these two fields.

1.4 Interdisciplinarity

Just as the border between computer science subfields is blurry, so is often the border of computer science as a whole. Many research projects span multiple branches of science. The knowledge from multiple fields is combined, and new knowledge, related to one or more of these fields, is produced. This is called an interdisciplinary approach.

Interdisciplinary research can bring new insights to real-world problems that would be otherwise difficult to solve. On the other hand, the problem of research shallowness often particularly affects interdisciplinary research. Consider an application of an off-the-shelf machine learning algorithm to weather prediction. Without an expert in meteorology being a part of the research team, there is a risk of misinterpretation of the results and the neglection of methodological or practical issues. Furthermore, while it may be a research contribution to the field of meteorology, we could hardly consider it a contribution to the field of computer science.

1.5 Research Process Overview

Research is an unpredictable activity to a high degree. Despite this, in Figure 1.1, there is an approximation of the journey a computer science researcher can typically take to finish one research project. Be aware that we could add an arrow from almost any activity to any previous activity.

Figure 1.1: An overview of the research process

At the beginning, we must have an idea of what the project will aim to accomplish and why this is relevant for other researchers or the general public. Then we start gathering and studying relevant literature. Throughout the rest of the research process, we will get a better understanding of the topic, so we should return to this step regularly.

Although some researchers leave the writing of the paper as one of the last steps, it is beneficial to start writing as soon as we have a general idea and a few relevant research papers. Putting your ideas into writing helps you clarify your thinking, find inconsistencies soon, and prevent forgetting. Writing should ideally occur after each of the next steps, until sending the paper for review.

The following steps are variable and differ between projects. It is possible to create a software or hardware artifact based on a new approach we designed and described in the paper, e.g., an implementation of an algorithm, a new editor plugin, or a novel pointing device. We suppose the designed approach (and thus the artifact that implements it) has some properties (it is more efficient, more usable, etc.), based on which we pose research questions or hypotheses. Alternatively, we can skip the artifact creation and do purely empirical research instead, by asking research questions or posing hypotheses about currently existing artifacts, people, or processes.

After carefully planning the research methods corresponding to the research questions, we collect and process the data and answer these questions based on the data.

It is possible that the purpose of selected research questions was to find shortcomings of existing approaches or our newly-designed artifacts. If this is the case, we can continue by designing a better artifact and repeating a part of the process.

After the paper is finished, we send it for peer review to an appropriate journal or conference, where about three experts will assess its suitability for publication. If the paper is not rejected, we incorporate the reviewers’ comments into it, and, depending on the type of the publication, send it again for review and repeat until it is accepted.

The paper is then published in the given journal or conference proceedings. This is usually considered the final step, but we should remember to spread the knowledge further, e.g., summarize it in a blog post or archive it on our personal website if it is not freely accessible from the publisher.

In the following chapters, we will discuss the individual steps of the research process, starting with the formulation of ideas and searching for relevant literature.

Exercises

Suppose you invented a new algorithm to find the shortest path between two vertices in a graph. Is this research? Is it science? Why or why not?
Consider this extreme example: You created a program appending the letter “a” to every 15th occurrence of the letter “z”. The letters and the occurrence number are not configurable, and your algorithm uses hacks relying on these constants. Based on your measurements, the program is faster, more memory efficient, and more user-friendly than any other program doing it. Should this be called research?
According to you, which of the following documents describe research and why?
1. Fatal Abstraction (Steimann, 2018)
2. Business Calendar
3. Fluorescence SpectraViewer
4. Bringing Rich Experiences to Memory-Constrained TV Devices
5. DDD — A Free Graphical Front-End for UNIX Debuggers (Zeller & Lütkehaus, 1996)
6. On the Dichotomy of Debugging Behavior Among Programmers (Beller et al., 2018)
Compare a few papers published at the leading software engineering conference, ICSE, in 1982 (Boehm et al., 1982; Futatsugi & Okada, 1982; Nakamoto et al., 1982) and 2024 (Carvalho et al., 2024; Choudhuri et al., 2024; Huang et al., 2024). How do they differ?
What computer science subfield(s) do you consider your specialty? For inspiration, you can look at the illustrated Map of Computer Science.
Find an example of an interdisciplinary research paper with the experts in two or more domains as co-authors and providing, in your opinion, a valuable contribution to some research field(s). Find another paper where you think the expertise from one field was lacking.
Which parts of the research process do you consider the most difficult and why?

Abelson, H. (1986). Lecture 1A: Overview and introduction to lisp [lecture transcript]. MIT OpenCourseWare 6.001 Structure and Interpretation of Computer Programs. https://ocw.mit.edu/courses/6-001-structure-and-interpretation-of-computer-programs-spring-2005/resources/1a-overview-and-introduction-to-lisp/

Beller, M., Spruit, N., Spinellis, D., & Zaidman, A. (2018). On the dichotomy of debugging behavior among programmers. Proceedings of the 40th International Conference on Software Engineering, 572–583. https://doi.org/10.1145/3180155.3180175

Boehm, B. W., Elwell, J. F., Pyster, A. B., Stuckle, E. D., & Williams, R. D. (1982). The TRW software productivity system. Proceedings of the 6th International Conference on Software Engineering, 148–156. https://dl.acm.org/doi/10.5555/800254.807757

Burns, R. B. (2000). Introduction to research methods (4th ed.). SAGE Publications.

Carvalho, L., Degiovanni, R., Cordy, M., Aguirre, N., Le Traon, Y., & Papadakis, M. (2024). SpecBCFuzz: Fuzzing LTL solvers with boundary conditions. Proceedings of the IEEE/ACM 46th International Conference on Software Engineering. https://doi.org/10.1145/3597503.3639087

Choudhuri, R., Liu, D., Steinmacher, I., Gerosa, M., & Sarma, A. (2024). How far are we? The triumphs and trials of generative AI in learning software engineering. Proceedings of the IEEE/ACM 46th International Conference on Software Engineering. https://doi.org/10.1145/3597503.3639201

Denning, P. J. (2005). Is computer science science? Commun. ACM, 48(4), 27–31. https://doi.org/10.1145/1053291.1053309

Futatsugi, K., & Okada, K. (1982). A hierarchical structuring method for functional software systems. Proceedings of the 6th International Conference on Software Engineering, 393–402. https://dl.acm.org/doi/10.5555/800254.807782

Huang, Y., Wang, J., Liu, Z., Wang, Y., Wang, S., Chen, C., Hu, Y., & Wang, Q. (2024). CrashTranslator: Automatically reproducing mobile application crashes directly from stack trace. Proceedings of the IEEE/ACM 46th International Conference on Software Engineering. https://doi.org/10.1145/3597503.3623298

Kitchenham, B. A., Dyba, T., & Jorgensen, M. (2004). Evidence-based software engineering. Software Engineering, 2004. ICSE 2004. Proceedings. 26th International Conference on, 273–281. https://doi.org/10.1109/ICSE.2004.1317449

Nakamoto, Y., Iwamoto, T., Hori, M., Hagihara, K., & Tokura, N. (1982). An editor for documentation in π-system to support software development and maintenance. Proceedings of the 6th International Conference on Software Engineering, 330–339. https://dl.acm.org/doi/10.5555/800254.807775

OECD. (2015). Frascati manual 2015: Guidelines for collecting and reporting data on research and experimental development (p. 398). OECD Publishing. https://doi.org/10.1787/9789264239012-en

Steimann, F. (2018). Fatal abstraction. Proceedings of the 2018 ACM SIGPLAN International Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software, 125–130. https://doi.org/10.1145/3276954.3276966

Zeller, A., & Lütkehaus, D. (1996). DDD—a free graphical front-end for UNIX debuggers. SIGPLAN Not., 31(1), 22–27. https://doi.org/10.1145/249094.249108