How Will Autonomous Vehicles Decide in Case of an Accident? An Interval Type-2 Fuzzy Best–Worst Method for Weighting the Criteria from Moral Values Point of View

: The number of studies on Autonomous Vehicle (AV) ethics discussing decision-making algorithms has increased rapidly, especially since 2017. Many of these studies handle AV ethics through the eye of the trolley problem regarding various moral values, regulations, and matters of law. However, the literature of this ﬁeld lacks an approach to weighting and prioritizing necessary parameters that need to be considered while making a moral decision to provide insights about AVs’ decision-making algorithms and related legislations as far as we know. This paper bridges the gap in the literature and prioritizes some main criteria indicated by the literature by employing the best–worst method in interval type-2 fuzzy sets based on the evaluations of ﬁve experts from different disciplines of philosophy, philosophy of law, and transportation. The criteria included in the weighting were selected according to expert opinions and to the qualitative analysis carried out by coding past studies. The weighing process includes a comparison of four different approaches to the best–worst method. The paper’s ﬁndings reveal that social status is the most important criterion, while gender is the least important one. This paper is expected to provide valuable practical insights for Autonomous Vehicle (AV) software developers in addition to its theoretical contribution.


Introduction
Throughout history, technological advances have had a major impact on modern society. Groundbreaking innovations in areas that directly affect the personal life of everyone, such as transportation, have completely changed the cities we live in and how we live. Automobiles, since their invention in the early 20th century, have become the most popular mode of transport, thus dramatically changing our lives. The results of the invention of automobiles included increased mobility for individuals, new jobs arising to supply the demand, and legislative requirements (e.g., safety standards, highway rules, and driver's licenses). The next big mobility-related innovation, which is assumed to have a similar impact, is AV technology. AVs are expected to ease everyday life with their promising features and benefits (e.g., self-driving, improved mobility for non-drivers, car/ride sharing, improved safety, reduced congestion, etc.).
As the number of people who benefit from the changes they bring increases, innovations have to go through a shorter period of time before they are publicly accepted (e.g., mobile/smartphones). End users of these enhancements change their habits for individual reasons, but the massive transitions bring out critical environmental, economic, and ethical externalities. As evidenced by all previous similar examples, the upcoming impacts of such a change in our transportation habits are undeniable. Therefore, it has become necessary to investigate these impacts, foresee the outcomes, and study the required changes to minimize the disadvantages of the inevitable transition. prioritization religiously. Subsequent detailed research has revealed that relevant questions can be answered from a utilitarian ethical perspective.
The goal of this study is to determine the necessary parameters that need to be considered while making a moral decision and weigh them to provide insights into AVs' decision-making algorithms and the related legislation. In this context, the best-worst method, which is a relatively new multiple-decision-making method and suitable for the structure of this problem, has been decided. This is a multi-criteria decision-making (MCMD) method developed by Rezaei [16] and applied to many different engineering, social, and natural science problems. It is a method that works on the principle of pairwise comparison and is fed by the pairwise evaluations of experts. It is superior to other pairwise comparison methods, such as Analytic Hierarchy Process (AHP), in terms of both reducing the number of pairwise comparisons and providing more consistent matrices. Since it better reflects the structure of the uncertainty environment, can be used in the scarcity of precise data, and is an important extension of the fuzzy set theory, interval type-2 fuzzy BWM has been used for this problem. This use is the first in solving such an extreme problem as how the moral behavior of such transportation-based and autonomous vehicles will be formed.
The remainder of this paper is organized as follows: the knowledge gap that this study aims to fill is discussed in the next section under literature review; methodology is explained in Section 3; the application of the method to the subject is covered in Section 4; and in the final section a conclusion and the discussion of this paper is given.

Literature Review
This section includes the determination of the knowledge gap and the review of related past studies.

Knowledge Gap
AVs are expected to shape future transportation in many ways. With the appearance of AVs on the roads, certain decisions that may ethically compel people should be made in the face of traffic problems such as accidents. Such decisions will be taken in line with the directions of the software design of AVs. Therefore, ethical rules and principles that are entered into the software should be determined meticulously from the very beginning ensuring consensus among society and the best decision can be made. To find past studies and a gap in the literature, an advanced search query (TITLE-ABS-KEY ("religion" OR "ethics" OR "legislation") AND TITLE-ABS-KEY ("autonomous vehicles")) was conducted in the Scopus database, and 296 publications were identified. Then, the document, including these publications, was uploaded to the VOSviewer tool, and author keyword analysis, which is used for identifying the most used author keywords and knowledge gaps in the literature, was performed. The author keyword analysis results are presented in Figure 1. The size of the circles in Figure 1 reflects the frequency of the keywords (the bigger circles indicate the more frequently used keywords). After that, we researched all the keywords in detail and recognized that ethical dilemmas about whether to sacrifice one person to save more and the role of demographic features in such decisions have been discussed for a long time in the context of the trolley problem, which is shown in the cluster colored red.
Then, to identify a research gap in the research area, the "trolley problem" and "trolley dilemma" keywords were included in the first search query, and a new search was conducted. After the second query, we identified 29 publications. Six irrelevant publications were eliminated. Then, we analyzed the remaining 23 publications in detail and clarified that none of the past studies have focused on weighting necessary parameters to create a formulation of prioritizing the criteria for AV ethics, thus providing insights about AVs' decision-making algorithms and the related legislation. Then, to identify a research gap in the research area, the "trolley problem" and "trolley dilemma" keywords were included in the first search query, and a new search was conducted. After the second query, we identified 29 publications. Six irrelevant publications were eliminated. Then, we analyzed the remaining 23 publications in detail and clarified that none of the past studies have focused on weighting necessary parameters to create a formulation of prioritizing the criteria for AV ethics, thus providing insights about AVs' decision-making algorithms and the related legislation.

Past Studies on AV Ethics
Since there have not been any studies focusing on weighing moral decision parameters, the literature has covered a wide range. AV ethics have been intensively studied through the eye of the trolley problem since 2017. The first group of studies on this subject discusses the trolley problem concerning its nature [17], how it should be handled [18], and identifying awareness about it [19]. The second group of studies in the literature reveals the limitations of the trolley problem in respect of its voting-based structure [20,21], not embracing infinite responsibility to others [22], the self-sacrifice option, and cultural differences [23]. The third group consists of publications focusing on the ma ers of law encompassing the liability [24] and responsibility [25] of AV developers, and AV actions' compliance with the law [26,27].
The remaining part of the literature includes publications on various topics: changeable moral judgments based on the accident severity and probability [28], survival probability estimation [29], risk assessment [30], general social welfare function [31], changeable degree of pragmatism in ethical decision making [32], AV technologies' necessary capabilities during a collision [33], religion-based AV ethics [34], comparing of Kantian rationale and utilitarian ethic [35], and a wider range of criteria than the criteria handled by trolley problem [36,37]. Table 1 shows detailed explanations of the publications in the literature.

Past Studies on AV Ethics
Since there have not been any studies focusing on weighing moral decision parameters, the literature has covered a wide range. AV ethics have been intensively studied through the eye of the trolley problem since 2017. The first group of studies on this subject discusses the trolley problem concerning its nature [17], how it should be handled [18], and identifying awareness about it [19]. The second group of studies in the literature reveals the limitations of the trolley problem in respect of its voting-based structure [20,21], not embracing infinite responsibility to others [22], the self-sacrifice option, and cultural differences [23]. The third group consists of publications focusing on the matters of law encompassing the liability [24] and responsibility [25] of AV developers, and AV actions' compliance with the law [26,27].
The remaining part of the literature includes publications on various topics: changeable moral judgments based on the accident severity and probability [28], survival probability estimation [29], risk assessment [30], general social welfare function [31], changeable degree of pragmatism in ethical decision making [32], AV technologies' necessary capabilities during a collision [33], religion-based AV ethics [34], comparing of Kantian rationale and utilitarian ethic [35], and a wider range of criteria than the criteria handled by trolley problem [36,37]. Table 1 shows detailed explanations of the publications in the literature.

Past Studies on IT2F-BWM
F-BWM was applied to measure the risk factors of FMEA by Tian et al. [40]. For healthcare, Mou et al. [41] developed the intuitionistic fuzzy multiplicative BWM. Omrani et al. [42] used fuzzy BWM to find the optimum power plant alternative. Wu et al. [38] used centroids to develop an integrated model of IT2Fs and BWM and worked on green supplier selection problems. Mi et al. [43] presented a comprehensive survey and a detailed review of BWM, which can be studied by intrigued scholars as a source of inspiration for future BWM research.

Applied Methodology
The applied methodology through the study consists of three stages. In the first stage, frequency analysis was carried out in a qualitative way from the studies in the literature to determine the criteria that AVs will have to consider when deciding. The output of this stage is the final determination of the criteria set that will be considered and whose importance weights will be calculated. In the second stage, the importance weights of the criteria determined in the first step are calculated with an interval type-2 fuzzy BWM. In the last stage, the results obtained with IT2F-BWM were compared with the results obtained from solving the problem with different BWM versions (traditional BWM, fuzzy BWM, and Bayesian BWM). Figure 2 shows the flow chart of this applied methodology.

Frequency Analysis of the Criteria via MAXQDA
The first step of the applied methodology is a qualitative frequency analysis of the criteria via MAXQDA. It enables researchers to employ several different analyses, such as interview transcription and analysis, literature review and analysis, mixed methods, content analysis, and questionnaire analysis. The current study aims to reveal the criteria that AVs will consider in the decision-making process by extracting a number of studies on the subject in the literature.

IT2F-BWM
Best-Worst Method (BWM) was developed by Rezaei [16], utilizing pairwise comparisons of alternatives and criteria. Since it requires fewer data and results in more reliability, the best-worst criterion is widely used as two vectors [44]. For various application areas, a fuzzy version of the BWM is introduced. Hafezalkotob and Hafezalkotob [45], Guo and Zhao [46], and Moslem et al. [47] proposed an application of the triangular fuzzy BWM. In this section, the stages of the IT2F-BWM utilizing the center-of-area approach will be presented.
Step 1. A set of decision criteria ( , , … , ), which is used to calculate the importance weights, is determined.

Frequency Analysis of the Criteria via MAXQDA
The first step of the applied methodology is a qualitative frequency analysis of the criteria via MAXQDA. It enables researchers to employ several different analyses, such as interview transcription and analysis, literature review and analysis, mixed methods, content analysis, and questionnaire analysis. The current study aims to reveal the criteria that AVs will consider in the decision-making process by extracting a number of studies on the subject in the literature.

IT2F-BWM
Best-Worst Method (BWM) was developed by Rezaei [16], utilizing pairwise comparisons of alternatives and criteria. Since it requires fewer data and results in more reliability, the best-worst criterion is widely used as two vectors [44]. For various application areas, a fuzzy version of the BWM is introduced. Hafezalkotob and Hafezalkotob [45], Guo and Zhao [46], and Moslem et al. [47] proposed an application of the triangular fuzzy BWM. In this section, the stages of the IT2F-BWM utilizing the center-of-area approach will be presented. Step 1. A set of decision criteria (c 1 , c 1 , . . . , c n ), which is used to calculate the importance weights, is determined.
Step 2. The decision maker's preference is used to determine the best and the worst criterion.
Step 3. IT2FSs are used to establish the preference for the best criterion and the worst criterion over all the other criteria. The first outcome, the best-to-others (BtO) vector, would be where e Bj demonstrates the preference of the best criterion B over criterion j. It is definite that e BB = ((1; 1; 1; 1; 1; 1), (1; 1; 1; 1; 0.9; 0.9)) In the second outcome of this step, others-to-worst (OtW), the vector would be where a jW demonstrates the preference of the criterion j over the worst criterion W. Again, it is clear that e WW = ((1; 1; 1; 1; 1; 1), (1; 1; 1; 1; 0.9; 0.9)).
Step 4. The optimal weights ( w * 1 , w * 2 , . . . , w * n are determined. The center of the area is used in this step. The frame of the constrained optimization model follows the model presented by Wu et al. [38]. For each pair of w B / w j and w j / w W , the optimal weight for the criteria is the one where the maximum absolute differences w B / w j − E Bj and w j / w W − E jW are minimized. The consistency ratio is calculated using the consistency  Table 3. While creating this, the same method of Rezaei [16] and Wu et al. [38] is utilized. The higher the consistency ratio, the larger the δ*. where, To eliminate w B / w j − E Bj and w j / w W − E jW , the maximum absolute values w B / w j − E Bj and w j / w W − E jW are targeted to minimize. To minimize the absolute gap, the model is then translated to nonlinear programming as δ* = ((δ*; δ*; δ*; δ*; 1; 1), (δ*; δ*; δ*; δ*; 0.9; 0.9)). minδ * s.t.

Comparative Analysis
An indispensable element of MCDM-based studies is testing the validity of the results of the applied approach. It is necessary to compare the applied method with a set of equivalent methods. In this context, criteria weights obtained with IT2F-BWM are compared to show the similarity of the results obtained with other versions and to show that this method is applicable to this problem (as it produces results close to other versions). Although the problem addressed in this article is quite original, determining the closeness of the outputs of IT2F-BWM with other BWM versions will strengthen the reliability of the study.

Application
Autonomous vehicles have to make decisions to protect other alternatives by sacrificing one of the alternatives in the event of an accident. In the context of accidents involving conventional vehicles, this problem has been dealt with by many studies in terms of ethics. MAXQDA, one of the qualitative analysis programs, was used to determine the criteria included in these studies. For this purpose, firstly, after the studies in the literature were loaded into the program, all the studies were subjected to content analysis, and the most common criteria were determined by coding the texts in the studies. Then, the frequency values of the criteria were determined by selecting the documents column from the "Code Matrix Scanner" in the "Visual Tools" menu in the MAXQDA program. The frequency values of the criteria and the studies in which they are included are shown in Figure 3. "Total" in the lower-left corner of the figure shows how often the relevant criteria are included in the total in each paper, while "Total" in the higher-right corner indicates how often all papers contain each criterion. According to the analysis results, age is the most frequently used criterion, being used 35 times. Age is followed by gender, number of persons affected by the accident, social status, compliance of the person to be affected by the accident, and health condition. Criminal history is the least frequently used criterion, being used three times.  In addition, visual maps were created via the MAXQDA program, as seen in Figure  4, in order to more clearly show in which context the determined codes were handled in the past 21 studies shown in Figure 3. According to a couple of quotations shared in the visual maps, some codes of law and regulations do not permit the prioritization of personal a ributes related to criteria such as age, gender, and health condition. However, many researchers and practitioners, especially those studying the trolley problem and moral machine experiments, criticize this prohibitive approach and argue that prioritization can be a necessity. In addition, visual maps were created via the MAXQDA program, as seen in Figure 4, in order to more clearly show in which context the determined codes were handled in the past 21 studies shown in Figure 3. According to a couple of quotations shared in the visual maps, some codes of law and regulations do not permit the prioritization of personal attributes related to criteria such as age, gender, and health condition. However, many researchers and practitioners, especially those studying the trolley problem and moral machine experiments, criticize this prohibitive approach and argue that prioritization can be a necessity. 4, in order to more clearly show in which context the determined codes were handled in the past 21 studies shown in Figure 3. According to a couple of quotations shared in the visual maps, some codes of law and regulations do not permit the prioritization of personal a ributes related to criteria such as age, gender, and health condition. However, many researchers and practitioners, especially those studying the trolley problem and moral machine experiments, criticize this prohibitive approach and argue that prioritization can be a necessity.  The next step in the study, which started with the goal of calculating the weights of the criteria that AVs will have to take into account when deciding, was the determination of the criteria. After a thorough literature research was performed and expert opinions were collected, seven significant criteria were chosen:  The next step in the study, which started with the goal of calculating the weights of the criteria that AVs will have to take into account when deciding, was the determination of the criteria. After a thorough literature research was performed and expert opinions were collected, seven significant criteria were chosen: The moral machine experiment developed at MIT, the results of which were evaluated by Awad et al. [3] and Noothigattu et al. [4], included all the criteria listed above in their questionnaire presented online to participants from all over the world. For instance, the scenarios of the experiment include animations that describe health professionals, children, and elderly people, which are related to social status and age criteria, respectively. Dubljević [7] discussed that the number of persons affected by the accident criteria has the potential to deflect the AV decision-making algorithm in the wrong direction when a large group of nefarious people uses an AV. The discussion demonstrates the necessity of the criminal history criteria.
The second step of the weighting process is selecting the proper experts to evaluate these criteria. Since morality is a collective output of morality and ethics and the discussed dilemmatic scenarios have the potential to bring judicial cases about the usage of AVs, the participants of the evaluation process should specialize in morality, law, philosophy, transportation, or a combination of these subjects. The remaining steps of the study are carried out after selecting five professionals whose expertise is in the related subjects of philosophy, philosophy of law, and transportation. In many MCDM-based studies in the literature, the participation of experts, who are competent, equipped, and experienced in their field, is less than in other statistical analysis-based studies, since in those statisticalbased studies, the power of the study is measured by the questionnaire and the number of participants in the survey. However, the same criterion is not valid for MCDM-based studies, such as our study. Our study applied the interval type 2 fuzzy set version of an MCDM method, which is a solid mathematical optimization model and was first proposed by Rezaei [16], to a social issue that has never been addressed in the literature before. In this respect, by combining this method with interval type-2 fuzzy numbers, it will be possible to weigh the criteria that autonomous vehicles will consider in the decision-making process, and in this way, in the development of software related to this technology, which will come to a more advanced stage in the future, these vehicles will make more rational decisions regarding moral decisions. In other words, the originality of this study lies in the fact that a problem that has not been mentioned in the literature so far will be solved with an advanced version of a model that is accepted and preferred by researchers over its equivalents, such as AHP, FUCOM, etc. (IT2F-BWM).
Then, the optimal interval type-2 fuzzy weights of seven criteria are calculated. The obtained importance weights are as follows: The same calculation is processed for each criterion and five experts. The obtained outcomes of the IT2F-BWM are presented in Table 6. Table 6. The obtained outcomes of the IT2F-BWM. As the final step of the IT2F-BWM, each criterion's pin-sharp importance weights are calculated. The final weights for each expert and the average values are presented in Figure 5. For instance, the importance weights of the seven criteria for Expert 1 are 0.435, 0.145, 0.145, 0.073, 0.036, 0.083, and 0.083, respectively. The most and least significant criterion is determined as social status and gender according to the evaluation of Expert 1, respectively. The calculated weights showed that gender is the least significant criterion for four of the five experts. Additionally, the answers of three of the five experts resulted in compliance of the person to be affected by the accident being the most significant criterion for automated decision making in the possible event of a crash in an AV.
When the average of the weight values obtained from these five experts was taken, the following conclusion emerged. "Social Status" was determined as the most important criterion with a weight value of 0.272. It is followed by the "Compliance of the Person to be Affected by the Accident" criterion with 0.256. The third most important criterion was "Number of Persons Affected by the Accident", with 0.158. "Health Condition", "Age", and "Criminal History" have similar weight values and come after these three criteria in the ranking. The least important criterion is "Gender", with a weight value of 0.052. "Number of Persons Affected by the Accident", with 0.158. "Health Condition", "Age" and "Criminal History" have similar weight values and come after these three criteria i the ranking. The least important criterion is "Gender", with a weight value of 0.052.

Comparison of the IT2F-BWM with Other Best-Worst Methods
The discussed problem in this study, of the importance levels of the criteria that AV will have to take into account when deciding and how decision makers and software de velopers will transfer these criteria to their products with a prioritization, is solved wit an interval type 2 fuzzy-based BWM methodology. However, there is a tendency that it i useful to compare the results by solving the same problem with different BWM options so that the generalization of the results obtained alone is on solid ground. In this contex the problem is solved with three different versions of BWM (traditional BWM, triangula fuzzy number-based BWM, and Bayesian BWM). For all three methods, the question naires applied to the decision-making expert group were renewed depending on the lin guistic scale used, and the questionnaires were re-applied in a reasonable period. Optima values of criteria importance levels were obtained by following the application steps b taking the survey results for all three methods. Traditional BWM is based on Rezaei [16 while fuzzy BWM is employed by Guo and Zhao [46]. Steps in Mohammadi and Rezae [68] have also been followed for Bayesian BWM.
In traditional and fuzzy BWM, models of all decision makers are solved separately and their average values are aggregated. Figures 6 and 7 provide the weights of criteri for each expert. Here, A1 to A7 refers to the criteria descriptions as identified "social sta tus, number of persons affected by the accident, compliance of the person to be affecte by the accident, age, gender, health condition, and criminal history".

Comparison of the IT2F-BWM with Other Best-Worst Methods
The discussed problem in this study, of the importance levels of the criteria that AVs will have to take into account when deciding and how decision makers and software developers will transfer these criteria to their products with a prioritization, is solved with an interval type 2 fuzzy-based BWM methodology. However, there is a tendency that it is useful to compare the results by solving the same problem with different BWM options, so that the generalization of the results obtained alone is on solid ground. In this context, the problem is solved with three different versions of BWM (traditional BWM, triangular fuzzy number-based BWM, and Bayesian BWM). For all three methods, the questionnaires applied to the decision-making expert group were renewed depending on the linguistic scale used, and the questionnaires were re-applied in a reasonable period. Optimal values of criteria importance levels were obtained by following the application steps by taking the survey results for all three methods. Traditional BWM is based on Rezaei [16], while fuzzy BWM is employed by Guo and Zhao [46]. Steps in Mohammadi and Rezaei [68] have also been followed for Bayesian BWM.
In traditional and fuzzy BWM, models of all decision makers are solved separately, and their average values are aggregated. Figures 6 and 7 provide the weights of criteria for each expert. Here, A1 to A7 refers to the criteria descriptions as identified "social status, number of persons affected by the accident, compliance of the person to be affected by the accident, age, gender, health condition, and criminal history". This is unlikely for Bayesian BWM, as this method is already a probabilistic version developed for group decision making and reducing information loss [68]. Bayesian BWM also has a feature called credal ranking which is a graph showing the reliability of the criteria. With the aid of this feature, the importance levels of criteria are determined more sensitively. As can be seen in Figure 8, each point (A1 to A7) indicates the criteria in this problem, while the value written on the arrow between these points indicates the confidence level. A1 cl → A2 indicates that criterion A1 is more significant than A2, along with a confidence level of cl. Therefore, when Figure 8 is examined, it can be easily read at which confidence level each criterion outperforms the other. These graphs also support values in Figure 9 interpreted regarding Bayesian BWM. According to Figure 8, the arrow goes from the A1 criterion to all the other criteria. This means that this criterion is superior to the other five criteria. The confidence level indicating superiority ranges from 0.54 to 1. This indicates that this criterion is the most important. Similarly, no arrows go from the least important criterion, A5, to other nodes. Another important detail is that the confidence level value on the arrow from A1 to A3 has approached almost 0.50 (F 0.54 → B). This value shows that although the importance weights of the two criteria are different, they do not have an obvious advantage over each other. Similar interpretations can be produced for the other six criteria. Credal ranking results facilitate strengthening the produced weight values and the more precise interpretation of obvious superiority/non-superiority situations.  This is unlikely for Bayesian BWM, as this method is already a probabilistic versio developed for group decision making and reducing information loss [68]. Bayesian BW also has a feature called credal ranking which is a graph showing the reliability of th criteria. With the aid of this feature, the importance levels of criteria are determined mo sensitively. As can be seen in Figure 8, each point (A1 to A7) indicates the criteria in th problem, while the value wri en on the arrow between these points indicates the con dence level. 1 → 2 indicates that criterion A1 is more significant than A2, along with  This is unlikely for Bayesian BWM, as this method is already a probabilistic ver developed for group decision making and reducing information loss [68]. Bayesian B also has a feature called credal ranking which is a graph showing the reliability o criteria. With the aid of this feature, the importance levels of criteria are determined m sensitively. As can be seen in Figure 8, each point (A1 to A7) indicates the criteria in ). value shows that although the importance weights of the two criteria are different, do not have an obvious advantage over each other. Similar interpretations can be duced for the other six criteria. Credal ranking results facilitate strengthening the duced weight values and the more precise interpretation of obvious superiority/non periority situations.   value shows that although the importance weights of the two criteria are different, they do not have an obvious advantage over each other. Similar interpretations can be produced for the other six criteria. Credal ranking results facilitate strengthening the produced weight values and the more precise interpretation of obvious superiority/non-superiority situations.    Figure 9 shows the criteria weight results of the comparative study. According to the numerical results obtained, it is seen that the order of priority, which we evaluate depending on the importance weight of the criteria, has not changed in all methods. Although the values of the criteria weights changed, the order of importance was obtained as A1 > A3 > A2 > A6 > A4 > A7 > A8. The correlation between criterion weights is also quite high. Table 7 shows the Pearson correlation coefficients of the criteria weight values obtained from the methods. It is observed that all correlation values are higher than 94.5%. When all the results are evaluated together, it is clearly seen that the current model, like other BWM versions, gives very accurate results for this problem.

Conclusions and Discussion
AI has been gaining attention as the primary decision maker for many driving tasks of AVs. AI needs a formulation based on mathematical algorithms to rationalize the decisionmaking process of AVs. The mathematics behind the algorithm should mainly focus on how to prioritize possible victims of the inevitable accident. The AV literature lacks any approach, to our knowledge, to determine the above parameters and weigh them to make decisions that are as moral as possible.
This paper aims to determine the necessary parameters that need to be considered while making a moral decision and weigh them to create a formulation of prioritizing, thus providing insights about AVs' decision-making algorithms and related legislations using BWM.
The study's findings demonstrate that in the case of a possible collision involving an AV, the most critical determinant of an ethical decision should be "social status". If a person's social status is considered an important indicator of the value he or she will add to society, it is possible to accept that the experts show a propensity for evaluations promoting the common good of society. On the other hand, the paper indicates that the least critical determinant of an ethical decision should be "gender".
The applicability of some criteria in the current technological framework is debatable. For example, for now, it may not be possible for an AV to detect the criminal history of a pedestrian, operate the decision mechanism in seconds and decide in light of the priorities determined in this study. Additionally, an algorithm that covers the criteria identified by the study may harm the privacy of the pedestrian's private life. Despite these current technical impossibilities and concerns, this paper is expected to shed light on the future of the AV research and development process. This paper weighs the necessary parameters based on the evaluations of the experts. Our experts evaluated the criteria through the eyes of utilitarian ethics. In contrast, other experts from Islamic and Christianity philosophies avoided answering survey questions by reminding us that killing a person is strictly forbidden. Considering that religions set inclusive and detailed rules about how people should live and that AVs will be common on roads in the near future, contemporary philosophers of religion and religious scholars should discuss the possibility of finding new answers to the current issue. This paper has specific limitations regarding the number of experts and criteria and the inevitably subjective nature of the surveys. Therefore, the results of the study cannot be generalized to all cases. Accordingly, a larger number of experts and criteria could be included in further studies. In addition, future studies can focus on weighing different categories or levels of each criterion within itself for AV decision making through more objective methods. On the other hand, due to the lack of exact data about weighting parameters when making moral decisions for autonomous vehicles, this paper benefits from expert opinions based on fuzzy sets. The method applied in this paper handles the process of weighting parameters more sensitively than the traditional fuzzy methods by enabling membership degrees to have fuzzy values. In addition, IT2F-BWM results combine the advantages of IT2F and BWM. This paper is expected to provide valuable practical insights for AV software developers in addition to its theoretical contribution.