**
Masanari Kondo **
Kyoto Institute of Technology, Kyoto, Japan.

**
Osamu Mizuno **
Kyoto Institute of Technology, Kyoto, Japan.

**
Eun-Hye Choi **
National Institute of Advanced Industrial Science and Technology (AIST), Ikeda, Osaka, Japan.

DOI https://dx.doi.org/10.33889/IJMEMS.2018.3.2-008

**Abstract**

Software effort estimation is a critical task for successful software development, which is necessary for appropriately managing software task assignment and schedule and consequently producing high quality software. Function Point (FP) metrics are commonly used for software effort estimation. To build a good effort estimation model, independent explanatory variables corresponding to FP metrics are required to avoid a multicollinearity problem. For this reason, previous studies have tackled analyzing correlation relationships between FP metrics. However, previous results on the relationships have some inconsistencies. To obtain evidences for such inconsistent results and achieve more effective effort estimation, we propose a novel analysis, which investigates causal-effect relationships between FP metrics and effort. We use an advanced linear non-Gaussian acyclic model called BayesLiNGAM for our causal-effect analysis, and compare the correlation relationships with the causal-effect relationships between FP metrics. In this paper, we report several new findings including the most effective FP metric for effort estimation investigated by our analysis using two datasets.

**Keywords-** Software effort estimation, Function point (FP) metrics, Causal-effect analysis, Correlation analysis, Linear non-Gaussian acyclic model (LiNGAM), BayesLiNGAM.

**Citation**

Kondo, M., Mizuno, O., & Choi, E. (2018). Causal-Effect Analysis using Bayesian LiNGAM Comparing with Correlation Analysis in Function Point Metrics and Effort. *International Journal of Mathematical, Engineering and Management Sciences*, *3*(2), 90-112. https://dx.doi.org/10.33889/IJMEMS.2018.3.2-008.

**Conflict of Interest**

**Acknowledgements**

**References**

Abran, A., Silva, I., & Primera, L. (2002). Field studies using functional size measurement in building estimation models for software maintenance. Journal of Software Maintenance and Evolution: Research and Practice, 14(1), 31–64.

Ahn, Y., Suh, J., Kim, S., & Kim, H. (2003). The software maintenance project effort estimation model based on function points. Journal of Software Maintenance and Evolution: Research and Practice, 15(2), 71–85.

Albrecht, A. J., & Gaffney, J. E. (1983). Software function, source lines of code, and development effort prediction: a software science validation. IEEE Transactions on Software Engineering, SE-9, (6), 639–648.

Bettenburg, N., Nagappan, M., & Hassan, A. E. (2012). Think locally, act globally: Improving defect and effort prediction models. In Proceedings of the 9th IEEE Working Conference on Mining Software Repositories (MSR) (pp. 60–69).

Briand, L. C., El Emam, K., Surmann, D., Wieczorek, I., & Maxwell, K. D. (1999). An assessment and comparison of common software cost estimation modeling techniques. In Proceedings of the 1999 International Conference on Software Engineering (ICSE) (pp. 313–323).

Efron, B. (1992). Bootstrap methods: another look at the jackknife. In Breakthroughs in Statistics (pp. 569–593). Springer.

Farrar, D. E., & Glauber, R. R. (1967). Multicollinearity in regression analysis: the problem revisited. The Review of Economics and Statistics, 49(1), 92-107.

Green, M. J., Leyland, A. H., Sweeting, H., & Benzeval, M. (2017). Causal effects of transitions to adult roles on early adult smoking and drinking: Evidence from three cohorts. Social Science & Medicine, 187, 193-202

Heinze-Deml, C., Maathuis, M. H., & Meinshausen, N. (2017). Causal structure learning. arXiv preprint arXiv:1706.09141.

Hernán, M. A. (2004). A definition of causal effect for epidemiological research. Journal of Epidemiology & Community Health, 58(4), 265-271.

Holland, P. W., Glymour, C., & Granger, C. (1985). Statistics and causal inference. ETS Research Report Series, 1985(2), 1-72.

Hoyer, P. O., & Hyttinen, A. (2009). Bayesian discovery of linear acyclic causal models. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence (pp. 240–248). AUAI Press.

Jeffery, R., & Stathis, J. (1996). Function point sizing: structure, validity and applicability. Empirical Software Engineering, 1(1), 11–30.

Jeffery, R., Ruhe, M., & Wieczorek, I. (2000). A comparative study of two software development cost modeling techniques using multi-organizational and company-specific data. Information and Software Technology, 42(14), 1009 - 1016.

Jorgensen, M., & Shepperd, M. (2007). A systematic review of software development cost estimation studies. IEEE Transactions on Software Engineering, 33(1), 33-53.

Kitchenham, B. A., Mendes, E., & Travassos, G. H. (2007, May). Cross versus within-company cost estimation studies: A systematic review. IEEE Transactions on Software Engineering, 33(5), 316-329.

Kitchenham, B., & Känsälä, K. (1993). Inter-item correlations among function points. In Proceedings of the 15th International Conference on Software Engineering (ICSE) (pp. 477–480).

Kleinberg, S., & Hripcsak, G. (2011). A review of causal inference for biomedical informatics. Journal of Biomedical Informatics, 44(6), 1102-1112.

Kondo, M., & Mizuno, O. (2016). Analysis on causal-effect relationship in effort metrics using Bayesian LiNGAM. In Proceedings of 2016 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW) (pp. 47–48).

Lokan, C. J. (1999). An empirical study of the correlations between function point elements [software metrics]. In Proceedings of 1999 6th International Software Metrics Symposium (pp. 200–206).

Mendes, E., & Mosley, N. (2001). Comparing effort prediction models for web design and authoring using boxplots. In Proceedings of the 24th Australasian Computer Science Conference (ACSC) (pp. 125–133).

Mendes, E., Lokan, C., Harrison, R., & Triggs, C. (2005). A replicated comparison of cross-company and within-company effort estimation models using the ISBSG database. In Proceedings of the 11th IEEE International Symposium Software Metrics (pp. 10).

Menzies, T., Krishna, R., & Pryor, D. (2016). The promise repository of empirical software engineering data; http://openscience.us/repo. North Carolina State University.

Messerli, F. H. (2012). Chocolate consumption, cognitive function, and Nobel laureates. The New England Journal of Medicine, 367(16), 1562.

Molokken, K., & Jorgensen, M. (2003). A review of software surveys on software effort estimation. In Proceedings of 2003 International Symposium on Empirical Software Engineering (ISESE) (pp. 223–230).

Pearl, J. (2002). Causality: models, reasoning, and inference. IIE Transactions, 34(6), 583–589.

Pfister, N., Bühlmann, P., & Peters, J. (2017). Invariant causal prediction for sequential data. arXiv preprint arXiv:1706.08058.

Robins, J. (1986). A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect. Mathematical Modelling, 7(9-12), 1393-1512.

Shimizu, S., Hoyer, P. O., Hyvärinen, A., & Kerminen, A. (2006). A linear non-Gaussian acyclic model for causal discovery. Journal of Machine Learning Research, 7, 2003–2030.

Sigweni, B., Shepperd, M., & Forselius, P. (2015, 3). Finnish software effort dataset (Online), https://figshare.com/articles/Finnish_Effort_Estimation_Dataset/1334271, (accessed: 2017-7-30).

Sigweni, B., Shepperd, M., & Turchi, T. (2016). Realistic assessment of software effort estimation models. In Proceedings of the 20th International Conference on Evaluation and Assessment In Software Engineering (p. 41).

Sprent, P., & Smeeton, N. C. (2016). Applied nonparametric statistical methods. CRC Press.

Takagi, Y., Mizuno, O., & Kikuno, T. (2005). An empirical approach to characterizing risky software projects based on logistic regression analysis. Empirical Software Engineering, 10(4), 495-515.

Tantithamthavorn, C., McIntosh, S., Hassan, A. E., & Matsumoto, K. (2017). An empirical comparison of model validation techniques for defect prediction models. IEEE Transactions on Software Engineering, 43(1), 1-18.

The Standish Group (1994). Chaos. Technical report, The Standish Group International Inc.

Uzzafer, M. (2016). Bootstrap correlation analysis of function point elements. International Journal of Database Theory and Application, 9(3), 11–18.

Wen, J., Li, S., Lin, Z., Hu, Y., & Huang, C. (2012). Systematic literature review of machine learning based software development effort estimation models. Information and Software Technology, 54(1), 41–59.

You, C. (2016). R tutorial (Online), http://www.r-tutor.com/elementary-statistics/numerical-measures/skewness, (accessed 2017-7-16).