Open Access Short Communication

Principal Component Analysis of Factors Affecting Ovulation Interval of Selected Women

Matthew Chukwuma Michael1*, Eriobu Nkiru Obioma2 and Ashinze Mkpuruoma Akudo3

1, 3Department of Mathematics and Statistics, Nigeria

2Department of Statistics, Nigeria

Corresponding Author

Received Date: February 21, 2019;  Published Date: February 27, 2019

Abstract

The factors affecting ovulation interval, like another biological phenomenon are numerous and interdependent. This paper identified the factors to include Age, Height, Weight, Work Time (Stress), Menstrual Duration, Number of Conceptions, Number of Births and Exposure to Sun. It is the aim of this paper to formulate a smaller number of independent random variables such that minimum information is lost. A total of two hundred (200) women in their reproductive age interval were selected for the study. Questionnaires were used to get the relevant information from them. Eighty eight percent (88%) of the total variation in the ovulation interval was accounted for by six (6) principal components compared to the original eight random variables. It was observed that the most important factors are Stress, Height, Menstrual Duration, Weight and Age, and that only Exposure to Sun did not show any significance in affecting Ovulation Interval.

Keywords:Principal Component; Ovulation; Conception; Factors and multivariate

Introduction

The time between two menstrual flows in which egg is released from the ovary into the fallopian tube in readiness for fertilization is called ovulation [1]. Being able to determine the period of ovulation is one of the major challenges to conception and unwanted/unexpected conception. The general believe is that ovulation occurs halfway into the normal menstrual cycle. There are signs and symptoms of ovulation which can help to notice the occurrence of ovulation [2]. The most important sign according to [2] is virginal mucous. Other signs like dizziness, increased body temperature and feverishness exist. Predicting ovulation; that is, knowing when it will occur and when it has occurred is very important for conception [3]. In human population the average duration of ovulation occurrence is 28 days. However, most women experience deviations from this universal parameter. Ultimately, many women experience variation in the interval of their ovulation.

Like every other biological phenomenon, ovulation is affected by some factors which are also interdependent. To enhance a good prediction of ovulation, this work was designed to perform a principal component analysis of some suspected factors affecting ovulation interval in selected women in order to determine a fewer and independent set of random variables.

Methodology

A total of 200 women in their reproductive age were selected for the study. The women were mostly teachers in primary and secondary schools, health workers in public and private health facilities and literate women in market places. The data collection process took place in a period of six weeks. Questionnaires were administered to the women and they supplied the necessary information under proper supervision. The major factors considered for analysis are Age, Height, Weight, Work Time (Stress), Menstrual Duration, Number of Conceptions, Number of Births and Exposure to Sun.

The method of data analysis is Principal Component Analysis. It is a multivariate Technique used in data reduction and data analysis [4,5]. The aim of principal component analysis is to ascertain if the joint variation in p variables can be represented approximately in terms of the joint variation of a fewer number, say k < p , of hypothetical variables without much loss of information. That is X p×1 are replaced by a linear transformation,

irispublishers-openaccess-biostatistics-biometric-applications

Where Yp×1 k < p are significant and approximately chosen from Yp×1. The Yp×1 are interpretable in terms of the original problem and they are independent [5,6].

irispublishers-openaccess-biostatistics-biometric-applications

The are Principal Components (PC) if they are uncorrelated and their variances are as large as possible.

irispublishers-openaccess-biostatistics-biometric-applications

If the mean vector and covariance matrix of are given as and respectively and, has eigen value – eigen vector pairs; the ith

Principal Component Analysis is given by

irispublishers-openaccess-biostatistics-biometric-applications

with

irispublishers-openaccess-biostatistics-biometric-applications

The total variance is given as

irispublishers-openaccess-biostatistics-biometric-applications

Thus, the proportional variance explained by the kth principal component is

irispublishers-openaccess-biostatistics-biometric-applications

If 80% to 90% of the total population variance for large can be attributed to the first one, two or three components, these components can be used to replace the original variables without much loss of information.

When the variables, are in different units, two or more variables are measured on vastly different ranges or standardized random variables are used instead of the original random variables, the correlation matrix, , is used to calculate Principal Components (PC) instead of Σ [4].

Data Analysis

The eigen values for the eight random variables considered are 1.8989, 1.2740, 1.1409, 0.9970, 0.8840, 0.8225, 0.6101 and 0.3707 with the following respective proportions 0.237, 0.158, 0.143, 0.125, 0.110, 0.103, 0.076nand 0.047 and cumulative proportions 0.237,0.397, 0.539, 0.664, 0.774, 0.877, 0.953 and 1.000. The independent Principal Components are thus represented as,

irispublishers-openaccess-biostatistics-biometric-applications

Where x1 =Age, x2 = Height, x3 =Weight, x4 =Work Time, x5 = Menstrual Duration, x6 =Number of Conceptions, x7 =Number of Births and x8 =Exposure to Sun.

The six Principal Components accounts for 87.7%, approximately 88% of the total variation in the ovulation interval of the selected women. The Principal Components attach more significance to Stress, Height, Menstrual Duration, Weight and Age, and less significance to Exposure to Sun in affecting Ovulation Interval.

Conclusion

The original data which was collected in eight dimensions known to be correlated has been successfully reduced into six principal components which are shown to be uncorrelated. The six principal components account for 88% of the total variation in ovulation interval of the studied population. Hence, further analysis of the data demanding the condition of independence of data set can be done using the six principal components.

Acknowledgement

None.

Conflict of Interest

No conflict of interest.

Citation
Keywords
Signup for Newsletter
Scroll to Top