Friday, March 13, 2020
The aim of this investigation is to examine whether or not the number of people per doctor affects a countries average life expectancy Essays
The aim of this investigation is to examine whether or not the number of people per doctor affects a countries average life expectancy Essays   The aim of this investigation is to examine whether or not the number of people per doctor affects a countries average life expectancy Essay  The aim of this investigation is to examine whether or not the number of people per doctor affects a countries average life expectancy Essay          The aim of this investigation is to examine whether or not the number of people per doctor affects a countries average life expectancy.  The life expectancy of many lesser economically developed countries is lower than that of more economically developed countries. Generally, better-developed countries have a greater doctor to population ratio. So I wish to determine whether this is a factor that affects life expectancy.  I choose this investigation, as Im interested in geography particularly travelling. I plan to take a gap year after my A-levels, prior to university and hopefully visit many areas of the world including less economically developed countries. This led me to an interest in the variation of death rates between countries and I decided to compare this data to the number of doctors per person and to see if this influences the death rate in anyway.        DATA COLLECTION:  Firstly, I collected a list of all the countries in the world and their doctor to patient ratio. I got my data from a school Atlas I acquired from the college library; I collected the data from the same source as it was obtained in the same year. The countries were listed alphabetically and assigned a number. Using a graphics calculator I generated a random number, using a random function and chose a sample of 50. However, some numbers were generated twice so I ignored it the second time and went on to the next number.   1   Data No.  People per Dr.  average life expectancy  1  7358  45  2  769  73  3  1250  69  4  555  83  5  14300  47  6  1316  75  7  370  73  8  298  71  9  455  78  10  385  77  11  257  70  12  709  74  13  9090  73  14  5000  58  15  885  76  16  244  68  17  885  76  18  10000  53  19  5825  61  20  57300  44  21  3448  69  22  909  75  23  1111  76  24  667  70  25  357  78  26  2000  52  27  5000  47  28  5556  45  29  2500  69  30  333  79  31  2500  63  32  6423  65  33  1667  62  34  588  78  35  625  70  36  6667  52  37  5000  86  38  20000  56  39  476  77  40  406  77  41  50000  45  42  476  77  43  33333  49  44  303  78  45  10000  68  46  435  73  47  699  72  48  556  70  49  375  81  50  14300  37  Total  123743  1712   2   Modelling Procedures:  I decided to use Excel to input my data into a table format (shown above), from this table I used Excel to draw a scatter diagram of all the data.  Scatter Diagram to Compare Life Expectancies to People Per Doctor For 50 Random Countries  The scatter diagram gives a good diagrammatic representation of the data and shows how the data is spread in roughly an elliptical nature. From this I can make an initial conclusion/statement that both data variables are random and normally distributed. Due to the elliptical nature of the data it allowed me to produce a regression line from the data. The regression lines shows visually roughly how strong or weak the correlation of the data is and in this instance the data is a relatively strong negative correlation. The strength of the correlation can be calculated using Pearsons Product Moment Correlation.  To do this I used Excel to set-up a table consisting of (xi, yi , xi2 , yi2 , xiyi ) and the sum of all columns (shown page. 5)   3   Data No.  People per Dr.  average life expectancy  Xi^2  Yi^2  XiYi  1  7358  45  54140164  2025  331110  2  769  73  591361  5329  56137  3  1250  69  1562500  4761  86250  4  555  83  308025  6889  46065  5  14300  47  204490000  2209  672100  6  1316  75  1731856  5625  98700  7  370  73  136900  5329  27010  8  298  71  88804  5041  21158  9  455  78  207025  6084  35490  10  385  77  148225  5929  29645  11  257  70  66049  4900  17990  12  709  74  502681  5476  52466  13  9090  73  82628100  5329  663570  14  5000  58  25000000  3364  290000  15  885  76  783225  5776  67260  16  244  68  59536  4624  16592  17  885  76  783225  5776  67260  18  10000  53  100000000  2809  530000  19  5825  61  33930625  3721  355325  20  57300  44  3283290000  1936  2521200  21  3448  69  11888704  4761  237912  22  909  75  826281  5625  68175  23  1111  76  1234321  5776  84436  24  667  70  444889  4900  46690  25  357  78  127449  6084  27846  26  2000  52  4000000  2704  104000  27  5000  47  25000000  2209  235000  28  5556  45  30869136  2025  250020  29  2500  69  6250000  4761  172500  30  333  79  110889  6241  26307  31  2500  63  6250000  3969  157500  32  6423  65  41254929  4225  417495  33  1667  62  2778889  3844  103354  34  588  78  345744  6084  45864  35  625  70  390625  4900  43750  36  6667  52  44448889  2704  346684  37  5000  86  25000000  7396  430000  38  20000  56  400000000  3136  1120000  39  476  77  226576  5929  36652  40  406  77  164836  5929  31262  41  50000  45  2500000000  2025  2250000  42  476  77  226576  5929  36652  43  33333  49  1111088889  2401  1633317  44  303  78  91809  6084  23634  45  10000  68  100000000  4624  680000  46  435  73  189225  5329  31755  47  699  72  488601  5184  50328  48  556  70  309136  4900  38920  49  375  81  140625  6561  30375  50  14300  37  204490000  1369  529100  Total  123743  1712  3804969945  120078  6450387   5   Pearsons Product Moment Correlation Coefficient  This is denoted by r  r = Sxy  Sx Sy  Sx = Standard deviation of x =  Sy = Standard deviation of y =  Sxy = Covariance = 1/50 ?xi yi  x y  = 1/50 ?xi yi  x y  Sx Sy  Sx = 11588.897  Sy = 12.312  Sxy = -87234.776  R = -0.624  Hypothesis Test  Im going to test my data at a 5% significant level. p = Population Product Moment Correlation Coefficient,  H0: p = 0 (no correlation between people per doctor and life expectancy)  H1: p  0 (negative correlation between people per doctor and life expectancy)  Im using a 1 tail test- as from the initial scatter diagram and Pearsons Product Moment Correlation Coefficient Im aware that the correlation (if significant will be negative).  * n = 50 r = -0.624 r (critical value) =  Therefore by using the tables of critical values for (r) when n = 50 it is evident that the value for r (-0.624) is greater than the critical value when n = 50 at a 5% significant level.  H1: p ; 0 (negative correlation between people per doctor and life expectancy) can be accepted and H0 rejected. Thus showing that at a 5% significant level there is negative correlation between people per doctor and life expectancy.   6-  Regression Line  Using the equation for a regression line: y- y = Sxy (x -x)  Sx2  Ive generated an equation to calculate the value of (x) from (y).  * y  66.8 = -87234.776 (x- 5879.22)  11588.8972  *  Conclusion  The scatter diagram is a good initial indication of negative correlation between people per doctor and life expectancy, suggesting that for countries that life expectancy is low there will be a greater number of people per doctor- compared to a country with higher life expectancy.  Pearsons Product Moment Correlation Coefficient determines the strength of correlation between data, i.e  * if r = o (no correlation)  * if r = -1 ( perfect negative correlation)  * if r = 1 (perfect positive correlation)  Because my calculation gave me the value of r equal to -0.624 it supported the initial interpretation of the data having negative correlation and indicated that the negative correlation was of a reasonable strength.  I decided to carry out a Hypothesis test on the data. This was carried out by the comparison of r (-0.624) with the corresponding critical values of (r) from the tables- showing negative correlation between people per doctor and life expectancy at a 5% significance level.   7-  Accuracy  The accuracy of my raw data is likely to be of the highest accuracy due to the fixers being obtained from the CIA (Central Intelligence of America) web site- from this I can be certain that all data is recent and for my investigation reliable. The only error likely to occur is the ever changing patient to doctor ratio, although is accounted for before the raw data was published by the CIA. I found this the most accurate and up to date source of information available for my access.  Within the calculations itself the results are also of my highest possible accuracy. I used Excel to initially calculate Pearsons Product Correlation Coefficient, Mean, Standard Deviation and Co-variance, that was then check by hand using a calculator and the formulas included within my investigation. I kept the data to 3signifcant figures as accuracy beyond this wasnt necessary for this particular investigation.  The regression line was also drawn by Excel and not by hand as to be most accurate.  The only inaccuracy that I felt might have effected my investigation is a particular significant outlier or anomal result: (a result over two standard deviations from the mean). This could have caused my standard deviation of X to increase and Y to decrease compared to all other data figures, leading to a possible inaccuracy to my Co-variance and Pearsons Product Correlation Coefficient. The anomaly is highlighted in my scatter diagram (including the regression line) as to show the change in the regression line to incorporate this outlier- another possibly affected factor in my investigation.    
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.