Luke 15 Testing

home
back
 

Basic Testing

A basic test of Luke 15 corresponding to the author's analysis is presented here, being conducted according to the proposed phrase construction rules.

Apparently arbitrary manipulations of the text were maintained, where the author, in more than a dozen instances, altered the order of certain words, stating, "the conjunction for and or but (de), and one instance of therefore (ouv) had to be moved in the pecking order so that the logic of the computer program could calculate every phrase possibility -- with or without." This was generally at the beginning of a verse where de was preceded by a base word (such as said... eipe). This basic test respected such changes, as well as the omission of the autou reference in verse 14. Thus, the text of the Basic Test is exactly the same as the author's.
 

4-Word Test

The number of 4-word phrases constructed in this test is 683 (shown here). This compares to 763 phrases in the author's (corrected) sample (not 772 since the omission of autou is maintained). Since the author did not publish his phrase pool we cannot explain this difference, however we do note 77 redundant phrase sums in the complete pool of 760 phrases available. It is likely that the author included such redundant phrases in his sample.

The summary table below reports the factors outperforming the author's Theomatic factor (row 0) in descending overall statistical significance. F is the factor, followed by its hits (H), then the expected (arithmetic mean) number of hits (M). The next three columns show the clustering percentages, p-values of hits (PH) and clustering (PC), and the joint p-value, P (PHxPHC), indicating general statistical significance in the same manner that the author does in Chapter 9, and its associated odds 1 in N. The last column, O, gives the average number of  Theomatic tests needed before seeing results comparable to this factor based on the distribution of the maximum order statistic (MOS). Differences between the author's corrected results and the following results are primarily due to phrase construction. The actual hits obtained by each factor are shown here . Factor 10 is excluded from consideration (greyed out) due to the affect of phrase construction dynamics.

No

F

H

M

0%

1%

2%

PH

PC

P

N

O

0

90

53

37.94

25

40

36

0.009954

0.679232

0.006761

148

1.00

1

10

363

341.5

29

37

33

0.053984

0.000046

0.000002

BIG

BIG

2

25

129

136.6

34

34

32

0.779650

0.000313

0.000244

4,092

2.24

3

75

53

45.53

36

43

21

0.143105

0.002856

0.000409

2,447

1.65

4

20

181

170.75

29

34

37

0.193934

0.006757

0.001310

763

1.12

5

230

24

14.85

38

29

33

0.016448

0.097987

0.001612

620

1.09

6

125

22

27.32

50

32

18

0.874444

0.001591

0.001392

719

1.11

7

30

132

113.83

28

39

33

0.036743

0.055463

0.002038

491

1.06

8

518

5

6.59

80

0

20

0.787861

0.003183

0.002508

399

1.04

9

170

31

20.09

32

29

39

0.012972

0.194544

0.002524

396

1.04

10

893

9

3.82

33

11

56

0.016287

0.199666

0.003252

308

1.02

11

690

12

4.95

25

33

42

0.004879

0.864302

0.004217

237

1.01

12

15

234

227.67

26

42

32

0.316701

0.016477

0.005218

192

1.01

13

562

12

6.08

25

17

58

0.021346

0.247627

0.005286

189

1.01

14

150

29

22.77

38

34

28

0.113301

0.049787

0.005641

177

1.00

15

339

16

10.07

0

56

44

0.050035

0.115758

0.005792

173

1.00

16

45

87

75.89

30

38

32

0.099841

0.058648

0.005855

171

1.00

17

575

6

5.94

67

0

33

0.545160

0.011109

0.006056

165

1.00

18

383

17

8.92

29

35

35

0.009752

0.624635

0.006092

164

1.00

19

978

5

3.49

0

0

100

0.272765

0.023518

0.006415

156

1.00

20

493

13

6.93

15

62

23

0.024483

0.275695

0.006750

148

1.00

Clearly, one may note that the Theomatic factor identified by the author, 90, is insignificant. All of the valid hits obtained by the author were obtained here: none were missed.

Adhering to fixed phrase construction rules actually yields the same number of hits from a smaller sample (683 vs 765), giving a slightly better PH value (.0010 vs .0194) than that obtained by the author's phrase construction technique. Clustering significance has deteriorated somewhat (p-value of .6792 vs .4703). The general result is more favorable for the claimed Theomatics factor (1:N is 1:148 vs 1:110 but this difference in O is negligible (both are 1.00). Both results are certainly well below the expected value of the MOS and outperformed by a number of apparently random factors. 90 ranks 4th among a thousand factors in hit significance in this test, 668th in clustering, and 21st overall.

It is significant that 38% of these factors are multiples of 10, and 67% are multiples of 5. If one just looks at the top 10 factors, 50% are multiples of 10 and 80% are multiples of 5. In a random environment these values would tend to be 10% and 20% respectively for the top N factors (regardless of N). This is evidence that phrase construction rules tend to favor small multiples of 5, which violates the assumption of randomness in the test and therefore diminishes any claim of theomatic significance in factor 90 (or, for that matter, factor 10, which would be quite significant otherwise). This non-random property certainly cannot hurt 90's performance, but it evidently has not helped much: there is no Theomatic significance in factor 90 in spite this advantage
 

3-Word Test

The above context was also tested with a max phrase length of 3  to compare with the author's results. The manner of phrase construction would, again, be the only difference between these results and the author's results. This test resulted in 420 phrases, fewer than the 465 in the (corrected) sample of the author's test, and are given here. The actual hits obtained by each factor are given here. Columns are as in the previous table.

No

F

H

0%

1%

2%

M

PH

PC

P

N

O

0

90

37

32

43

24

37.94

0.004256

0.073178

0.000311

3,211

1.92

1

10

238

33

40

27

341.5

0.003604

SMALL

SMALL

BIG

BIG

2

25

78

44

31

26

136.6

0.784763

0.000001

0.000001

BIG

BIG

3

45

55

38

42

20

45.53

0.113533

0.000663

0.000075

13,278

6.30

4

30

86

31

43

26

170.75

0.023365

0.005946

0.000139

7,198

3.46

5

50

51

39

33

27

14.85

0.086091

0.002491

0.000214

4,663

2.45

6

70

33

45

21

33

27.32

0.310462

0.000926

0.000287

3,479

2.01

7

75

32

41

47

13

113.83

0.242032

0.001337

0.000324

3,090

1.88

The 3-word test gives much different results than the 4-word test. 90 is still insignificant, since it is still being outperformed by the MOS . 90 rank's 3rd in hit significance, 45th in clustering, 8th overall. A much better p-value is obtained for 90 in this test, yet its clustering is poor in comparison with the other top factors.

Again, note that every factor listed is a multple of 5 and 63% are multples of 10. Each factor attains its significance due to an unusual clustering pattern, tending heavily toward direct hits, and smaller factors are nearly always favored to larger ones. Factors 10 and 25 appear remarkably significant. Evidently, the affect of variable manipulation makes this environment very non-random, implying that the resulting probabilities are meaningless. If the environment were purely random we would expect no such patterns as these in the results.

Factor 70, a larger multiple of 10, is mildly interesting due to being bi-polar in its clustering, and very heavy on direct  hits, but it is observed that 70 happens to be the value of the article O, which occurs 17 times in this text, and 70 also evenly divides two other articles, TON and TOU, which appear a total of 15 times in the text and give rise to multiple variations of any successful hit.
 

2-Word Test

The above context was also tested with a max phrase length of 2. This test resulted in 206 phrases, again fewer than the 226 in the author's test, and are given here. The actual hits obtained by each factor are given here. Columns are as in the previous table.

No

F

H

M

0%

1%

2%

PH

PC

P

N

O

0

90

21

11.44

38

48

14

0.005646

0.027129

0.000153

6,528

3.19

1

10

122

103

46

33

21

0.004885

SMALL

SMALL

BIG

BIG

2

30

44

34.33

48

39

14

0.046597

0.000005

SMALL

BIG

BIG

3

20

59

51.5

46

31

24

0.130662

0.000004

0.000001

BIG

BIG

4

15

69

68.67

39

45

16

0.506540

0.000010

0.000005

BIG

BIG

5

50

32

20.6

47

28

25

0.008272

0.000716

0.000006

BIG

BIG

6

170

11

6.06

64

0

36

0.042593

0.000579

0.000025

40,563

25.97

7

45

31

22.89

42

48

10

0.050231

0.000519

0.000026

38,394

23.99

8

150

12

6.87

58

42

0

0.044654

0.001100

0.000049

20,360

10.33

9

75

18

13.73

50

50

0

0.146710

0.000380

0.000056

17,931

8.86

10

25

41

41.2

46

32

22

0.541662

0.000108

0.000058

17,107

8.38

11

40

28

25.75

50

25

25

0.347801

0.000380

0.000132

7,564

3.62

These results show all of the author's 23 hits for factor 90, which appears as the most significant factor for hits among many that outrank it in overall significance, but ranks 3rd in hit probability overall and 30th due its clustering qualities. It does finally outperform the MOS in overall significance, but many factors obtain very unusual results in this context due to clustering for some reason.

Again, every single factor is a multiple of 5, and 67% are multiples of 10. The trend observed in the 3-word phrases is extended here, though some larger factors are present. It is reasonable to expect that the combination of phrase construction rules (including and excluding multiple short words that are often multiples of 10), a common subject (with similar wording -- 25% of the reference words used to identify the subject, Son , are divisible by 20, 16% are divisible by 90), and only 2 words per phrase (limiting the number of base words and allowing the phrase construction rules even more impact on the combinations) affects the randomness of the phrase sums in a way that allows certain factors to achieve very unlikely clustering results. This is particularly evident for factors 10, 20 and 30. We therefore do not find any of these unusual results to be statistically convincing, or evidence that Theomatics was designed. Certainly, the factor 90 performs less significantly than many other apparently random factors in this context when both hits and clustering are considered.

Very clearly, the results of the above testing, including all phrase lengths considered by the author, and the exact text that he used in his testing, indicate that 90 does not exhibit significance as a Theomatic factor in either its general hit performance or in the nature of its clustering. In every test conducted there are random factors that outperform it. It is never the top ranking factor in overall significance in any word-length category.
 

Cluster Radius 1 Tests

The above context was revisited with the cluster radius reduced to 1. When divisors are only allowed to be off by one unit, we see the following results with 4-word phrases. 90 is still similarly insignificant, ranking 18th in overall significance, 8th in hit significance, and 565th in clustering significance. No factors appear significant.

No

F

H

0%

1%

M

PH

PC

P

N

O

0

90

34

38

62

22.77

0.014900

0.832083

0.012398

81

1.00

1

10

242

44

56

204.9

0.001287

0.002562

0.000003

303,212

864.25

2

25

88

50

50

81.96

0.254344

0.004087

0.001039

962

1.18

3

75

42

45

55

27.32

0.004577

0.262033

0.001199

834

1.14

4

20

114

46

54

102.45

0.119080

0.011787

0.001404

712

1.11

5

30

88

42

58

68.3

0.008803

0.222497

0.001959

511

1.06

6

230

16

56

44

8.91

0.019591

0.150977

0.002958

338

1.03

7

45

59

44

56

45.53

0.026699

0.216609

0.005783

173

1.00

8

150

21

52

48

13.66

0.037318

0.180092

0.006721

149

1.00

9

493

10

20

80

4.16

0.010174

0.670320

0.006820

147

1.00

10

170

19

53

47

12.05

0.037408

0.203497