Theom-Lk15-Tests

A basic test of Luke 15 corresponding to the author's analysis is presented here, being conducted according to the proposed phrase construction rules.

Apparently arbitrary manipulations of the text were maintained, where the author, in more than a dozen instances, altered the order of certain words, stating, "the conjunction for and or but (de), and one instance of therefore (ouv) had to be moved in the pecking order so that the logic of the computer program could calculate every phrase possibility -- with or without." This was generally at the beginning of a verse where de was preceded by a base word (such as said... eipe). This basic test respected such changes, as well as the omission of the autou reference in verse 14. Thus, the text of the Basic Test is exactly the same as the author's.

The number of 4-word phrases constructed in this test is 683 (shown here). This compares to 763 phrases in the author's (corrected) sample (not 772 since the omission of autou is maintained). Since the author did not publish his phrase pool we cannot explain this difference, however we do note 77 redundant phrase sums in the complete pool of 760 phrases available. It is likely that the author included such redundant phrases in his sample.

The summary table below reports the factors outperforming the author's Theomatic factor (row 0) in descending overall statistical significance. F is the factor, followed by its hits (H), then the expected (arithmetic mean) number of hits (M). The next three columns show the clustering percentages, p-values of hits (P_H) and clustering (P_C), and the joint p-value, P (P_HxP_HC), indicating general statistical significance in the same manner that the author does in Chapter 9, and its associated odds 1 in N. The last column, O, gives the average number of Theomatic tests needed before seeing results comparable to this factor based on the distribution of the maximum order statistic (MOS). Differences between the author's corrected results and the following results are primarily due to phrase construction. The actual hits obtained by each factor are shown here . Factor 10 is excluded from consideration (greyed out) due to the affect of phrase construction dynamics.

No	F	H	M	0%	1%	2%	P_H	P_C	P	N	O
0	90	53	37.94	25	40	36	0.009954	0.679232	0.006761	148	1.00
1	10	363	341.5	29	37	33	0.053984	0.000046	0.000002	BIG	BIG
2	25	129	136.6	34	34	32	0.779650	0.000313	0.000244	4,092	2.24
3	75	53	45.53	36	43	21	0.143105	0.002856	0.000409	2,447	1.65
4	20	181	170.75	29	34	37	0.193934	0.006757	0.001310	763	1.12
5	230	24	14.85	38	29	33	0.016448	0.097987	0.001612	620	1.09
6	125	22	27.32	50	32	18	0.874444	0.001591	0.001392	719	1.11
7	30	132	113.83	28	39	33	0.036743	0.055463	0.002038	491	1.06
8	518	5	6.59	80	0	20	0.787861	0.003183	0.002508	399	1.04
9	170	31	20.09	32	29	39	0.012972	0.194544	0.002524	396	1.04
10	893	9	3.82	33	11	56	0.016287	0.199666	0.003252	308	1.02
11	690	12	4.95	25	33	42	0.004879	0.864302	0.004217	237	1.01
12	15	234	227.67	26	42	32	0.316701	0.016477	0.005218	192	1.01
13	562	12	6.08	25	17	58	0.021346	0.247627	0.005286	189	1.01
14	150	29	22.77	38	34	28	0.113301	0.049787	0.005641	177	1.00
15	339	16	10.07	0	56	44	0.050035	0.115758	0.005792	173	1.00
16	45	87	75.89	30	38	32	0.099841	0.058648	0.005855	171	1.00
17	575	6	5.94	67	0	33	0.545160	0.011109	0.006056	165	1.00
18	383	17	8.92	29	35	35	0.009752	0.624635	0.006092	164	1.00
19	978	5	3.49	0	0	100	0.272765	0.023518	0.006415	156	1.00
20	493	13	6.93	15	62	23	0.024483	0.275695	0.006750	148	1.00

Clearly, one may note that the Theomatic factor identified by the author, 90, is insignificant. All of the valid hits obtained by the author were obtained here: none were missed.

Adhering to fixed phrase construction rules actually yields the same number of hits from a smaller sample (683 vs 765), giving a slightly better P_H value (.0010 vs .0194) than that obtained by the author's phrase construction technique. Clustering significance has deteriorated somewhat (p-value of .6792 vs .4703). The general result is more favorable for the claimed Theomatics factor (1:N is 1:148 vs 1:110 but this difference in O is negligible (both are 1.00). Both results are certainly well below the expected value of the MOS and outperformed by a number of apparently random factors. 90 ranks 4^th among a thousand factors in hit significance in this test, 668^th in clustering, and 21^st overall.

It is significant that 38% of these factors are multiples of 10, and 67% are multiples of 5. If one just looks at the top 10 factors, 50% are multiples of 10 and 80% are multiples of 5. In a random environment these values would tend to be 10% and 20% respectively for the top N factors (regardless of N). This is evidence that phrase construction rules tend to favor small multiples of 5, which violates the assumption of randomness in the test and therefore diminishes any claim of theomatic significance in factor 90 (or, for that matter, factor 10, which would be quite significant otherwise). This non-random property certainly cannot hurt 90's performance, but it evidently has not helped much: there is no Theomatic significance in factor 90 in spite this advantage

The above context was also tested with a max phrase length of 3 to compare with the author's results. The manner of phrase construction would, again, be the only difference between these results and the author's results. This test resulted in 420 phrases, fewer than the 465 in the (corrected) sample of the author's test, and are given here. The actual hits obtained by each factor are given here. Columns are as in the previous table.

The 3-word test gives much different results than the 4-word test. 90 is still insignificant, since it is still being outperformed by the MOS . 90 rank's 3^rd in hit significance, 45^th in clustering, 8^th overall. A much better p-value is obtained for 90 in this test, yet its clustering is poor in comparison with the other top factors.

Again, note that every factor listed is a multple of 5 and 63% are multples of 10. Each factor attains its significance due to an unusual clustering pattern, tending heavily toward direct hits, and smaller factors are nearly always favored to larger ones. Factors 10 and 25 appear remarkably significant. Evidently, the affect of variable manipulation makes this environment very non-random, implying that the resulting probabilities are meaningless. If the environment were purely random we would expect no such patterns as these in the results.

Factor 70, a larger multiple of 10, is mildly interesting due to being bi-polar in its clustering, and very heavy on direct hits, but it is observed that 70 happens to be the value of the article O, which occurs 17 times in this text, and 70 also evenly divides two other articles, TON and TOU, which appear a total of 15 times in the text and give rise to multiple variations of any successful hit.

The above context was also tested with a max phrase length of 2. This test resulted in 206 phrases, again fewer than the 226 in the author's test, and are given here. The actual hits obtained by each factor are given here. Columns are as in the previous table.

No	F	H	M	0%	1%	2%	PH	PC	P	N	O
0	90	21	11.44	38	48	14	0.005646	0.027129	0.000153	6,528	3.19
1	10	122	103	46	33	21	0.004885	SMALL	SMALL	BIG	BIG
2	30	44	34.33	48	39	14	0.046597	0.000005	SMALL	BIG	BIG
3	20	59	51.5	46	31	24	0.130662	0.000004	0.000001	BIG	BIG
4	15	69	68.67	39	45	16	0.506540	0.000010	0.000005	BIG	BIG
5	50	32	20.6	47	28	25	0.008272	0.000716	0.000006	BIG	BIG
6	170	11	6.06	64	0	36	0.042593	0.000579	0.000025	40,563	25.97
7	45	31	22.89	42	48	10	0.050231	0.000519	0.000026	38,394	23.99
8	150	12	6.87	58	42	0	0.044654	0.001100	0.000049	20,360	10.33
9	75	18	13.73	50	50	0	0.146710	0.000380	0.000056	17,931	8.86
10	25	41	41.2	46	32	22	0.541662	0.000108	0.000058	17,107	8.38
11	40	28	25.75	50	25	25	0.347801	0.000380	0.000132	7,564	3.62

These results show all of the author's 23 hits for factor 90, which appears as the most significant factor for hits among many that outrank it in overall significance, but ranks 3^rd in hit probability overall and 30^th due its clustering qualities. It does finally outperform the MOS in overall significance, but many factors obtain very unusual results in this context due to clustering for some reason.

Again, every single factor is a multiple of 5, and 67% are multiples of 10. The trend observed in the 3-word phrases is extended here, though some larger factors are present. It is reasonable to expect that the combination of phrase construction rules (including and excluding multiple short words that are often multiples of 10), a common subject (with similar wording -- 25% of the reference words used to identify the subject, Son, are divisible by 20, 16% are divisible by 90), and only 2 words per phrase (limiting the number of base words and allowing the phrase construction rules even more impact on the combinations) affects the randomness of the phrase sums in a way that allows certain factors to achieve very unlikely clustering results. This is particularly evident for factors 10, 20 and 30. We therefore do not find any of these unusual results to be statistically convincing, or evidence that Theomatics was designed. Certainly, the factor 90 performs less significantly than many other apparently random factors in this context when both hits and clustering are considered.

Very clearly, the results of the above testing, including all phrase lengths considered by the author, and the exact text that he used in his testing, indicate that 90 does not exhibit significance as a Theomatic factor in either its general hit performance or in the nature of its clustering. In every test conducted there are random factors that outperform it. It is never the top ranking factor in overall significance in any word-length category.

The above context was revisited with the cluster radius reduced to 1. When divisors are only allowed to be off by one unit, we see the following results with 4-word phrases. 90 is still similarly insignificant, ranking 18^th in overall significance, 8^th in hit significance, and 565^th in clustering significance. No factors appear significant.

No	F	H	0%	1%	M	PH	PC	P	N	O
0	90	34	38	62	22.77	0.014900	0.832083	0.012398	81	1.00
1	10	242	44	56	204.9	0.001287	0.002562	0.000003	303,212	864.25
2	25	88	50	50	81.96	0.254344	0.004087	0.001039	962	1.18
3	75	42	45	55	27.32	0.004577	0.262033	0.001199	834	1.14
4	20	114	46	54	102.45	0.119080	0.011787	0.001404	712	1.11
5	30	88	42	58	68.3	0.008803	0.222497	0.001959	511	1.06
6	230	16	56	44	8.91	0.019591	0.150977	0.002958	338	1.03
7	45	59	44	56	45.53	0.026699	0.216609	0.005783	173	1.00
8	150	21	52	48	13.66	0.037318	0.180092	0.006721	149	1.00
9	493	10	20	80	4.16	0.010174	0.670320	0.006820	147	1.00
10	170	19	53	47	12.05	0.037408	0.203497	0.007612	131	1.00
11	15	159	38	62	136.6	0.019457	0.404275	0.007866	127	1.00
12	115	25	52	48	17.82	0.059935	0.140858	0.008442	118	1.00
13	575	4	100	0	3.56	0.477316	0.018316	0.008742	114	1.00
14	251	16	31	69	8.16	0.009393	0.984496	0.009247	108	1.00
15	518	4	100	0	3.96	0.558356	0.018316	0.010227	98	1.00
16	569	9	33	67	3.6	0.011470	1.000000	0.011470	87	1.00

The 90 factor ranks 3^rd in hit significance, but 249^th in clustering, 3^rd in overall significance. The nonrandom nature of the context for 70, 20 and 30 obscures their results. We think it is appropriate to extend this concept to cover 25 as well, being a small multiple of 5, since it is evidently "energized" by the restricted context but insignificant at the 4-word phrase level. 90 is still insignificant, though it does outperform the MOS whereas it did not in the test using radius 2.

Again, 90 ranks 3^rd in hit significance, but 7^th overall due to poor clustering. Again, we note that with fewer degrees of liberty in the test due to shorter phrases and the smaller cluster radius, very "statistically significant" features appear, all of which are multiples of 10, indicating that the environment has lost so much of its randomness that it is inappropriate to draw any conclusions about Theomatic performance.

The Luke 15 test was repeated after restoring the de conjunctions and oun to their original locations and keeping the other factors constant in the author's text. This resulted in 681 phrases, given here, indicating that the swap does not support significant flexibility in consistent phrase construction (implying 2 fewer phrases), as supposed by the author. The actual hits obtained by each factor are given here. Columns are as above.

No	F	H	0%	1%	2%	M	PH	PC	P	N	O
0	90	49	27	39	35	38	0.041162	0.494563	0.020357	49	1.00
1	10	361	28	39	33	341	0.062631	0.000214	0.000013	74,707	66.57
2	25	124	32	36	31	136	0.889080	0.002469	0.002195	456	1.05
3	170	31	32	32	35	20	0.012497	0.228594	0.002857	350	1.03
4	30	130	28	40	32	114	0.051852	0.055876	0.002897	345	1.03
5	383	18	28	33	39	9	0.004441	0.687289	0.003052	328	1.02
6	339	17	0	53	47	10	0.027037	0.115122	0.003113	321	1.02
7	893	9	33	11	56	4	0.016014	0.199666	0.003197	313	1.02
8	669	10	40	50	10	5	0.034629	0.105399	0.003650	274	1.02
9	20	179	28	36	36	170	0.231622	0.017712	0.004102	244	1.01
10	687	6	0	0	100	5	0.376349	0.011109	0.004181	239	1.01
11	230	23	35	35	30	15	0.027490	0.202336	0.005562	180	1.01
12	978	5	0	0	100	3	0.270834	0.023518	0.006369	157	1.00
13	562	9	33	0	67	6	0.157970	0.049787	0.007865	127	1.00
14	941	8	0	63	38	4	0.031193	0.269146	0.008396	119	1.00
15	75	49	31	47	22	45	0.310901	0.028404	0.008831	113	1.00
16	844	5	0	100	0	4	0.377895	0.023518	0.008887	113	1.00
17	483	8	0	13	88	7	0.408727	0.022093	0.009030	111	1.00
18	125	22	45	32	23	27	0.871312	0.010377	0.009041	111	1.00
19	678	10	0	50	50	5	0.032096	0.286505	0.009196	109	1.00
20	714	9	0	33	67	5	0.053408	0.173774	0.009281	108	1.00
21	894	9	11	33	56	4	0.015911	0.606531	0.009650	104	1.00
22	384	16	31	31	38	9	0.018860	0.510750	0.009633	104	1.00
23	976	6	50	0	50	3	0.140389	0.072440	0.010170	98	1.00
24	518	6	67	17	17	7	0.642623	0.016851	0.010829	92	1.00
25	251	21	24	52	24	14	0.035104	0.311403	0.010931	91	1.00
26	690	11	18	45	36	5	0.012240	0.934091	0.011433	87	1.00
27	326	16	13	25	63	10	0.064445	0.184981	0.011921	84	1.00
28	15	231	25	42	32	227	0.386433	0.036336	0.014042	71	1.00
29	706	9	11	22	67	5	0.056488	0.263597	0.014890	67	1.00
30	510	13	23	31	46	7	0.018882	0.793923	0.014991	67	1.00
31	607	4	75	0	25	6	0.811571	0.019497	0.015823	63	1.00
32	204	23	9	35	57	17	0.079930	0.202336	0.016173	62	1.00
33	502	12	8	58	33	7	0.043306	0.375624	0.016267	61	1.00
34	570	8	38	63	0	6	0.251783	0.065957	0.016607	60	1.00
35	805	2	100	0	0	4	0.924497	0.018316	0.016933	59	1.00
36	506	12	17	58	25	7	0.041238	0.416862	0.017191	58	1.00
37	85	47	32	36	32	40	0.147403	0.117855	0.017372	58	1.00
38	611	2	100	0	0	6	0.975421	0.018316	0.017865	56	1.00
39	723	7	29	71	0	5	0.196083	0.091368	0.017916	56	1.00
40	285	13	23	69	8	12	0.417959	0.044361	0.018541	54	1.00
41	997	7	43	29	29	3	0.058481	0.318907	0.018650	54	1.00
42	549	7	57	29	14	6	0.426284	0.044729	0.019067	52	1.00
43	489	5	0	0	100	7	0.825033	0.023518	0.019403	52	1.00
44	691	10	10	40	50	5	0.028775	0.687289	0.019777	51	1.00
45	289	15	7	67	27	12	0.206926	0.096972	0.020066	50	1.00
46	493	12	17	58	25	7	0.048356	0.416862	0.020158	50	1.00

The author's mysteriously rationalized rearrangement of the de conjunctions was apparently convenient for the Theomatic factor 90, resulting in 4 additional hits. The rearrangement does not give significantly more combinations to test, but different phrases to test. The reordering appears to have been an inappropriate "maneuver" on the part of the author that happens to have favored the desired outcome of his experiment... at least when consistently following phrase construction rules. It is unreasonable to presume that the author did in fact make this substitution blindly, not knowing that it would improve his results. We infer this simply because he offers no justification for the rearrangement of these words and it violates his principle of taking the text exactly as it is written.

In this new test, 90 is not even in the top ten factors according to its hits (ranks 17^th) and has very bland clustering (506^st). There is nothing interesting about this result

Finally, a similar test to the basic test was made of the author's chosen text in two other versions of the New Testament: the Robinson-Pierpont Majority Text (MT) and the Nestle-Aland 26^th/27^th edition (NA).

The following table shows results for 4-word phrases in MT, maintaining the author's reordering of the conjunctions and excluding the reference in verse 14 as before. This compares exactly with the author's text but in a different version of the same type of biblical text. Noted differences with the author's text were very minor, generally in the spelling of certain words (11 instances). 682 phrases were formed, given here, and hit results are given here.

No	F	H	0%	1%	2%	M	PH	PC	P	N	O
0	90	52	25	40	35	37.9	0.014377	0.597845	0.008595	116	1.00
1	10	366	29	38	33	341.0	0.030267	0.000035	0.000001	BIG	BIG
2	125	20	55	30	15	27.3	0.941502	0.000357	0.000336	2,974	1.84
3	25	127	34	34	32	136.4	0.828141	0.000480	0.000398	2,516	1.68
4	170	33	33	27	39	20.1	0.004334	0.118083	0.000512	1,954	1.49
5	30	134	28	40	32	113.7	0.022490	0.050254	0.001130	885	1.16
6	75	51	35	43	22	45.5	0.217170	0.005457	0.001185	844	1.14
7	230	24	38	29	33	14.8	0.016194	0.097987	0.001587	630	1.09
8	518	5	80	0	20	6.6	0.786818	0.003183	0.002504	399	1.04
9	15	236	26	43	31	227.3	0.252750	0.011651	0.002945	340	1.03
10	45	88	30	41	30	75.8	0.078771	0.040119	0.003160	316	1.02
11	150	31	35	39	26	22.7	0.053889	0.070994	0.003826	261	1.01
12	562	13	23	23	54	6.1	0.009299	0.445886	0.004146	241	1.01
13	690	12	25	33	42	4.9	0.004824	0.864302	0.004170	240	1.01
14	20	181	28	36	35	170.5	0.187823	0.022470	0.004220	237	1.01
15	339	16	0	56	44	10.1	0.049512	0.115758	0.005731	174	1.00
16	575	6	67	0	33	5.9	0.543743	0.011109	0.006040	166	1.00
17	50	74	32	31	36	68.2	0.246441	0.024491	0.006036	166	1.00
18	978	5	0	0	100	3.5	0.271799	0.023518	0.006392	156	1.00
19	549	6	67	33	0	6.2	0.588492	0.011109	0.006538	153	1.00
20	493	13	15	62	23	6.9	0.024234	0.275695	0.006681	150	1.00
21	941	8	0	63	38	3.6	0.031422	0.269146	0.008457	118	1.00

The Theomatic factor of 90 ranks 4^th in hit significance, 579^th in clustering, and places 22^nd in overall significance. This compares to 21^st in overall standing with the author's text. Not much is significant here... especially 90.

Test results for the NA text of Luke 15 resulted in 665 phrases, given here, quite a bit fewer than the MT , being a very dissimilar text in a number of respects, and gave factors as follows:

No	F	H	0%	1%	2%	M	PH	PC	P	N	O
0	90	49	31	39	31	36.9	0.029047	0.145356	0.004222	237	1.01
1	10	344	30	38	33	332.5	0.196804	0.000028	0.000006	180,547	314.41
2	230	20	50	15	35	14.5	0.094316	0.002187	0.000206	4,847	2.52
3	15	229	27	44	29	221.7	0.285970	0.000983	0.000281	3,557	2.04
4	25	124	34	33	33	133.0	0.821106	0.000578	0.000475	2,106	1.54
5	30	123	31	39	30	110.8	0.113338	0.005646	0.000640	1,563	1.36
6	854	10	0	70	30	3.9	0.006642	0.105399	0.000700	1,428	1.32
7	125	19	53	32	16	26.6	0.951589	0.001336	0.001271	787	1.13
8	45	87	30	43	28	73.9	0.062333	0.020844	0.001299	770	1.12
9	493	14	14	64	21	6.7	0.009208	0.173774	0.001600	625	1.09
10	50	68	35	28	37	66.5	0.441886	0.004984	0.002202	454	1.05
11	427	14	0	64	36	7.8	0.027493	0.085070	0.002339	428	1.04
12	60	63	33	37	30	55.4	0.159962	0.025765	0.004121	243	1.01
13	866	7	0	14	86	3.8	0.094030	0.044729	0.004206	238	1.01

90 ranks 15^th in hit significance, 95^th in clustering, and 13^th overall. Results are again insiginificant.

The above test results demonstrate that the author's claim to have found an unusually significant Theomatic factor in 90 in this Luke 15 context is a false one, in agreement with the conclusions drawn from a mathematical analysis of his claims. The results of the author's own test indicate that this factor is not unusually significant, and every single test above indicates this same fact.

Furthermore, these tests demonstrate that no Theomatic factors exist for this subject and context, at least as the author as defined Theomatics: we have exhausted every single possible factor under a number of different perturbations of the text and found nothing notably significant. Certainly, there is no outstanding factor that clearly shines above the rest in overall standing, that is robust enough to evidence significance under these different scenarios... all of which are reasonable contexts in which to expect to find such a phenomenon if it does indeed exist. As we have observed, the unusual significance of factors 20 and 30 is expected, and can easily be explained by the fact that many of the Greek articles are multiples of 10, which results in a much higher percentage of hits for factors that are smaller multiples of 10. This concludes our general testing of Luke 15.

Test results indicate no Theomatic significance of any kind in Luke 15 concerning references to sons in the Parable of the Prodigal Son. All possible factors fail to exhibit general statistical significance in this context. We conclude that this context contains no Theomatic structure.