While Chalmers' corpus is useful to demonstrate that a connectionist architecture has interesting cognitive aspects, it misses many interesting points of natural language. For one thing, natural language utterances are well-formed and may refer to state of affairs7.7. Thus, the corpus used in the experiment discussed in this section was modified and extended, as described in the following paragraphs.
In this more complex corpus, an additional conjunction and is introduced and full verb conjugations are taken into account. There are 240 sentences in total. The vocabulary is summarized in Table 7.2. There is considerable variation added to the corpus as compared to that used in [66]. For instance, the past participle of hit is hit, but that of love is loved. In addition, the conjunction and introduces the plurality of actor and recipient. A strategy that manipulates symbols simply on account of position will not work in this case. Moreover, hit (as past participle) and hit (as verb present plural) have the same eigenstate. In other words, they are indistinguishable according to the formulation operator
of common English. As for the verb love, there are three eigenstates associated with it (loves, love, loved). Some examples are listed below.
helen hits john <-> john is hit by helen
helen loves john <-> john is loved by helen
john kills diane and michael <-> diane and michael are killed by john
|
Fifty-six sentences (23% of the corpus) have been randomly chosen as the training set. The other 184 sentences are reserved as test. Using the same optimization algorithm as in the previous section, the quantum mechanical architecture can learn all the utterances in the training set. The architecture can generalize the task on all sentences in the test set (generalization rate is 100%). Given the complexity of the corpus (in comparison with that used in Chalmers' study) and the small size of the training set, this is a very encouraging result.
A typical training curve is shown in Figure 7.13. Using the conjugate gradient method, for around 100 epochs the architecture can learn all the instances in the training set. To visualize more details, the output of an utterance in the training set,
helen and diane hit john -> john is hit by helen and diane
is shown in Figure 7.14, in which each component is illustrated as a vector on the complex plane. A comparison of the output to the target utterance is shown in Figure 7.15. The first two rows are the absolute squares (represented by the area of the black disks) of the targets and outputs respectively. As can be seen in the figure, seven eigenstates have the most significant coefficients. The lower two rows are the phases of target and output vector, respectively.
![]() |
An output of an utterance in the test set,
john kills diane and michael <-> diane and michael are killed by john
is shown in Figure 7.16. The quantum architecture has never seen the utterance, it is remarkable that the differences in absolute squares and phases are hardly noticeable.
![]() |
Theoretically, a quantum mechanical architecture can perform the reverse computation if time is reversed. In this case, if the output state of affairs is subject to the inverse of the unitary operator using
![]() |
![]() |
![]() |
If, however, the input is a well-formed sentence but cannot be transformed to the passive form in the language, such as,
john kills
the system can still arrive at some reasonable solution. This is shown in Figure 7.18 and Figure 7.19. As can be seen in Figure 7.18, four eigenstates (is, killed, by, john) have the most significant components in the end state of affairs vector. In a sense, it suggests a well-formed utterance:
somebody is killed by john
However, in the miniature language we use here, it is not possible to identify who is killed.