A Mathematical Approach to Investigate the
Relationship between Association Memory and Latent
Semantic Analysis in English and Chinese
ABSTRACT
Previous research has attempted to characterize how association memory works. A naïve postulation would assume that association is driven mainly by semantic similarity. The present work not only validates that LSA values are related to association memory norms for both English and Chinese, but also proposes that association memory could be constructed from the similar co-occurrence of words across situations, as viewed from the perspective of LSA. The work further analyzes the constructed association bipartite nets, and the results show that the count of afferent associations is proportional to the strength of association memory. It can be concluded that words associated with many other words have a higher probability of having higher LSA values. Finally, we suggest a possible mechanism for how association memory is formed and depict how words expressing general concepts are more likely to be associated with other words.
Key words: association memory, latent semantic analysis
Scenario assumptions of association memory
The present work proposes a mechanism for how association memory is built. An association between two words is built when the two words appear together in a scene. When one of the connected words is stimulated, the other word is associated. The association is strengthened by the coherence of the two words, and this coherence can be characterized by LSA.
Further, a word expressing a general concept coheres easily with other words and therefore tends to be associated. For example, bird is a general concept relative to nest, feather, pelican, wing, gull, and eagle. The word bird tends to be associated when any of these cue words is stimulated.
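The idea that LSA characterizes the coherence of two words can be illustrated with a small sketch. The poster does not say which LSA engine, corpus, or number of dimensions was used, so everything below is an assumption for illustration: a generic LSA built from a truncated SVD of a made-up word-by-scene count matrix, with cosine similarity as the LSA value. The function names and the counts are hypothetical.

```python
import numpy as np

def lsa_word_vectors(counts, k):
    """Reduce a word-by-scene count matrix to k-dimensional word vectors.

    Generic truncated-SVD LSA; not the authors' engine.
    """
    u, s, _vt = np.linalg.svd(np.asarray(counts, dtype=float), full_matrices=False)
    return u[:, :k] * s[:k]

def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either has zero length)."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

# Made-up counts: rows are words, columns are scenes in which the words occur.
words = ["bird", "nest", "feather", "engine"]
counts = [
    [2, 1, 2, 0],  # bird
    [1, 1, 0, 0],  # nest
    [1, 0, 2, 0],  # feather
    [0, 0, 0, 3],  # engine
]

vectors = lsa_word_vectors(counts, k=2)
sim_bird_nest = cosine(vectors[0], vectors[1])
sim_bird_engine = cosine(vectors[0], vectors[3])
print(f"bird-nest: {sim_bird_nest:.3f}, bird-engine: {sim_bird_engine:.3f}")
```

In this toy matrix bird and nest share scenes while engine never co-occurs with either, so the bird-nest similarity comes out higher than the bird-engine similarity.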
Method
Ming-Liang Wei¹, Chung-Ching Wang¹, Yen-Cheng Chen², Yu-Lin Chang³,
Hsueh-Chih Chen³ (chcjyh@ntnu.edu.tw) , Jon-Fan Hu² (jfhu@mail.ncku.edu.tw)
¹ Department of Electrical Engineering, National Cheng Kung University, Tainan, 701 Taiwan
² Department of Educational Psychology, Tainan, 701 Taiwan
³ Ministry of Education, National Taiwan Normal University, Taipei, 106 Taiwan
[Figure: scenario diagram. Labels: Bird; Bird + Nest; Nest]
Result and Discussion
[Figure: LSA-AM of English. x-axis: LSA; y-axis: forward strength; series: LSA mean; R² = 0.8022]
[Figure: LSA-AM of Chinese. x-axis: LSA; y-axis: forward strength; series: LSA mean; R² = 0.8325]
LACD Lab: Language Acquisition and Cognitive Development Laboratory
The results show that the LSA score is proportional to the mean forward association strength within each decile. Because LSA characterizes the coherence of two words and, under the scenario assumptions, that coherence strengthens association memory, LSA is proportional to the forward strength of association memory.
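The decile analysis described above can be sketched as follows. The poster does not state how the deciles were formed, so this sketch assumes ten equal-width LSA intervals between 0 and 1, averages forward strength within each interval, and computes R² for a straight-line fit through the interval means. The data values and function names are hypothetical, chosen only to show the calculation.

```python
def decile_means(pairs, bins=10):
    """Average forward strength within equal-width LSA bins.

    pairs: iterable of (lsa, forward_strength) with lsa expected in [0, 1].
    Returns (bin_center, mean_strength) for every non-empty bin.
    """
    sums = [0.0] * bins
    counts = [0] * bins
    for lsa, strength in pairs:
        idx = min(max(int(lsa * bins), 0), bins - 1)  # clamp out-of-range values
        sums[idx] += strength
        counts[idx] += 1
    return [((i + 0.5) / bins, sums[i] / counts[i])
            for i in range(bins) if counts[i]]

def r_squared(points):
    """Coefficient of determination of a least-squares line through points."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    if sxx == 0 or syy == 0:
        raise ValueError("need variation in both x and y")
    return sxy * sxy / (sxx * syy)

# Made-up (LSA, forward strength) pairs, not data from the poster.
pairs = [(0.05, 0.01), (0.08, 0.03), (0.35, 0.05), (0.55, 0.08), (0.95, 0.20)]
points = decile_means(pairs)
r2 = r_squared(points)
print(points, round(r2, 4))
```

Averaging within bins before fitting removes pair-level scatter, which is why an R² computed on bin means is usually much higher than one computed on the raw word pairs.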
Moreover, a general concept is easily associated from a large number of cues. In contrast, a sub-concept is harder to associate and is associated from fewer cues. Thus the divergence degree, which is the number of cues associated with the target concept, is proportional to forward association strength. On the other hand, a general concept hardly associates backward to its cues. The results show that the divergence degree is inversely proportional to backward association strength, because a general concept is not specific to any one sub-concept. The more general the target concept is, the lower the probability that a cue is backwardly associated from it.
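The comparison of divergence degree with forward and backward strength can be sketched as below. The poster does not spell out how backward strength was computed, so this sketch assumes a common convention: the backward strength of a cue-target pair is the forward strength of the reversed pair, taken as 0 when the reversed pair does not occur. The strengths and the function name are hypothetical.

```python
from collections import defaultdict
from statistics import mean

def strength_by_divergence(strengths):
    """Group cue-target pairs by the target's divergence degree.

    strengths: dict mapping (cue, target) -> forward strength.
    Returns {divergence_degree: (mean_forward, mean_backward)}.
    """
    cues_by_target = defaultdict(set)
    for cue, target in strengths:
        cues_by_target[target].add(cue)

    forward = defaultdict(list)
    backward = defaultdict(list)
    for (cue, target), p in strengths.items():
        dd = len(cues_by_target[target])
        forward[dd].append(p)
        # Assumed definition: backward strength is the reversed pair's forward strength.
        backward[dd].append(strengths.get((target, cue), 0.0))
    return {dd: (mean(forward[dd]), mean(backward[dd])) for dd in sorted(forward)}

# Made-up forward strengths, not data from the poster.
strengths = {
    ("nest", "bird"): 0.6,
    ("feather", "bird"): 0.7,
    ("gull", "bird"): 0.5,
    ("bird", "nest"): 0.2,
    ("nest", "egg"): 0.4,
}
by_dd = strength_by_divergence(strengths)
for dd, (fwd, bwd) in by_dd.items():
    print(dd, round(fwd, 3), round(bwd, 3))
```

In this toy net the general concept bird has three cues and a high mean forward strength, while its mean backward strength is low because bird rarely leads back to any one of its cues.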
[Figure: Divergence degree-Forward AM. x-axis: divergence degree; y-axis: forward strength; series: Chinese, English]
[Figure: Divergence degree-Backward AM. x-axis: divergence degree; y-axis: backward strength; series: Chinese, English]
Conclusion
In the present work, the relation between LSA and AM is revealed: LSA characterizes the coherence of two words, and the coherence of two words strengthens their association. Further, cues tend to be associated with a general concept, and are less likely to be associated from a general concept.
To evaluate the relation between LSA (latent semantic analysis) and association memory, LSA values and AM (association memory) strengths were collected. AM is the probability that the target word is associated when the cue word is given as a hint. The AM and LSA values were then compared. The scenario assumptions of association memory provide a possible account of how AM and LSA are connected. To test the scenario assumptions, the links between cue words and associated words were used to construct a bipartite net. The DD (divergence degree) of the words in this net and their average association strength were further analyzed. The divergence degree of a concept is the number of cues associated with that concept.
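The quantities defined above can be sketched in a few lines. The sketch assumes that the association norms are available as response counts per cue, which the poster does not specify, and it represents the bipartite net simply as the set of cue-target links. The counts and function names are hypothetical.

```python
from collections import defaultdict

def forward_strengths(responses):
    """Estimate AM as the share of responses to a cue that name the target.

    responses: dict mapping cue -> {target: number of participants}.
    Returns dict mapping (cue, target) -> probability; each key is one
    link of the cue-target bipartite net.
    """
    strengths = {}
    for cue, counts in responses.items():
        total = sum(counts.values())
        if total == 0:
            continue  # no responses recorded for this cue
        for target, n in counts.items():
            strengths[(cue, target)] = n / total
    return strengths

def divergence_degree(strengths):
    """Count, for each target concept, the distinct cues linked to it."""
    cues_by_target = defaultdict(set)
    for cue, target in strengths:
        cues_by_target[target].add(cue)
    return {target: len(cues) for target, cues in cues_by_target.items()}

# Made-up response counts, not data from the poster.
responses = {
    "nest": {"bird": 6, "egg": 4},
    "feather": {"bird": 7, "pillow": 3},
    "gull": {"bird": 5, "sea": 5},
    "bird": {"nest": 2, "fly": 8},
}
am = forward_strengths(responses)
dd = divergence_degree(am)
print(am[("nest", "bird")], dd["bird"])
```

Here bird is reached from three different cues, so its divergence degree is 3, while nest and egg are each reached from a single cue.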
[Figure: flowchart of the method. Labels: word pairs; Chinese AM and English AM; Chinese LSA engine and English LSA engine; Chinese LSA and English LSA; association bipartite net; Chinese DD and English DD; Chinese LSA-AM and English LSA-AM; Chinese LSA-DD and English LSA-DD]