Graphs Neural Networks in NLP

Summary

The web content discusses the application and potential of Graph Neural Networks (GNNs) in enhancing NLP tasks by capturing complex linguistic structures, including syntactic, semantic, temporal, and relational information.

Abstract

Graph Neural Networks (GNNs) are emerging as a powerful tool in the field of Natural Language Processing (NLP), offering a means to capture the intricate relationships between words that are essential for understanding language. The article highlights the resurgence of graph-based methods in NLP, emphasizing their ability to model syntactic and semantic parse graphs, knowledge graphs, and temporal dependencies. Despite the shift towards end-to-end deep learning systems, the author argues that GNNs provide a direction of hope for NLP, as they can incorporate structured information that is often overlooked by attention mechanisms. The article also touches on the limitations of current deep learning approaches, such as the inability of LSTMs to handle long-range dependencies effectively, and suggests that GNNs can address these issues by leveraging the rich structures inherent in language. The author provides examples of GNN applications in NLP, such as aspect-based sentiment analysis and fact-based question answering, and points to ongoing research that aims to learn knowledge graphs from plain text in an unsupervised manner.

Opinions

The author believes that GNNs are a promising direction for NLP, countering the trend of flattening state-of-the-art systems due to a lack of novel ideas.
There is a skepticism about the effectiveness of attention mechanisms in capturing syntactic and semantic relationships without additional syntactical constraints.
The author suggests that the use of dependency parse information can improve NLP applications, as demonstrated by the work of He et al. on aspect-based sentiment analysis.
The article expresses enthusiasm for the potential of knowledge graphs in NLP, especially with advancements like GraphSage, which allow generalization to unseen nodes.
The author is optimistic about the future of unsupervised learning of graph structures from text, aligning with the idea that language acquisition may not be learned from scratch but could be influenced by inherent capabilities.
The use of GNNs is seen as a way to leverage the hierarchical nature of language and the world, which could lead to richer embeddings and better understanding of concepts.
Multimodal graphs, which include images, words, and documents, are considered an exciting extension of knowledge graphs, with the potential to process information more comprehensively.

Capturing the semantic, syntactic, temporal and relational structure between words (through GNNs)

Graphs have always formed an essential part of NLP applications ranging from syntax-based Machine Translation, knowledge graph-based question answering, abstract meaning representation for common sense reasoning tasks, and so on. But with the advent of ‘end to end deep learning’ systems, there was a decrease in such traditional parse algorithms. In fact, there have been many debates on flattening of state of the art NLP systems due to lack of completely new ideas!!!!!

“NLP is kind of like a rabbit in the headlights of the Deep Learning machine, waiting to be flattened.”, Neil Lawrence (ICML 2015 workshop).

With this post, I want to mention the recent trends in the field of NLP using Graph Neural Networks (GNNs) and why I think they are one of the directions of hope for the field. For a basic understanding of a GNN- https://www.youtube.com/watch?v=7JELX6DiUxQ

But even if you don't know how GNN’s work this post will give you an idea of where GNN’s could be used in NLP which might give you a motivation to learn about GNN’s.

Syntactic and Semantic Parse Graphs

The image describes the parser output by the Spacy tagger. We can define every node as a word and every edge as the dependency parse tag. Every word can have pos tags as attributes.

Some might argue that powerful attention mechanisms can automatically learn the syntactic and semantic relationships. Though no theoretical work is out there in my knowledge which highlights where is attention ineffective. Here is an example- let’s consider the task of aspect-based sentiment analysis (finding sentiment for each feature- If we are trying to get sentiment for our brand we can get sentiment for different features like- fitting, material, shipping, etc.- These all are different aspects) Now if the sentence given to the network is-

“Its size is ideal and the weight is acceptable.” Attention-based models often identify acceptable as a descriptor of the aspect size, which is in fact not the case. In order to address the issue (He et al.) imposed some syntactical constraints on attention weights.

This gives us a slight hint of why we could use dependency parse as additional information in our NLP applications.

Knowledge Graph

A knowledge graph represents a collection of interlinked descriptions of entities — real-world objects, events, situations, or abstract concepts. Every node is an entity and edges describe relations between them. Most famous KGs in NLP include Dbpedia, WikiData, ConceptNet.

Fact-based question answering is not new for NLP research but it was previously limited to the facts present in the database. But with techniques such as GraphSage(Hamilton et al.), the methods can be generalized to previously unseen nodes. In fact, recent ACL paper (Saxena et al.) generalizes to multi-hop QA for unseen nodes. (Pretty exciting?)

But why graphs? Isn't graph formation from text supervised and our ultimate aim is to make learning unsupervised?

Language acquisition has not yet been proved to be learned from scratch from the human brain. Some scientists argue the presence of inherent capabilities since birth (in the form of semantics- let's keep that for another post).

There has been ongoing work on knowledge acquisition through plain text in an unsupervised fashion (Looks pretty promising?).

The world has hierarchies- We learn words and then build complete sentences.

Multimodal graphs- KGs can be extended to multimodal graphs- which have images, words, and documents to process information

Concepts- As ConceptNet, concepts can be learned in a graphical form in an unsupervised / semi-supervised manner.

These are just my abstract thought on the topic. Feel free to agree or disagree in the comments. For more explanations/comments email on- [email protected]

Graphs Neural Networks in NLP

Capturing the semantic, syntactic, temporal and relational structure between words (through GNNs)

Syntactic and Semantic Parse Graphs

Knowledge Graph

Temporal graphs

But why graphs? Isn't graph formation from text supervised and our ultimate aim is to make learning unsupervised?

References -