---
# System prepended metadata

title: 'Matching the Blanks: Distributional Similarity for Relation Learning'

---

###### tags: Paper Reading
# Matching the Blanks: Distributional Similarity for Relation Learning
## Outline
&#160; &#160; This paper is talking about building and training a general purpose relation extractors with Bert.
## Reference
&#160; &#160; ACL 2019
## Introduction/Motivation
&#160; &#160; Building a relation extractor between any two entities is a long standing goal.  Previous work try to use **surface forms**,**joint representation surface forms**, and learing **word embedding** to represent entities' relations. All works above have common result that the extractor is not general enough for different task. So, this paper take use of Bert and concentrate to learning mappings from relation statements to relation representations(Sectoin 2, task definition)
 
## Model
&#160; &#160;Because Bert has not been used in relation represent problem, it propose some variant of inputs and outputs of Bert encoder.
* &#160; &#160; **input**
    * Standard input 
        &#160; &#160;&#160; &#160;Do not specify position of entity
    * Positional embeddings
        &#160; &#160;&#160; &#160;Use segmentation embedding(one of Bert's input). Put all tokens segement types of entity_1 into 1 (entity_2 into 2)
    * Entity marker tokens
        &#160; &#160;&#160; &#160;Add reserved word into  front and end of entities.(E1_start,E1_end,E2_start,E2_end)
* &#160; &#160; **output**
    * [CLS] token
        &#160; &#160;&#160; &#160;Adopt the **[CLS]** output, as relation representation.
    * Entity mention pooling 
        &#160; &#160;&#160; &#160;**Maxpool** all tokens' final hidden layer output respect two entities. And concat them.
    * Entity start state
        &#160; &#160;&#160; &#160; Use concatenation of **output from E1_start and E2_start** as final representation
![](https://i.imgur.com/IzqJyKq.png)
## Matching the blanks
&#160; &#160;Based on the above assumptions, if **two sentences have same two entities, the representation of two sentences should as similar as possible.** As we know information of entities , so we can minimise loss after checking whether entities between two sentences are the same. 
Therefore, this paper randomly(p=0.7) replace entity into reserved word **[blank]** to build  relation extractor pretrained model.
## Experiment
&#160; &#160; section5
## Conclusion
&#160; &#160; This paper propose a noval traing setup: matching the blanks, give us a new idea to generalize our model. It's also propose new architecture for us to use Bert into relation extraction task. Excellent work.