Grew-match: Online Graph Matching

Grew-match is a one page online web application for searching graph patterns in treebanks. In the current version, the treebanks available are:

If you want to use it on some other corpora, you can run your own Grew-match following the instructions on Local installation of Grew-match.

Basic usage

  1. Select the corpus on which you want to search (Click on Show corpora list if needed)
  2. Enter the search pattern in the text area (you may use some snippets on the right of the text area)
  3. Click on Search

The number of items is displayed and the first 10 items can be explored. If you want to see the next 10 items, click on Get more results.

To limit server usage, only the first 1000 items are computed. If the searched pattern is found more then 1000 times, the amount of corpus used to find the first 1000 items is reported. For instance, if you search for a nsubj relation in the UD_English corpus, the output message is More than 1000 results found in 6.01% of the corpus. This means that the first 1000 items were found in 6.01% of the 16,622 sentences of the UD_English corpus.

Learning syntax

A tutorial with a progressive sequence of patterns is available. You may also explore snippets given on the right of the text area to learn with other examples.

About CoNLL field names

The fields 2, 3, 4 and 5 of CoNLL structure are considered as features with the following feature names.

CoNLL field 2 3 4 5
Name form lemma upos xpos

For instance, if you want to search:

NB In former version of the Grew code, columns were associated to feature names according to the table below.

CoNLL field 2 3 4 5
Name phon lemma cat pos

In order to keep backward compatibility, the features names phon, cat and pos are automatically replaced by form, upos and xpos respectively. As a consequence, it is not recommended to use the 3 names phon, cat and pos in new GRS.

Display options

Below the textarea, a few checkboxes are available to choose the information to be displayed

Access to CoNNL-U specificities

Additional information available in the CoNNL-U format can be shown or accessed through special features:

These special features can be used in patterns. For instance, searching for an empty node which is a verb:

pattern { N[upos=VERB, _UD_empty=Yes] }

Enhanced dependencies are displayed in blue below the sentence. In pattern, a enhanced dependency can be searched with the prefix E:. Searching for a enhanced obl relation in UD_English without a non-enhanced counterpart (see output in UD_English):

pattern { N -[E:obj]-> M }
without { N -[obj]-> M }

Contact

For any remark or request, you can either contact us or open an issue on the GitLab project (you will have to register).