# Create N-gram Tokens

## How to Access This Feature

### From + (plus) Button

You can access it from 'Add' (Plus) button. "Text Mining..." -> "Create N-Gram Tokens". ![](https://2850417076-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M4HLCK3olgduYoe3RVS%2F-M4oMvCUDQwHTJ0eWi_f%2F-M4oNEkROxW7sWNLSicM%2Fdo_ngram_add.png?generation=1586795488131351\&alt=media)

## How to Use?

![](https://2850417076-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M4HLCK3olgduYoe3RVS%2F-M4oMvCUDQwHTJ0eWi_f%2F-M4oNEkTnooX2zORi-45%2Fdo_ngram_param.png?generation=1586795488201423\&alt=media)

* Select a column that has tokenized text - Set a column that has tokens. This is "token" column if it's tokenized by [do\_tokenize](https://docs.exploratory.io/main/do_tokenize) function.
* Select a column as a document - A column considered as document id. If you run [do\_tokenize](https://docs.exploratory.io/main/do_tokenize) beforehand, this can be document\_id.
* Select a column as sentence id - A column considered as sentence id in a document. If you run do\_tokenize beforehand, this can be sentence\_id.
* Max # of Tokens to be Concatenated (Optional) - The default is 2. Maximum number of tokens to be connected.
* Text to Concatenate Words (Optional) - The default is "\_". Character to be used to connect ngrams.

  )
