Collocations = typical word combinations

We are able to generate a collocation database with words categorized by the type of grammatical relations, e.g. objects, subjects, modifiers etc. for any word in the languages. Such a database is important for language learning products and also for digital typing assistants designed to suggest to the users a possible replacement of the word with a more natural and idiomatically correct one.

Collocations are identified by the Word Sketch technology in Sketch Engine and supplied as a database download in a number of formats.

Our natural language processing API supports the retrieval of collocations in real time.

The first few collocations of the Spanish word problema together with frequency and strength of collocation value.

object of
----------------------
resolver 179825 11.38
solucionar 116224 10.99
tener 401288 9.09
enfrentar 29200 8.82
haber 143262 8.75
causar 22885 8.41
plantear 21728 8.33
evitar 26164 8.3

Collocation database in these languages

Collocations for more languages can be made available or developed on request.

Supported languages

Afrikaans
Albanian
Amharic
Arabic
Azerbaijani
Basque
Belarusian
Bengali
Bosnian
Bulgarian
Catalan
Chinese Simplified
Chinese Traditional
Croatian
Czech
Danish
Dutch
English
Estonian
Filipino
Finnish
French
Frisian
Georgian
German
Greek
Gujarati
Hausa (Boko)
Hebrew
Hindi
Hungarian
Icelandic
Igbo
Indonesian
Irish
Italian
Japanese
Kannada
Kazakh
Korean
Kyrgyz
Latin
Latvian
Lithuanian
Macedonian
Malayalam
Malay
Maltese
Maori
Mongolian
Nepali
N'Ko
Norwegian Bokmål
Norwegian
Norwegian Nynorsk
Oromo
Persian
Polish
Portuguese
Punjabi (Shahmukhi)
Romanian
Russian
Samoan
Scottish Gaelic
Serbian (Latin)
Serbian
Setswana
Slovak
Slovenian
Somali
Spanish
Swahili
Swedish
Tajik
Tamil
Tatar
Telugu
Thai
Tibetan
Tigrinya
Turkish
Turkmen
Ukrainian
Urdu
Uzbek
Vietnamese
Welsh
Yoruba