SIL International Home

The Linguist's Shoebox

Integrated data management and analysis for the field linguist


You can use sorting and filtering to find data items that occur multiple times.

During exploratory data analysis, it can be useful to compare records that have something in common. In Shoebox, you can use sorting and filtering to find data items that occur multiple times. The Non-Unique filter element matches sets of database records that contain identical sorting fields. For example, here are some interesting kinds of non-unique items in lexical databases:

Sorting field Non-Unique items
\lx Homonyms, homophones, or homographs. If you have entered homonym numbers, you can use \hm as a secondary sorting field to check that all the lexical records are uniquely identified. For more information, read section 6.3 in Making Dictionaries.
\ge, \gn, or \gr
\re, \rn, or \rr
Vernacular words or phrases that have a common translation approximation in one of the languages of description. If the same gloss occurs many times, you might want to make more specific distinctions. For example, although cut is a good generic gloss in the \ge field for interlinearizing, you might use cut into sections, cut off, cut s.t. short, etc. as more specific glosses in the \re field for the index (finderlist) that can be exported using the Multi-Dictionary Formatter (MDF). For more information, read section 2.3 in Making Dictionaries.
\sc Vernacular terms for life forms that have the same scientific name (i.e., the same classification in the scientific taxonomy). For more information, read section 8.1 in Making Dictionaries.

Note: If you want to be able to find multiple occurrences of data items using sorting and filtering, you will need to enter each item in its own field (instead of listing multiple data items in a single field).

Index of tips: data analysis; exploratory data analysis; fields, multiple; filtering; multiple data fields; sorting
List of tips