One of the most common problems with automatically created content is that the system creates unnecessary duplicate content. That is the system creates content which is basically the same as existing content, just with slightly different translation. This is not helpful for the student.
Examples:
- "driver's licence" and "driving licence" (basically the same thing)
- "walk (2st person singular)", "you walk", "you are walking", "are you walking" (basically all variants of the same thing)
Let's take a look at an example.
The phrase "I don't have a driver's license" has had a new entry "driving license" created and added as a breakdown item. The first thing to notice is that in the sentence the wording is "driver's license" but the entry in the breakdown is "driving licence", which is already not correct, since they should ideally match:
When we look in more detail at the new "driving licence" entry we see the following:
From this we see that there is indeed already an entry called "driver's licence". It says in the warning "This is a duplicate of drivers' licence..." so we should:
- change all the entries which use the new "driving licence" to use the existing "driver's licence"
- delete this new unwanted "driving licence" entry
We can see that there are currently 2 examples which use the unwanted "driving licence" form of the word.
These examples we need to change to use the "driver's licence" version:
Then, when we are sure that "driving licence" no longer has examples which use it, we can delete it. We see below that now "driving licence" has no examples, and so can safely be deleted:
