CONCATENATED WORD CHALLENGE

The challenge

We have provided a file called "words.txt" which contains a sorted list of approximately 173,000 words. The words are listed one word per line, do not contain spaces, and are all lowercase.

Your task is to write a program that reads the file and provides the following:

the longest concatenated word (that is, the longest word that is comprised entirely of shorter words in the file)
the 2nd longest concatenated word
the total count of all the concatenated words in the file

For example, if the file contained: - cat - cats - catsdogcats - dog - dogcatsdog - hippopotamuses - rat - ratcatdogcat

the longest concatenated word would be 'ratcatdogcat' with 12 characters. 'hippopotamuses' is a longer word, however it is not comprised entirely of shorter words in the list. The 2nd longest concatenated word is 'catsdogcats' with 11 characters. The total number of concatenated words is 3. Note that 'cats' is not a concatenated word because there is no word 's' in the list.

Solution

How does it work?

Words are added to an ad-hoc tree, each tree node will be one character and will have a flag indicating if it's an end of word node.

For example, this list of words: 'ah', 'ahead', 'ead', 'ad' Will provide the following tree:

|
|_ 'a'
|    |_ 'h' <- endOfWord
|    |    |_ 'e'
|    |        |_ 'a'
|    |            |_ 'd' <- endOfWord
|    |
|    |_ 'd' <- endOfWord    
|
|_ 'e'
    |_ 'a' 
        |_ 'd' <- endOfWord

You can use tree.print() for visibility (it will affect execution time)

The algorithm will take a word and traverse the tree. First it will find the original word and all the endOfWords within it and then it will recursively check if the different parts of the word are concatenated.

Let's take 'ahead', we would have 'ah' as the only endOfWord within our word, now we remove 'ah' from 'ahead' and apply the same algorithm to it finding 'ead' and returning true.

Running solutions

JS

Latest node.js and npm required
Clone repo and run npm run start_js.
File called words.txt is required in /src
Average Execution Time for 173k words: 480ms

PY

Latest python, node.js and npm required
Clone repo and run npm run start_py.
File called words.txt is required in /py_src
Average Execution Time for 173k words: 3.500ms

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
py_src		py_src
src		src
.gitignore		.gitignore
README.md		README.md
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CONCATENATED WORD CHALLENGE

The challenge

Solution

How does it work?

Running solutions

JS

PY

About

Uh oh!

Releases

Packages

Languages

DavidOVM/concatWord

Folders and files

Latest commit

History

Repository files navigation

CONCATENATED WORD CHALLENGE

The challenge

Solution

How does it work?

Running solutions

JS

PY

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages