-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[opt](inverted index) Inverted Index Dictionary Compression #245
Conversation
run buildall |
bd09a41
to
17c8947
Compare
@@ -1383,7 +1388,7 @@ void IndexWriter::indexCompaction(std::vector<lucene::store::Directory *> &src_d | |||
proxOutputList.push_back(proxOut); | |||
// Instantiate a new termInfosWriter which will write in directory | |||
// for the segment name segment using the new merged fieldInfos | |||
TermInfosWriter *termInfosWriter = _CLNEW TermInfosWriter(dest_dir, segment.c_str(), fieldInfos, termIndexInterval); | |||
auto* termInfosWriter = _CLNEW STermInfosWriter<char>(dest_dir, segment.c_str(), fieldInfos, termIndexInterval); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why we change STermInfosWriter here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Because we have two types in clucene: STermInfosWriter and TermInfosWriter, I only modified the code logic in STermInfosWriter. My expectation is to maintain only one process within STermInfosWriter.
@@ -1879,7 +1884,8 @@ void IndexWriter::mergeTerms(bool hasProx, IndexVersion indexVersion) { | |||
TermInfo termInfo; | |||
termInfo.set(dfs[i], freqPointer, proxPointer, (int32_t) (skipPointer - freqPointer)); | |||
// Write a new TermInfo | |||
termInfosWriter->add(smallestTerm, &termInfo); | |||
std::string cur_term = lucene_wcstoutf8string(smallestTerm->text(), smallestTerm->textLength()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Find solution to avoid lucene_wcstoutf8string function call here.
3d2ad23
to
a4c9c44
Compare
a4c9c44
to
d8817bf
Compare
No description provided.