I use the following command to build the Anserini index:
python -m pyserini.index.lucene \
-collection JsonVectorCollection \
-input "experiments/spladev2/out/docs_anserini" \
-index "experiments/spladev2/out/anserini_index/" \
-generator DefaultLuceneDocumentGenerator \
-threads 16 -impact -pretokenized \
-optimize
and
./ciff/target/appassembler/bin/ExportAnseriniLuceneIndex -index "experiments/spladev2/out/anserini_index/" -output experiments/spladev2/out/anserini_index.ciff
to export the index from Anserini to CIFF.
However, I get the following error:
Exception in thread "main" java.lang.IllegalArgumentException: indexCreatedVersionMajor is in the future: 9
at org.apache.lucene.index.SegmentInfos.<init>(SegmentInfos.java:169)
at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:327)
at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:291)
at org.apache.lucene.index.StandardDirectoryReader$1.doBody(StandardDirectoryReader.java:64)
at org.apache.lucene.index.StandardDirectoryReader$1.doBody(StandardDirectoryReader.java:61)
at org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:720)
at org.apache.lucene.index.StandardDirectoryReader.open(StandardDirectoryReader.java:84)
at org.apache.lucene.index.DirectoryReader.open(DirectoryReader.java:64)
at io.osirrc.ciff.lucene.ExportAnseriniLuceneIndex.main(ExportAnseriniLuceneIndex.java:136)
Suppressed: org.apache.lucene.index.CorruptIndexException: checksum passed (b2b4eb97). possibly transient resource issue, or a Lucene or JVM bug (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/home/jupyter/neural-ranking/splade/experiments/cocondenser_kldiv_distil_01-05_190239/out/anserini_index/segments_2")))
at org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:466)
at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:434)
... 7 more
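My reading of the message, which may be wrong, is that Lucene raises it when the index on disk records a creation major version newer than the Lucene library that is reading it, so the 9 would be the Lucene major version that wrote the index. Here is a minimal sketch that pulls that number out of the message so it can be compared with the Lucene version the CIFF exporter was built against. The helper name `index_major_from_error` is my own and hypothetical, not part of Pyserini, Anserini, or CIFF.

```python
import re

# Error text copied from the first line of the trace above.
ERROR_LINE = (
    'Exception in thread "main" java.lang.IllegalArgumentException: '
    "indexCreatedVersionMajor is in the future: 9"
)


def index_major_from_error(message):
    """Return the major version reported in the message, or None if absent.

    Hypothetical helper for illustration only.
    """
    match = re.search(r"indexCreatedVersionMajor is in the future: (\d+)", message)
    return int(match.group(1)) if match else None


index_major = index_major_from_error(ERROR_LINE)
print(index_major)
```

If this reading is right, the exporter would need to run against a Lucene release whose major version is at least the number extracted here.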
Can you help me solve this error? If you need more information, don't hesitate to let me know. Thanks so much!