Skip to content

Commit a35b013

Browse files
committed
[WIP] Prepare to stream tokens for GPU - remove redundant comment about fallback to regular list access in Llama.java
1 parent 5f740c9 commit a35b013

File tree

1 file changed

+0
-1
lines changed
  • src/main/java/com/example/inference/engine/impl

1 file changed

+0
-1
lines changed

src/main/java/com/example/inference/engine/impl/Llama.java

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -218,7 +218,6 @@ public static List<Integer> generateTokensGPU(Llama model, State state, int star
218218
// Try to extract the underlying array for faster access
219219
try {
220220
// This is a performance optimization that may not work on all JVMs
221-
// Fall back to regular list access if it fails
222221
promptTokenArray = promptTokens.stream().mapToInt(Integer::intValue).toArray();
223222
} catch (Exception e) {
224223
// Fall back to list access

0 commit comments

Comments
 (0)