B4513 #92

Closed

idales wants to merge 9 commits into kherud:b3751 from idales:b4513

Conversation

@idales

@idales idales commented Jan 30, 2025

Support for the b4513 version of llama.cpp.

@idales idales closed this Jan 30, 2025
@idales idales reopened this Jan 30, 2025
@idales
Author

idales commented Jan 31, 2025

@kherud,

I've fixed a build issue from commit 4dc9dd1 by moving the external CMake option LLAMA_BUILD_COMMON=ON directly into CMakeLists.txt. However, I'm unsure if the pipeline has run with this update yet.
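The change idales describes could look roughly like this in CMakeLists.txt (a hedged sketch; LLAMA_BUILD_COMMON is a real llama.cpp CMake option, but the exact way this project integrates llama.cpp, e.g. add_subdirectory vs. FetchContent, is assumed here):

```cmake
# Force the option on inside the build script instead of requiring
# callers to pass -DLLAMA_BUILD_COMMON=ON on the command line.
# The add_subdirectory() call below assumes llama.cpp is vendored
# as a subdirectory; adjust to the project's actual integration.
set(LLAMA_BUILD_COMMON ON CACHE BOOL "Build llama.cpp common library" FORCE)
add_subdirectory(llama.cpp)
```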

@kherud
Owner

kherud commented Jan 31, 2025

Hi @idales thank you very much for this PR! I'll look at it over the weekend and merge it. Until then I'll try to approve running the pipeline.

@kherud
Owner

kherud commented Feb 3, 2025

Hi @idales, building the C++ code locally, I get the same error as the CI runners: GGML_ASSERT(n_threads > 0) failed. Unfortunately I didn't have the time to look more deeply into it over the weekend. Does it work for you (and if yes, on which platform)? I'll investigate it more closely over the week.

@vaiju1981
Contributor

I was able to get past it by setting modelParameters.setNthreads(8) instead of relying on the default value. But I have another error that I'm looking into.

Error: libc++abi: terminating due to uncaught exception of type nlohmann::json_abi_v3_11_3::detail::type_error: [json.exception.type_error.302] type must be number, but is array

@kherud
Owner

kherud commented Feb 3, 2025

I was able to get past it by setting modelParameters.setNthreads(8) instead of relying on the default value. But I have another error that I'm looking into.

Awesome, nice hint, thanks!

Error: libc++abi: terminating due to uncaught exception of type nlohmann::json_abi_v3_11_3::detail::type_error: [json.exception.type_error.302] type must be number, but is array

The Java library has the two classes ModelParameters and InferenceParameters to configure llama.cpp. Both classes create a JSON string under the hood, which is then passed to llama.cpp (since it's easier to re-use llama.cpp code that way). This string is parsed via nlohmann::json. My suspicion is that in earlier versions there was a parameter that used to be an array and still is in the two Java classes, but the newer llama.cpp version now expects a single number instead.
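To illustrate the mismatch described above, here is a minimal, self-contained Java sketch (not the actual java-llama.cpp code) of how a parameter class might serialize options into a JSON string. An array-valued field such as tensor_split is exactly what a parser expecting a number would reject with json.exception.type_error.302:

```java
// Illustrative sketch: the Java parameter classes build a JSON string
// that the C++ side parses with nlohmann::json. If a field like
// "tensor_split" is emitted as an array while the newer llama.cpp
// expects a plain number, parsing fails with type_error.302.
public class ParamJsonSketch {
    static String buildParams(float[] tensorSplit, int nThreads) {
        StringBuilder sb = new StringBuilder("{");
        sb.append("\"n_threads\":").append(nThreads).append(",");
        // Serialized as a JSON array -- the shape the new parser rejects
        // for fields it now expects to be a single number.
        sb.append("\"tensor_split\":[");
        for (int i = 0; i < tensorSplit.length; i++) {
            if (i > 0) sb.append(",");
            sb.append(tensorSplit[i]);
        }
        sb.append("]}");
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(buildParams(new float[]{0.5f, 0.5f}, 8));
    }
}
```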

@vaiju1981
Contributor

Looks like it's tensor_split (abetlen/llama-cpp-python#1016), but I don't see it set anywhere, and I even removed (commented out) the [] brackets in the code.

Comment on lines -357 to 406
```diff
 {
-    gpt_params params;
+    common_params params;
 
     auto *ctx_server = new server_context();
 
-    std::string c_params = parse_jstring(env, jparams);
-    json json_params = json::parse(c_params);
-    server_params_parse(json_params, params);
-
-    if (json_value(json_params, "disable_log", false))
+    const jsize argc = env->GetArrayLength(jparams);
+    char **argv = parse_string_array(env, jparams, argc);
+    if (argv == nullptr)
     {
-        log_disable();
-    }
-    else
-    {
-        log_enable();
+        return;
     }
 
-    if (!params.system_prompt.empty())
+    const auto parsed_params = common_params_parse(argc, argv, params, LLAMA_EXAMPLE_SERVER);
+    free_string_array(argv, argc);
+    if (!parsed_params)
     {
-        ctx_server->system_prompt_set(params.system_prompt);
+        return;
     }
```
@kherud
Owner

@vaiju1981 I changed how the model parameters are passed from Java to C++. The code now assembles a CLI string (e.g. something like --model /path/to/file -c 2048). This way we can better re-use the existing llama.cpp code to parse the parameters and don't have to rely on custom logic. So far it seems to work great, but I'm now stuck after requesting a completion and getting no output. I'll investigate further over the next few days.
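The approach described above, assembling CLI-style arguments on the Java side so that llama.cpp's own common_params_parse can handle them, can be sketched as follows (illustrative names only, not the actual implementation):

```java
import java.util.ArrayList;
import java.util.List;

// Hedged sketch: instead of serializing parameters to JSON, the Java
// side builds argv-style arguments (e.g. "--model ... -c 2048") that
// llama.cpp's existing argument parser already understands. Flag names
// mirror llama.cpp's CLI; the helper itself is hypothetical.
public class CliArgsSketch {
    static String[] toArgs(String modelPath, int ctxSize, int nThreads) {
        List<String> args = new ArrayList<>();
        args.add("--model");   args.add(modelPath);
        args.add("-c");        args.add(Integer.toString(ctxSize));
        args.add("--threads"); args.add(Integer.toString(nThreads));
        return args.toArray(new String[0]);
    }

    public static void main(String[] args) {
        String[] a = toArgs("/path/to/model.gguf", 2048, 8);
        System.out.println(String.join(" ", a));
    }
}
```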

@idales
Author

idales commented Feb 4, 2025

Hi @idales, building the C++ code locally, I get the same error as the CI runners: GGML_ASSERT(n_threads > 0) failed. Unfortunately I didn't have the time to look more deeply into it over the weekend. Does it work for you (and if yes, on which platform)? I'll investigate it more closely over the week.

Hi @kherud
I am building an Android project on Ubuntu. In the project I load a model and run completions. Indeed, the latest versions of llama.cpp require the number of threads to be set explicitly; I do this in my code.

@vaiju1981
Contributor

vaiju1981 commented Feb 4, 2025

I am able to pass all the tests except for embedding. I will keep you posted if I can fix it.

Right now the only issue is with the embedding call: libc++abi: terminating due to uncaught exception of type nlohmann::json_abi_v3_11_3::detail::type_error: [json.exception.type_error.302] type must be number, but is array

@vaiju1981
Contributor

How important are the testLogJSON and testLogText tests? Apart from these and embedding, I am able to run all the other tests.

@kherud
Owner

kherud commented Feb 7, 2025

I'd say if logging in general works, don't worry about the tests. Logging is tricky to test, so the tests were never really reliable anyway. Feel free to shoot a pull request 👍

@vaiju1981
Contributor

vaiju1981 commented Feb 12, 2025

Apart from logging, the rest is working fine. I also updated to the latest release tag of llama.cpp.

I created new PR: #93

It kept giving me a permission error when updating this branch.

@kherud
Owner

kherud commented Mar 9, 2025

Thank you again for this PR. We finally finished #93 and just released a new major version to upgrade to the newest available llama.cpp version. Thus I'll close this PR.

@kherud kherud closed this Mar 9, 2025
3 participants