[SC-16225] Add Gemini support by juanmleng · Pull Request #513 · validmind/validmind-library

juanmleng · 2026-05-19T12:36:00Z

Pull Request Description

What and why?

Added Gemini support across the library’s three LLM evaluation paths: prompt evaluation tests, RAGAS tests, and DeepEval scorers.

Before this change, the shared judge config only supported OpenAI/Azure and Gemini was not wired consistently across all evaluators. After this change, all three evaluation paths can use Gemini, RAGAS handles Gemini finish reasons more reliably, and the judge-configuration notebook now documents the current environment variables and defaults used by the library.

How to test

Comment OPENAI_API_KEY key in your .env
Set GEMINI_API_KEY, and optionally GEMINI_MODEL and GEMINI_EMBEDDINGS_MODEL
Run notebooks/how_to/run_tests/configure_tests/configure_judge_llms.ipynb

What needs special review?

Dependencies, breaking changes, and deployment notes

Release notes

Added Gemini support for ValidMind prompt evaluation tests, RAGAS tests, and DeepEval scorers, and updated the judge-configuration notebook to reflect the current environment variables and defaults.

Checklist

Co-authored-by: Cursor <cursoragent@cursor.com>

johnwalz97

very nice! lgtm

AnilSorathiya

Nice!

Co-authored-by: Cursor <cursoragent@cursor.com>

github-actions · 2026-05-22T17:18:36Z

PR Summary

This PR makes extensive modifications to support an additional LLM provider (Gemini) in the ValidMind Library. Significant changes include:

• Updates to the configuration routines in the AI utilities (in validmind/ai/utils.py) so that the library now selects among OpenAI, Azure, and Gemini providers based on environment variables. The functions now consistently leverage a new helper (_get_configured_provider) and define default model names/constants (e.g. GEMINI_MODEL and GEMINI_EMBEDDINGS_MODEL).

• Refactoring of the get_client_and_model and get_judge_config functions. The judge configuration now correctly calls provider-specific methods, using a Gemini-specific implementation (_build_gemini_judge_config) when appropriate and falling back to the OpenAI method otherwise.

• Introduction of a new function, get_deepeval_model, which returns a native model object (or wraps it) appropriately for DeepEval scorers. This ensures that the Gemini provider is correctly supported, and the scoring modules in the deepeval subpackage have been updated to call get_deepeval_model in lieu of get_client_and_model.

• Updates across multiple scorer implementations (in the deepeval folder) and prompt validation tests to integrate the new provider configuration, ensuring that all evaluation paths consistently use the updated configuration logic.

• Addition of new unit tests in the tests/unit_tests directory to verify proper handling of Gemini credentials and parsing of finish signals in RAGAS tests. The test files have been augmented to mock environment variables for Gemini, confirm that the correct models are instantiated, and validate that the new finish parser behaves as expected.

In summary, this PR unifies LLM evaluation configuration across multiple usage points and extends the support to Gemini-based APIs, ensuring that users with different credentials experience consistent behavior across prompt-validation, RAGAS, and DeepEval workflows.

Test Suggestions

Write integration tests that set the GEMINI_API_KEY (or equivalent) environment variable and verify that get_client_and_model returns the expected model configuration.
Test the _ragas_is_finished_parser function with different finish reasons including 'stop', 'max_tokens', and other edge cases to ensure it returns True as expected.
Add tests for get_deepeval_model to ensure that when Gemini credentials are set, the function imports the appropriate module and returns an instance of the Gemini model.
Simulate missing dependency scenarios (e.g., when langchain_google_genai is not installed) to ensure proper ImportError messages are raised.

cachafla

Looks great 👍

juanmleng and others added 2 commits May 19, 2026 14:15

add Gemini support for LLM evaluations

8653a03

Co-authored-by: Cursor <cursoragent@cursor.com>

refactor judge config helpers

eac7386

Co-authored-by: Cursor <cursoragent@cursor.com>

juanmleng self-assigned this May 19, 2026

juanmleng added the enhancement New feature or request label May 19, 2026

juanmleng requested review from AnilSorathiya, cachafla, johnwalz97 and nibalizer May 19, 2026 13:53

johnwalz97 approved these changes May 19, 2026

View reviewed changes

AnilSorathiya approved these changes May 20, 2026

View reviewed changes

nrichers added the support Support-related PR label May 20, 2026

juanmleng and others added 2 commits May 22, 2026 19:13

Merge branch 'main' into juan/sc-16225/add-llm-config-support

ef666ab

2.13.3

07a8dac

Co-authored-by: Cursor <cursoragent@cursor.com>

cachafla approved these changes May 22, 2026

View reviewed changes

juanmleng merged commit daca818 into main May 22, 2026
21 checks passed

juanmleng deleted the juan/sc-16225/add-llm-config-support branch May 22, 2026 18:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SC-16225] Add Gemini support#513

[SC-16225] Add Gemini support#513
juanmleng merged 4 commits into
mainfrom
juan/sc-16225/add-llm-config-support

juanmleng commented May 19, 2026

Uh oh!

johnwalz97 left a comment

Uh oh!

AnilSorathiya left a comment

Uh oh!

github-actions Bot commented May 22, 2026

Uh oh!

cachafla left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

juanmleng commented May 19, 2026

Pull Request Description

What and why?

How to test

What needs special review?

Dependencies, breaking changes, and deployment notes

Release notes

Checklist

Uh oh!

johnwalz97 left a comment

Choose a reason for hiding this comment

Uh oh!

AnilSorathiya left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented May 22, 2026

PR Summary

Test Suggestions

Uh oh!

cachafla left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants