-
Notifications
You must be signed in to change notification settings - Fork 1
CNPq/Lattes Navigator - Browser-Use Tool for COI Detection #10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
rubenszinho
wants to merge
21
commits into
Labic-ICMC-USP:main
Choose a base branch
from
rubenszinho:main
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…ght) and add missing X11 dependencies for Chromium
…d langchain-openai from requirements simplified LLM instantiation
Change to structured steps, clear STEP 1-4 format and portuguese labels. Add actual section names from Lattes JSON code block. Example wrapped in json for better parsing. Warnings now include response preview. Dual JSON extraction. Checks for json blocks first, then raw JSON. Increased steps, 25 for complex pages.
Direct URL navigation - Using visualizacv.do?id= endpoint instead of profile URL Explicit DO NOT use search engine - Prevents DuckDuckGo fallback Captcha fallback - Returns structured error if blocked Simpler JSON template
…apture all relevant content from agent responses
…ing results and known limitations related to captcha protection and JSON response handling
…avigate directly to lattes URL, include detailed navigation steps and error response handling
…on instructions, explicit wait times, and robust error handling
…on flow, enhancing instructions and error handling for CAPTCHA scenarios
…avigation steps, enhanced wait times, and improved error response handling
…structions, enhancing error handling for no results, and optimizing browser settings for improved performance
…uplication of publications, and improved JSON response structure for activities and evidence details
bf6055b to
bd970bb
Compare
…arnings, refining JSON response structure, and optimizing navigation instructions for better performance
4f1e1fa to
f413c8b
Compare
…ach to profile collection and conflict of interest analysis, improving navigation instructions
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Overview
This tool navigates public CNPq/Lattes researcher profiles to detect Conflicts of Interest (COI) and summarize academic production over a configurable time window (default: 5 years).
Implementation
Dual deployment architecture:
tool/) - Can be imported directly into Open WebUI's Tools interface for natural language interactionapi/) - Standalone REST API deployed on Railway for direct HTTP access and testingLive API endpoint: https://lattes-navigator-api-production.up.railway.app/health
Features
browser-uselibrary with PlaywrightTesting
A demo module (
demo/) was implemented for local testing and validation of agent navigation procedures. During testing, we observed that the Lattes platform implements CAPTCHA challenges to block automated access. A fallback mechanism was added to detect and log these cases gracefully.API reference and test results are documented in:
README.md- Usage, endpoints, COI rulesTESTING.md- Test procedures and observed resultsDeployment
The API is publicly available for testing on Railway:
Note: Open WebUI instance (https://open-webui-production-de8c.up.railway.app/) is temporarily deactivated due to memory constraints (~2GB RAM). Validation is being conducted directly through the FastAPI endpoint.
Known Limitations
Request
curl -s -X POST https://lattes-navigator-api-production.up.railway.app/analyze -H "Content-Type: application/json" -d '{"researchers": [{"name": "Ricardo Marcacini", "lattes_id": "4003190744770195"}], "time_window": 5}' | python3 -m json.toolPrompt:
Tool logs: