A searcher for the documents on the local host, based on Tika, Elasticsearch and Kibana.
- Install Elasticsearch with Docker
Testing: curl -X GET "localhost:9200/_cat/nodes?v&pretty"
Java REST Client - Installing Elasticsearch
See the Gradle task install under elasticsearch-server
UI dashboard for Elasticsearch
- Install Kibana with Docker
- Installing Kibana
See the Gradle task install under kibana-server
See TikaOCR
See Recursive Metadata and Content
This is a Spring Boot application whose main tasks are:
- scanning the files in a dedicated folder every minute
- extracting the content of the files via the Tika+OCR server API
- storing the metadata and the content of the files in Elasticsearch
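The three tasks above can be sketched with the plain JDK (Java 11+) as a rough illustration. This is not the application's actual code: a Spring Boot application would more likely use a scheduled bean, and the class name `DocIndexerSketch`, the `docs` folder, the `docs` index name and both URLs are assumptions made for the example. It assumes a Tika server listening on its default port 9998 and Elasticsearch on 9200, and it stores only the file path and the extracted text.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class DocIndexerSketch {
    // Assumed endpoints; adjust to the actual docker-compose setup.
    static final String TIKA_URL = "http://localhost:9998/tika";
    static final String ES_URL = "http://localhost:9200/docs/_doc"; // "docs" is a hypothetical index

    static final HttpClient HTTP = HttpClient.newHttpClient();

    /** Escapes a string so it can be embedded in a JSON string literal. */
    static String jsonEscape(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
            }
        }
        return sb.toString();
    }

    /** Builds the JSON document that is sent to Elasticsearch. */
    static String toJson(String path, String content) {
        return "{\"path\":\"" + jsonEscape(path) + "\",\"content\":\"" + jsonEscape(content) + "\"}";
    }

    /** Sends the file to the Tika server and returns the extracted plain text. */
    static String extractText(Path file) throws IOException, InterruptedException {
        HttpRequest req = HttpRequest.newBuilder(URI.create(TIKA_URL))
                .header("Accept", "text/plain")
                .PUT(HttpRequest.BodyPublishers.ofFile(file))
                .build();
        return HTTP.send(req, HttpResponse.BodyHandlers.ofString()).body();
    }

    /** Indexes one document and returns the HTTP status code. */
    static int index(Path file, String content) throws IOException, InterruptedException {
        HttpRequest req = HttpRequest.newBuilder(URI.create(ES_URL))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(toJson(file.toString(), content)))
                .build();
        return HTTP.send(req, HttpResponse.BodyHandlers.discarding()).statusCode();
    }

    /** Lists all regular files under the watched folder, recursively. */
    static List<Path> listFiles(Path dir) throws IOException {
        try (Stream<Path> paths = Files.walk(dir)) {
            return paths.filter(Files::isRegularFile).sorted().collect(Collectors.toList());
        }
    }

    /** One scan pass: extract and index every file; errors are logged, not rethrown. */
    static void scanOnce(Path dir) {
        try {
            for (Path file : listFiles(dir)) {
                index(file, extractText(file));
            }
        } catch (Exception e) {
            // Swallow the error so the scheduler keeps running on the next tick.
            System.err.println("Scan failed: " + e);
        }
    }

    public static void main(String[] args) {
        Path dir = Path.of(args.length > 0 ? args[0] : "docs"); // hypothetical folder name
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        // Run immediately, then wait one minute after each pass finishes.
        scheduler.scheduleWithFixedDelay(() -> scanOnce(dir), 0, 1, TimeUnit.MINUTES);
    }
}
```

Note that this sketch posts a new document on every pass, so unchanged files would be indexed again each minute. A real implementation would probably derive a stable document ID from the file path or skip files that have not changed.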
./gradlew clean build
./gradlew clean build; docker-compose up --force-recreate --build
- Prune unused Docker objects:
docker system prune -f
- To clear containers:
docker rm -f $(docker ps -a -q)
- To clear images:
docker rmi -f $(docker images -a -q)
- To clear volumes:
docker volume rm $(docker volume ls -q)
- To clear networks:
docker network rm $(docker network ls | tail -n+2 | awk '{if($2 !~ /bridge|none|host/){ print $1 }}')