Phase wise evaluation
🧩 Phase 1: Initialization
This phase:
Loads env variables
Connects to Weaviate
Loads evaluation JSON
Creates evaluator instance
🧩 Phase 2: Evaluate All Methods
This is the core loop:
For each category and query
Run all query methods
Calculate 3 metrics (Precision, Recall, Relevancy)
Log & store results
🧩 Phase 3: Result Analysis + Export
This phase:
Aggregates scores across all queries
Calculates final average P, R, RR, Score
Finds the best method
Writes to CSV and JSON