Large Language Model Evaluation


[Up] [Top]

Documentation for package ‘vitals’ version 0.1.0

Help Pages

are An R Eval
detect_answer Scoring with string detection
detect_exact Scoring with string detection
detect_includes Scoring with string detection
detect_match Scoring with string detection
detect_pattern Scoring with string detection
generate Convert a chat to a solver function
model_graded_fact Model-based scoring
model_graded_qa Model-based scoring
scorer_detect Scoring with string detection
scorer_model Model-based scoring
Task Creating and evaluating tasks
vitals_bind Concatenate task samples for analysis
vitals_bundle Prepare logs for deployment
vitals_log_dir The log directory
vitals_log_dir_set The log directory
vitals_view Interactively view local evaluation logs