-| `--approach` | Inference approach to use | `"auto"` |
-| `--simulations` | Number of MCTS simulations | 2 |
-| `--exploration` | Exploration weight for MCTS | 0.2 |
-| `--depth` | Simulation depth for MCTS | 1 |
-| `--best-of-n` | Number of samples for best_of_n approach | 3 |
-| `--model` | OpenAI model to use | `"gpt-4o-mini"` |
-| `--base-url` | Base URL for OpenAI compatible endpoint | `""` |
-| `--rstar-max-depth` | Maximum depth for rStar algorithm | 3 |
-| `--rstar-num-rollouts` | Number of rollouts for rStar algorithm | 5 |
-| `--rstar-c` | Exploration constant for rStar algorithm | 1.4 |
-| `--n` | Number of final responses to be returned | 1 |
-| `--return-full-response` | Return the full response including the CoT with <thinking> tags | `False` |
-| `--port` | Specify the port to run the proxy | 8000 |
-| `--optillm-api-key` | Optional API key for client authentication to optillm | `""` |
-| `--cepo_bestofn_n` | Number of responses to be generated in best of n stage | 3 |
-| `--cepo_bestofn_temperature` | Temperature for verifier in best of n stage | 0.1 |
-| `--cepo_bestofn_max_tokens` | Maximum number of tokens for verifier in best of n stage | 4096 |
-| `--cepo_bestofn_rating_type` | Type of rating in best of n stage ("absolute" or "pairwise") | `"absolute"` |
-| `--cepo_planning_n` | Number of plans generated in planning stage | 3 |
-| `--cepo_planning_m` | Number of attempts to generate n plans in planning stage | 6 |
-| `--cepo_planning_temperature_step1` | Temperature for generator in step 1 of planning stage | 0.55 |
-| `--cepo_planning_temperature_step2` | Temperature for generator in step 2 of planning stage | 0.25 |
-| `--cepo_planning_temperature_step3` | Temperature for generator in step 3 of planning stage | 0.1 |
-| `--cepo_planning_temperature_step4` | Temperature for generator in step 4 of planning stage | 0 |
-| `--cepo_planning_max_tokens_step1` | Maximum number of tokens in step 1 of planning stage | 4096 |
-| `--cepo_planning_max_tokens_step2` | Maximum number of tokens in step 2 of planning stage | 4096 |
-| `--cepo_planning_max_tokens_step3` | Maximum number of tokens in step 3 of planning stage | 4096 |
-| `--cepo_planning_max_tokens_step4` | Maximum number of tokens in step 4 of planning stage | 4096 |
-| `--cepo_print_output` | Whether to print the output of each stage | `False` |
-| `--cepo_config_file` | Path to CePO configuration file | None |
-
-When using Docker, these can be set as environment variables prefixed with `OPTILLM_`.
+## Available parameters
+
+optillm supports various command-line arguments for configuration. When using Docker, these can also be set as environment variables prefixed with `OPTILLM_`.
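
The `OPTILLM_` prefix rule can be illustrated with a small sketch. The diff only states that the variables carry the prefix; the rest of the mapping here (dropping the leading dashes, turning hyphens into underscores, and upper-casing) is an assumption, and `flag_to_env_var` is a hypothetical helper, not part of optillm. Check the project's own documentation for the exact variable names.

```python
def flag_to_env_var(flag: str, prefix: str = "OPTILLM_") -> str:
    """Map a CLI flag such as '--best-of-n' to a prefixed environment variable name.

    Assumed convention (not confirmed by the diff): strip the leading dashes,
    replace remaining hyphens with underscores, and upper-case the result.
    """
    name = flag.lstrip("-").replace("-", "_").upper()
    return prefix + name


if __name__ == "__main__":
    # Flags taken from the parameter table in the diff.
    for flag in ("--approach", "--best-of-n", "--cepo_planning_n"):
        print(f"{flag} -> {flag_to_env_var(flag)}")
```

Under this assumed convention, a Docker deployment would pass the resulting names as container environment variables instead of command-line flags.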