I'm getting timeouts using flock to query a model with hard maths problems
INSTALL flock FROM community;
LOAD flock;
CREATE SECRET (TYPE OPENAI, BASE_URL 'https://router.huggingface.co/v1', API_KEY 'hf_xxx');
CREATE MODEL('GLM-5.2', 'zai-org/GLM-5.2:cheapest', 'openai');
CREATE PROMPT('answer', 'Anwser the mathematical question. {question}');
SELECT llm_complete(
{'model_name': 'GLM-5.2'},
{'prompt_name': 'answer', 'context_columns': [{'data': problem}]}
)
FROM 'hf://datasets/PeakStars/Math-Instruct/train-00000-of-00001.parquet'
LIMIT 5;
I'm getting
Invalid Error:
[ModelProvider] Invalid JSON response (HTTP 504)
It seems to be because flock doesn't use streaming and the response takes too long to arrive.
Should the openai implementation in flock use streaming instead to avoid such issues ? cc @anasdorbani
impacted inference providers so far: novita, deepinfra, zai-org
I'm getting timeouts using flock to query a model with hard maths problems
I'm getting
It seems to be because flock doesn't use streaming and the response takes too long to arrive.
Should the openai implementation in flock use streaming instead to avoid such issues ? cc @anasdorbani
impacted inference providers so far: novita, deepinfra, zai-org