Skip to content
View goktugozkanmd's full-sized avatar
  • Remote

Block or report goktugozkanmd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. medical-ai-failure-atlas medical-ai-failure-atlas Public

    Clinician led synthetic medical AI safety evaluation resources: Failure Atlas, SourceCheckup, Turkish medical language risk, and outside objection routes.

    Python 1

  2. tr-ai-card-radar tr-ai-card-radar Public

    Audit Hugging Face model/dataset cards for Turkish AI resources and write small, reproducible metadata reports. Clinician-led, open-source. No model ranking, no legal/clinical claims.

    Python

  3. lighteval lighteval Public

    Forked from huggingface/lighteval

    Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

    Python

  4. lm-evaluation-harness lm-evaluation-harness Public

    Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    Python

  5. inspect_ai inspect_ai Public

    Forked from UKGovernmentBEIS/inspect_ai

    Inspect: A framework for large language model evaluations

    Python

  6. trust-safety-evals trust-safety-evals Public

    Forked from The-AI-Alliance/trust-safety-evals

    The AI Alliance project to define a reference stack for AI model and system evaluation, with evaluations, benchmarks, and leaderboards.

    Makefile