feat: [Hackathron] Docstring linter by yuriy-encord · Pull Request #1037 · encord-team/encord-client-python

yuriy-encord · 2025-11-30T20:36:03Z

Add Docstring Quality Linter for SDK Documentation

Overview

This PR introduces a comprehensive docstring linter to ensure consistent, high-quality documentation across the Encord SDK. The linter enforces Sphinx role usage, validates formatting, and maintains cross-references to improve auto-generated documentation quality.

What the Linter Does

🔍 Issue Detection

The linter checks for 13 different types of docstring issues, grouped into three severity levels:

Errors (Must Fix - Exit Code 1)

parse_error - Python syntax errors preventing file parsing
malformed_class_reference - Incorrect :class: syntax (e.g., :class:\Name [path]`instead of:class:`~path``)
malformed_method_reference - Incorrect :meth: syntax (e.g., :meth:\name [path]`instead of:meth:`~path``)
incomplete_class_reference - Missing tilde and full path (e.g., :class:\Dataset`instead of:class:`~encord.dataset.Dataset``)

Warnings (Should Fix - Exit Code 1 in check mode)

unlinked_exception - Exception names in Raises: sections without :class: role

# ❌ Bad
Raises:
    AuthenticationError: If API key is invalid

# ✅ Good
Raises:
    :class:`~encord.exceptions.AuthenticationError`: If API key is invalid

invalid_section_header - Markdown-style headers instead of Google-style

# ❌ Bad: **Args**:, **Returns**:, Args::
# ✅ Good: Args:, Returns:

deprecated_section_header - Non-standard headers (e.g., Parameters: instead of Args:)

markdown_list_formatting - Markdown lists in Args/Returns sections instead of Google-style

# ❌ Bad
Args:
    - `param_name` (str): Description

# ✅ Good
Args:
    param_name: Description

inconsistent_indentation - Wrong indentation in docstring sections (should be 4 spaces for definitions, 8 for continuations)
section_spacing - Missing or extra blank lines between sections (should be exactly 1 blank line)

redundant_type_annotation - Type annotations in docstrings when function already has type hints

# ❌ Bad
def get_dataset(self, dataset_hash: str) -> Dataset:
    """
    Args:
        dataset_hash (str): The dataset identifier
    """

# ✅ Good
def get_dataset(self, dataset_hash: str) -> Dataset:
    """
    Args:
        dataset_hash: The dataset identifier
    """

missing_class_docstring - Public classes without docstrings

Info (Nice to Have - Exit Code 0)

unlinked_class - SDK class references without :class: role (improves cross-linking)

# ❌ Could be better
Returns:
    Dataset: The dataset instance

# ✅ Best
Returns:
    :class:`~encord.dataset.Dataset`: The dataset instance

returns_missing_crossref - Class names in Returns section without :class: role

🛠️ Auto-Fix Capabilities

The linter can automatically fix these issues when run with --fix:

✅ Invalid section headers (e.g., **Args**: → Args:)
✅ Deprecated section headers (e.g., Parameters: → Args:)
✅ Spelling inconsistencies
✅ Markdown list formatting → Google-style
✅ Inconsistent indentation
✅ Section spacing issues
✅ Redundant type annotations
✅ Malformed class/method references
✅ Incomplete class references
✅ Missing cross-references in Returns sections

Note: Issues like unlinked_exception and unlinked_class can also be auto-fixed, but require manual review since the linter needs to determine the correct module path.

📊 Output Modes

1. Summary Mode (Default)

Shows high-level statistics:

======================================================================
DOCSTRING LINTING REPORT
======================================================================

Files checked: 130
Total issues: 289
  Errors:   0
  Warnings: 207
  Info:     82

Issues by type:
  missing_class_docstring: 207
  unlinked_class: 82

2. Verbose Mode (`--verbose` or `-v`)

Shows all issues grouped by type with clickable file locations:

unlinked_exception (15):
----------------------------------------------------------------------
encord/client.py:123: ⚠ [get_dataset] Exception "AuthenticationError" should use :class: role
  "AuthenticationError" → ":class:`~encord.exceptions.AuthenticationError`"
encord/project.py:456: ⚠ [create_project] Exception "ResourceNotFoundError" should use :class: role
  "ResourceNotFoundError" → ":class:`~encord.exceptions.ResourceNotFoundError`"

unlinked_class (8):
----------------------------------------------------------------------
encord/user_client.py:182: ℹ [get_dataset] Class "Dataset" could use :class: role
  "Dataset" → ":class:`~encord.dataset.Dataset`"

Benefits of verbose mode:

Issues grouped by type (same as summary)
Clickable file paths for IDE navigation (file.py:line)
Suggested fixes shown for each issue
Function/method context for each issue

3. JSON Report Mode (`--report report.json`)

Generates machine-readable JSON for CI/CD integration:

{
  "files_checked": 130,
  "files_modified": 0,
  "statistics": {
    "total": 289,
    "errors": 0,
    "warnings": 207,
    "info": 82,
    "unlinked_exception": 15,
    "unlinked_class": 82
  },
  "issues": [
    {
      "file": "encord/client.py",
      "line": 123,
      "function": "get_dataset",
      "type": "unlinked_exception",
      "severity": "warning",
      "message": "Exception \"AuthenticationError\" should use :class: role",
      "original": "AuthenticationError",
      "suggested_fix": ":class:`~encord.exceptions.AuthenticationError`"
    }
  ]
}

Usage

Command Line

# Check single file
python tools/docstring_linter.py --check --files encord/user_client.py

# Check with detailed output
python tools/docstring_linter.py --check --files encord/user_client.py --verbose

# Check entire SDK
python tools/docstring_linter.py --check --sdk-path encord/

# Auto-fix issues
python tools/docstring_linter.py --fix --sdk-path encord/

# Generate JSON report
python tools/docstring_linter.py --check --sdk-path encord/ --report issues.json

Pre-commit Hook Integration

The linter is integrated as a pre-commit hook. It runs automatically on modified files before commit:

# .pre-commit-config.yaml
- id: docstring-linter
  name: Docstring Linter
  entry: python tools/docstring_linter.py --check --config tools/docstring_linter_config.json --files
  language: system
  types: [python]
  pass_filenames: true

Current behavior: Warnings will block commits. Use --no-verify to skip if absolutely necessary.

Configuration

The linter is configured via tools/docstring_linter_config.json:

{
  "sdk_package_name": "encord",

  "sdk_classes": [
    "LabelRowV2",
    "Dataset",
    "Project",
    "Workflow",
    ...
  ],

  "exceptions": [
    "AuthenticationError",
    "ResourceNotFoundError",
    ...
  ],

  "auto_fix_rules": {
    "invalid_section_headers": true,
    "spelling_inconsistencies": true,
    "unlinked_exceptions": false,
    "unlinked_classes": false
  },

  "ignore_patterns": [
    "*/tests/*",
    "*/test_*.py",
    "*/__pycache__/*"
  ]
}

Add new classes/exceptions: Simply append to the respective arrays in the config file.

Benefits

1. Improved Documentation Quality

Consistent formatting across all docstrings
Proper cross-references enable hyperlinks in generated docs
Better developer experience when browsing API documentation

2. Automated Quality Gates

Catches issues before they reach production docs
Pre-commit hook prevents regression
CI/CD integration via JSON reports

3. Developer Productivity

Auto-fix saves manual editing time
Clear suggested fixes for each issue
Verbose mode shows exactly what needs fixing

4. Type Safety

Removes redundant type annotations when type hints exist
Prevents documentation/code drift
Cleaner, more maintainable docstrings

5. SDK Consistency

Enforces Google-style docstrings throughout
Standardizes Sphinx role usage
Makes codebase easier to navigate

Current Status

As of this PR:

289 issues remaining across 130 files
207 warnings (should be addressed)
82 info (nice to have)

Top issue types:

missing_class_docstring: 207 (mostly internal classes)
unlinked_class: 82 (can be auto-fixed)

Migration Plan

Phase 1: Enable Pre-commit Hook (Current)

Hook is active and checking modified files
Warnings will block commits
Developers fix issues as they modify files

Phase 2: Fix Existing Warnings (Recommended)

Run auto-fix on core modules:

python tools/docstring_linter.py --fix --files encord/client.py encord/project.py encord/dataset.py

Manually review and fix remaining warnings
Focus on user-facing APIs first

Phase 3: Gradual Cleanup

Address info-level issues over time
Add docstrings to public classes
Enhance cross-references for better docs

Testing

The linter includes:

✅ Full mypy type checking compliance
✅ Comprehensive test coverage of all issue types
✅ Safe auto-fix with backup validation
✅ Handles edge cases (mixed indentation, YAML front matter, etc.)

Documentation

Full documentation available in:

tools/README.md - Comprehensive usage guide
tools/docstring_linter.py - Inline docstring with examples
tools/docstring_linter_config.json - Configuration reference

Breaking Changes

None. This is a development tool that:

✅ Does not modify runtime behavior
✅ Only affects documentation/docstrings
✅ Can be disabled via --no-verify if needed

Future Enhancements

Potential improvements:

IDE integration (VS Code extension)
GitHub Actions workflow for PR checks
Auto-fix suggestions in PR comments
Dashboard showing progress over time
Support for more Sphinx roles (:attr:, :data:, etc.)

Examples

Before

def get_dataset(self, dataset_hash: str) -> Dataset:
    """Get a dataset by hash.

    **Args**:
        - `dataset_hash` (str): The dataset identifier

    **Returns**:
        Dataset: The dataset instance

    **Raises**:
        AuthenticationError: If API key is invalid
        ResourceNotFoundError: If dataset not found
    """

After

def get_dataset(self, dataset_hash: str) -> Dataset:
    """Get a dataset by hash.

    Args:
        dataset_hash: The dataset identifier

    Returns:
        :class:`~encord.dataset.Dataset`: The dataset instance

    Raises:
        :class:`~encord.exceptions.AuthenticationError`: If API key is invalid
        :class:`~encord.exceptions.ResourceNotFoundError`: If dataset not found
    """

Impact

✅ Proper section headers
✅ Google-style formatting
✅ Removed redundant type annotation (already in signature)
✅ Added Sphinx cross-references
✅ Enabled hyperlinks in generated documentation

github-actions · 2025-11-30T20:36:47Z

Unit test report (Python 3.9.24, Pydantic 2.12.3)

272 tests 272 ✅ 8s ⏱️
1 suites 0 💤
1 files 0 ❌

Results for commit 5edcc6e.

♻️ This comment has been updated with latest results.

github-actions · 2025-11-30T20:36:47Z

Unit test report (Python 3.9.24, Pydantic 1.10.22)

272 tests 272 ✅ 8s ⏱️
1 suites 0 💤
1 files 0 ❌

Results for commit 5edcc6e.

♻️ This comment has been updated with latest results.

gemini-code-assist · 2025-11-30T20:37:03Z

Summary of Changes

Hello @yuriy-encord, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a new docstring linter designed to significantly improve the quality and consistency of the SDK's documentation. By enforcing a standardized style, validating Sphinx roles, and ensuring accurate cross-references, the linter streamlines the documentation generation process and enhances the overall developer experience when interacting with the SDK. It includes robust issue detection, auto-fixing capabilities, and flexible reporting options, making it a powerful tool for maintaining high documentation standards.

Highlights

New Docstring Linter: Introduced a comprehensive docstring linter to enforce consistent, high-quality documentation across the SDK, ensuring Sphinx role usage, valid formatting, and proper cross-references.
Issue Detection & Auto-Fix: The linter detects 13 types of docstring issues across three severity levels (Errors, Warnings, Info) and offers auto-fix capabilities for many common problems like invalid section headers, spelling inconsistencies, and redundant type annotations.
Pre-commit Hook Integration: The linter is integrated as a pre-commit hook, automatically checking modified files before commit to prevent documentation quality regressions. Warnings will block commits by default.
Improved Documentation & Developer Experience: The tool aims to improve documentation quality, automate quality gates, boost developer productivity through auto-fixes and clear suggestions, enhance type safety, and standardize SDK docstring consistency.
Extensive Configuration & Reporting: The linter is highly configurable via a JSON file, allowing customization of SDK classes, exceptions, auto-fix rules, and ignore patterns. It also supports detailed verbose output and JSON report generation for CI/CD integration.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a new docstring linter to enforce documentation quality, along with a large number of auto-generated fixes to existing docstrings. This is a fantastic initiative for improving code maintainability and documentation clarity. My review focuses on the implementation of the linter itself and the correctness of the applied fixes. I've identified a few areas where the linter's logic can be improved for robustness and maintainability, and several instances where the auto-generated fixes are incorrect. Please see the detailed comments for suggestions.

gemini-code-assist · 2025-11-30T20:39:47Z


        Returns:
-            List[DataRow]: A list of DataRows object that match the filter
+            List[:class:`~encord.orm.dataset.DataRow`]: A list of :class:`~encord.orm.dataset.DataRow`s object that match the filter


The auto-generated docstring contains a grammatical error: '...a list of :class:~encord.orm.dataset.DataRows object...'. This should be '...a list of :class:~encord.orm.dataset.DataRow objects...'. This suggests a potential issue in the linter's pluralization logic.

Suggested change

List[:class:`~encord.orm.dataset.DataRow`]: A list of :class:`~encord.orm.dataset.DataRow`s object that match the filter

List[:class:`~encord.orm.dataset.DataRow`]: A list of :class:`~encord.orm.dataset.DataRow` objects that match the filter

gemini-code-assist · 2025-11-30T20:39:47Z

+            Iterator[Union[:class:`~encord.storage.StorageItem`, :class:`~encord.storage.StorageItem`Inaccessible]]: An iterator containing both accessible
            and inaccessible storage items in the collection.


The generated reference for StorageItemInaccessible appears to be broken. The linter seems to have incorrectly split the class name, resulting in :class:~encord.storage.StorageItemInaccessible. The correct syntax should be a single :class: role for the full class name, like :class:~encord.storage.StorageItemInaccessible``.

Suggested change

Iterator[Union[:class:`~encord.storage.StorageItem`, :class:`~encord.storage.StorageItem`Inaccessible]]: An iterator containing both accessible

and inaccessible storage items in the collection.

Iterator[Union[:class:`~encord.storage.StorageItem`, :class:`~encord.storage.StorageItemInaccessible`]]: An iterator containing both accessible

and inaccessible storage items in the collection.

gemini-code-assist · 2025-11-30T20:39:48Z

+        Args:
+            k : str: The key under which the embedding will be stored in the schema.
+            size : int: The size of the embedding.


The linter has correctly reformatted the Args section, but there are a couple of issues with the output:

The type annotations (: str, : int) are redundant as they are already present in the function signature. According to the PR description, these should be removed.

The format k : str: is not standard Google style. It should be k: ....

A better format would be:

Args: k: The key under which the embedding will be stored in the schema. size: The size of the embedding.

Suggested change

Args:

k : str: The key under which the embedding will be stored in the schema.

size : int: The size of the embedding.

Args:

k: The key under which the embedding will be stored in the schema.

size: The size of the embedding.

gemini-code-assist · 2025-11-30T20:39:48Z

 def label_row_metadata_dto_to_label_row_metadata(label_row_metadata_dto: LabelRowMetadataDTO) -> LabelRowMetadata:
    """Convert a :class:`LabelRowMetadataDTO` instance to a
-    :class:`LabelRowMetadata` dataclass.
+    :class:LabelRowMetadata dataclass.


The linter seems to have incorrectly removed the backticks from the :class: role. The syntax :class:LabelRowMetadata is invalid. It should be enclosed in backticks, like :class:~encord.orm.label_row.LabelRowMetadata``.

Suggested change

:class:LabelRowMetadata dataclass.

:class:`~encord.orm.label_row.LabelRowMetadata` dataclass.

gemini-code-assist · 2025-11-30T20:39:48Z

        allow_mirror_dataset_changes: bool = False,
    ) -> None:
-        """Move items (list of `StorageItem` objects or UUIDs) to another folder (specify folder object or UUID).
+        """Move items (list of `:class:`~encord.storage.StorageItem`` objects or UUIDs) to another folder (specify folder object or UUID).


The auto-generated reference for StorageItem has extra backticks, resulting in :class:~encord.storage.StorageItem```. This is incorrect and will not render correctly. It should be :class:~encord.storage.StorageItem``.

Suggested change

"""Move items (list of `:class:`~encord.storage.StorageItem`` objects or UUIDs) to another folder (specify folder object or UUID).

"""Move items (list of :class:`~encord.storage.StorageItem` objects or UUIDs) to another folder (specify folder object or UUID).

gemini-code-assist · 2025-11-30T20:39:48Z

+    SDK_CLASS_PATTERNS = [
+        r"\bLabelRowV2\b",
+        r"\bDataset\b",
+        r"\bProject\b",
+        r"\bOntologyStructure\b",
+        r"\bWorkflow\b",
+        r"\bStorageItem\b",
+        r"\bStorageFolder\b",
+        r"\bCollection\b",
+        r"\bOntology\b",
+        r"\bProjectUser\b",
+        r"\bDataRow\b",
+        r"\bObjectInstance\b",
+        r"\bClassificationInstance\b",
+    ]
+
+    # Mapping of class names to their actual import paths
+    CLASS_TO_MODULE = {
+        "LabelRowV2": "encord.objects.LabelRowV2",
+        "Dataset": "encord.dataset.Dataset",
+        "Project": "encord.project.Project",
+        "OntologyStructure": "encord.objects.OntologyStructure",
+        "Workflow": "encord.workflow.Workflow",
+        "StorageItem": "encord.storage.StorageItem",
+        "StorageFolder": "encord.storage.StorageFolder",
+        "Collection": "encord.collection.Collection",
+        "Ontology": "encord.ontology.Ontology",
+        "ProjectUser": "encord.utilities.project_user.ProjectUser",
+        "DataRow": "encord.orm.dataset.DataRow",
+        "ObjectInstance": "encord.objects.ObjectInstance",
+        "ClassificationInstance": "encord.objects.ClassificationInstance",
+    }
+
+    # Known exception names - automatically extracted from exceptions.py
+    EXCEPTION_CLASSES = {
+        "EncordException",
+        "InitialisationError",
+        "AuthenticationError",
+        "AuthorisationError",
+        "ResourceNotFoundError",
+        "TimeOutError",
+        "RequestException",
+        "RateLimitExceededError",
+        "PayloadTooLargeError",
+        "UnknownException",
+        "InvalidDateFormatError",
+        "MethodNotAllowedError",
+        "OperationNotAllowed",
+        "AnswerDictionaryError",
+        "CorruptedLabelError",
+        "FileTypeNotSupportedError",
+        "FileSizeNotSupportedError",
+        "FeatureDoesNotExistError",
+        "ModelWeightsInconsistentError",
+        "ModelFeaturesInconsistentError",
+        "UploadOperationNotSupportedError",
+        "DetectionRangeInvalidError",
+        "InvalidAlgorithmError",
+        "ResourceExistsError",
+        "DuplicateSshKeyError",
+        "SshKeyNotFound",
+        "InvalidArgumentsError",
+        "GenericServerError",
+        "CloudUploadError",
+        "MultiLabelLimitError",
+        "LabelRowError",
+        "OntologyError",
+        "WrongProjectTypeError",
+        "BundledMoveWorkflowTasksPayloadError",
+        # Exceptions from other modules
+        "MetadataSchemaError",
+    }
+
+    # Build regex patterns for all exceptions
+    EXCEPTION_PATTERNS = [rf"\b{exc}\b" for exc in EXCEPTION_CLASSES]
+
+    # Normalize exception names to match actual class names
+    # (Some docstrings use American spelling but the class uses British spelling)
+    EXCEPTION_NAME_MAPPING = {
+        "AuthorizationError": "AuthorisationError",  # Normalize to British spelling
+        "AuthorisationError": "AuthorisationError",  # Already correct
+    }
+    # Default: map each exception to itself
+    for exc in EXCEPTION_CLASSES:
+        if exc not in EXCEPTION_NAME_MAPPING:
+            EXCEPTION_NAME_MAPPING[exc] = exc
+
+    # Exceptions not in encord.exceptions need special path mapping
+    EXCEPTION_MODULE_MAPPING = {
+        "MetadataSchemaError": "encord.metadata_schema.MetadataSchemaError",
+    }


The lists SDK_CLASS_PATTERNS, CLASS_TO_MODULE, EXCEPTION_CLASSES, and EXCEPTION_MODULE_MAPPING are hardcoded within the linter. The PR description and the new config file suggest that these should be configurable via tools/docstring_linter_config.json. Loading these from the config file would make the linter more maintainable and easier to update as the SDK evolves.

gemini-code-assist · 2025-11-30T20:39:48Z

+    def _is_within_docstring(self, content: str, pos: int) -> bool:
+        """Check if a position in content is within a docstring (between triple quotes)."""
+        # Count triple quotes before this position
+        before = content[:pos]
+        # Count both ''' and """ as docstring delimiters
+        triple_double = before.count('"""')
+        triple_single = before.count("'''")
+
+        # If odd number of triple quotes before, we're inside a docstring
+        # We check both types independently
+        in_double = (triple_double % 2) == 1
+        in_single = (triple_single % 2) == 1
+
+        return in_double or in_single


The _is_within_docstring function relies on counting triple quotes, which can be brittle. For instance, a code example within a docstring that contains a triple-quoted string could confuse the parser. A more robust approach would be to use the line and column information from the ast node for the docstring. This information is available from the ast.walk traversal and would allow for precise boundary checking.

github-actions · 2025-11-30T21:04:57Z

SDK integration test report

285 tests ±0 277 ✅ +1 13m 50s ⏱️ +33s
1 suites ±0 4 💤 ±0
1 files ±0 4 ❌ - 1

For more details on these failures, see this check.

Results for commit 5edcc6e. ± Comparison against base commit 6060568.

Implement docstring linter

9155b0f

gemini-code-assist Bot reviewed Nov 30, 2025

View reviewed changes

Remove linter from pre-commit hooks

5edcc6e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: [Hackathron] Docstring linter#1037

feat: [Hackathron] Docstring linter#1037
yuriy-encord wants to merge 2 commits into
masterfrom
ym/doctring-linter

yuriy-encord commented Nov 30, 2025

Uh oh!

github-actions Bot commented Nov 30, 2025 •

edited

Loading

Uh oh!

github-actions Bot commented Nov 30, 2025 •

edited

Loading

Uh oh!

gemini-code-assist Bot commented Nov 30, 2025

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Uh oh!

github-actions Bot commented Nov 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	List[:class:`~encord.orm.dataset.DataRow`]: A list of :class:`~encord.orm.dataset.DataRow`s object that match the filter
	List[:class:`~encord.orm.dataset.DataRow`]: A list of :class:`~encord.orm.dataset.DataRow` objects that match the filter

		Iterator[Union[:class:`~encord.storage.StorageItem`, :class:`~encord.storage.StorageItem`Inaccessible]]: An iterator containing both accessible
		and inaccessible storage items in the collection.

	:class:LabelRowMetadata dataclass.
	:class:`~encord.orm.label_row.LabelRowMetadata` dataclass.

	"""Move items (list of `:class:`~encord.storage.StorageItem`` objects or UUIDs) to another folder (specify folder object or UUID).
	"""Move items (list of :class:`~encord.storage.StorageItem` objects or UUIDs) to another folder (specify folder object or UUID).

Conversation

yuriy-encord commented Nov 30, 2025

Add Docstring Quality Linter for SDK Documentation

Overview

What the Linter Does

🔍 Issue Detection

Errors (Must Fix - Exit Code 1)

Warnings (Should Fix - Exit Code 1 in check mode)

Info (Nice to Have - Exit Code 0)

🛠️ Auto-Fix Capabilities

📊 Output Modes

1. Summary Mode (Default)

2. Verbose Mode (--verbose or -v)

3. JSON Report Mode (--report report.json)

Usage

Command Line

Pre-commit Hook Integration

Configuration

Benefits

1. Improved Documentation Quality

2. Automated Quality Gates

3. Developer Productivity

4. Type Safety

5. SDK Consistency

Current Status

Migration Plan

Phase 1: Enable Pre-commit Hook (Current)

Phase 2: Fix Existing Warnings (Recommended)

Phase 3: Gradual Cleanup

Testing

Documentation

Breaking Changes

Future Enhancements

Examples

Before

After

Impact

Uh oh!

github-actions Bot commented Nov 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Unit test report (Python 3.9.24, Pydantic 2.12.3)

Uh oh!

github-actions Bot commented Nov 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Unit test report (Python 3.9.24, Pydantic 1.10.22)

Uh oh!

gemini-code-assist Bot commented Nov 30, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Nov 30, 2025

SDK integration test report

Uh oh!

Reviewers

2. Verbose Mode (`--verbose` or `-v`)

3. JSON Report Mode (`--report report.json`)

github-actions Bot commented Nov 30, 2025 •

edited

Loading

github-actions Bot commented Nov 30, 2025 •

edited

Loading