Documentation: iterating over collection and documents module #814
base: main
Changes from 2 commits
This file was deleted.
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,14 +1,14 @@ | ||
| Remove a collection from the platform. This is a two-step process: | ||
| Remove a collection from the platform. | ||
|
|
||
| This is a two-step process: | ||
|
coderabbitai[bot] marked this conversation as resolved.
Outdated
|
||
|
|
||
| 1. Delete all resources that were allocated: file(s), the Vector | ||
| Store, and the Assistant. | ||
| 2. Delete the collection entry from the kaapi database. | ||
|
|
||
| No action is taken on the documents themselves: the contents of the | ||
| documents that were a part of the collection remain unchanged, and those | ||
| documents can still be accessed via the documents endpoints. The response from this | ||
| endpoint will be a `collection_job` object which will contain the collection `job_id` and | ||
| status. When you take the id returned and use the `collection job info` endpoint, | ||
| documents can still be accessed via the documents endpoints. The endpoint returns the job ID and status of the collection delete operation. When you pass the returned ID to the `collection job info` endpoint, | ||
| if the job is successful, you will get the status as successful. | ||
| Additionally, if a `callback_url` was provided in the request body, | ||
| you will receive a message indicating whether the deletion was successful or if it failed. | ||
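
The job-polling flow described above can be sketched as follows. This is a minimal sketch under stated assumptions, not the platform's client: `fetch_job` is a hypothetical callable standing in for a request to the `collection job info` endpoint, and the `status` key and the status strings `"successful"` and `"failed"` are assumptions inferred from the description, not confirmed response values.

```python
import time
from typing import Any, Callable, Dict

# Assumed terminal status strings; the docs only say a successful job
# reports its status as successful.
TERMINAL_STATUSES = {"successful", "failed"}


def wait_for_collection_job(
    job_id: str,
    fetch_job: Callable[[str], Dict[str, Any]],
    max_attempts: int = 10,
    delay_seconds: float = 0.0,
) -> Dict[str, Any]:
    """Poll a job until it reaches a terminal status or attempts run out.

    `fetch_job` is a hypothetical stand-in for calling the
    `collection job info` endpoint with the returned job ID.
    """
    for _ in range(max_attempts):
        job = fetch_job(job_id)
        if str(job.get("status", "")).lower() in TERMINAL_STATUSES:
            return job
        time.sleep(delay_seconds)
    raise TimeoutError(f"job {job_id} did not finish after {max_attempts} attempts")
```

If a `callback_url` is supplied in the request body, polling like this is optional, since the outcome is also delivered to that URL.
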
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,8 +1,8 @@ | ||
| Retrieve detailed information about a specific collection by its collection id. This endpoint returns the collection object including its project, organization, timestamps, and service-specific details. | ||
| Retrieve detailed information about a specific collection by its collection id. | ||
|
|
||
| **Response Fields:** | ||
|
|
||
| **Note:** While the API schema shows both `llm_service_id`/`llm_service_name` AND `knowledge_base_id`/`knowledge_base_provider`, the actual response will only include the fields relevant to what was created: | ||
| **Note:** While the example response shows both `llm_service_id`/`llm_service_name` AND `knowledge_base_id`/`knowledge_base_provider`, the actual response will only include the fields relevant to what was created: | ||
|
|
||
| - **If an Assistant was created** (with model + instructions): The response will only include `llm_service_id` and `llm_service_name` | ||
| - **If only a Vector Store was created** (without model/instructions): The response will only include `knowledge_base_id` and `knowledge_base_provider` | ||
|
|
@@ -11,4 +11,4 @@ Retrieve detailed information about a specific collection by its collection id. | |
|
|
||
| If the `include_docs` parameter is true, the response also includes a list of the document IDs associated with the collection. Note that the returned documents are stored not only by Kaapi but also by the vector store provider. | ||
|
|
||
| Additionally, if you set the `include_url` parameter to true, a signed URL will be included in the response, which is a clickable link to access the retrieved document. If you don't set it to true, the URL will not be included in the response. | ||
| Additionally, if you set the `include_url` parameter to true, a signed URL will be included in the response, which is a clickable link to access the retrieved document(s) of the collection you have retrieved. If you don't set it to true, the URL will not be included in the response. | ||
|
Collaborator
The new sentence feels repetitive. The previous version was already neat.
||
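
The note about service-specific response fields can be illustrated with a small helper that inspects a collection object. This is a sketch only: the field names come from the documentation text above, while the function name and the `"assistant"` / `"vector_store"` labels are hypothetical. It treats a missing key and a null value the same way, since the text does not say which form the response uses.

```python
from typing import Any, Dict, Optional, Tuple


def backing_service(collection: Dict[str, Any]) -> Optional[Tuple[str, Any, Any]]:
    """Return (kind, id, name_or_provider) for the service behind a collection.

    The kind labels are illustrative, not values returned by the API.
    """
    # An Assistant was created (model + instructions were supplied).
    if collection.get("llm_service_id") is not None:
        return (
            "assistant",
            collection["llm_service_id"],
            collection.get("llm_service_name"),
        )
    # Only a Vector Store was created (no model/instructions).
    if collection.get("knowledge_base_id") is not None:
        return (
            "vector_store",
            collection["knowledge_base_id"],
            collection.get("knowledge_base_provider"),
        )
    # Neither pair of fields is present.
    return None
```
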
This file was deleted.
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,8 +1,6 @@ | ||
| Perform a delete of the document. This makes the | ||
| document invisible. It does not delete the document from cloud storage | ||
| or its information from the database. | ||
| Perform a delete of the document. | ||
|
|
||
| If the document is part of an active collection, those collections | ||
| will be deleted using the collections delete interface. Notably, this | ||
| means all OpenAI Vector Stores and Assistants to which this document | ||
| belongs will be deleted. | ||
| This makes the document invisible. It does not delete the document | ||
|
Collaborator
Avoid using the term “invisible” in the API documentation. Use a clearer and more descriptive term instead.
||
| from cloud storage or its information from the database. | ||
|
|
||
| If the document belongs to any active collections, those collections will also be deleted. This includes all associated knowledge bases — for example, any OpenAI vector stores that were created through this platform with this document. | ||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,8 +1,7 @@ | ||
| Permanently delete a document from cloud storage. | ||
|
|
||
| This operation marks the document as deleted in the database while retaining its metadata. However, the actual file is | ||
| permanently deleted from cloud storage (e.g., S3) and cannot be recovered. Only the database record remains for reference | ||
| purposes. | ||
|
|
||
| If the document is part of an active collection, those collections | ||
| will be deleted using the collections delete interface. Notably, this | ||
| means all OpenAI Vector Stores and Assistants to which this document | ||
| belongs will be deleted. | ||
| If the document belongs to any active collections, those collections will also be deleted. This includes all associated knowledge bases — for example, any OpenAI vector stores that were created through this platform with this document. |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,18 +1,18 @@ | ||
| Upload a document to Kaapi. | ||
| Upload a document to Kaapi and optionally transform it as well. | ||
|
|
||
| - If only a file is provided, the document will be uploaded and stored, and its ID will be returned. | ||
| - If a target format is specified, a transformation job is also created to transform the document into the target format in the background. The response will include both the uploaded document details and information about the transformation job. | ||
| - If a callback URL is provided, you will receive a notification at that URL once the document transformation job is completed. | ||
|
|
||
| ### Supported Transformations | ||
| ### Supported Transformations: | ||
|
|
||
| The following (source_format → target_format) transformations are supported: | ||
| The following (source_format → target_format) transformations are supported for now: | ||
|
|
||
| - pdf → markdown | ||
|
Collaborator
Add a heading or title -> Supported Conversion
||
| - zerox | ||
|
|
||
| ### Transformers | ||
| ### Transformers: | ||
|
|
||
| Available transformer names and their implementations, default transformer is zerox: | ||
| Available transformer names and their implementations (the default transformer is zerox for now): | ||
|
|
||
| - `zerox` | ||
Nitpick: Replace “pertinent” with simpler, more developer-friendly terminology.
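
The upload options described in the diff above (file only, file plus target format, optional callback URL) can be sketched as a helper that assembles the optional request fields. This is a sketch under assumptions: the field names `target_format`, `transformer`, and `callback_url` and the function itself are hypothetical, and the source format is guessed from the file extension purely for illustration. Only the `pdf → markdown` pair and the `zerox` default come from the documentation text.

```python
from pathlib import Path
from typing import Dict, Optional

# From the docs: the only supported (source_format, target_format) pair for now.
SUPPORTED_TRANSFORMATIONS = {("pdf", "markdown")}
# From the docs: zerox is the default transformer for now.
DEFAULT_TRANSFORMER = "zerox"


def build_upload_fields(
    filename: str,
    target_format: Optional[str] = None,
    transformer: Optional[str] = None,
    callback_url: Optional[str] = None,
) -> Dict[str, str]:
    """Build the optional fields to send alongside the uploaded file.

    Field names are assumptions, not the documented request schema.
    """
    fields: Dict[str, str] = {}
    if target_format is None:
        # Upload only: no transformation job is requested.
        return fields
    source_format = Path(filename).suffix.lstrip(".").lower()
    if (source_format, target_format) not in SUPPORTED_TRANSFORMATIONS:
        raise ValueError(f"unsupported transformation: {source_format} -> {target_format}")
    fields["target_format"] = target_format
    fields["transformer"] = transformer or DEFAULT_TRANSFORMER
    if callback_url:
        # The callback is notified once the transformation job completes.
        fields["callback_url"] = callback_url
    return fields
```

In this sketch the callback URL is only attached when a transformation is requested, because the text ties the notification to completion of the transformation job.
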