Deleting uploaded files/index data #292
Replies: 4 comments 12 replies
-
Could this be added to the UX pls, e.g. select File from Upload Status list -> Delete, then trigger Search re-index? |
Beta Was this translation helpful? Give feedback.
-
@lon-tierney, Removing/Clearing files from the system can be done in a few ways depending on your needs. Removing ALL data from Azure Storage and Search Index only (leaving logs intact)As we don't yet have an automated way to remove files from the system, some manual steps are required:
This process cleans up the files and clears and recreates the index. Then once you upload new files they will be processed and indexed normally. Remove ALL Data from Azure Storage, Search Index, and LogsTo do this follow Steps 1 and 2 above, then also complete these additional steps manually:
This will recreate both the Azure CosmosDB container and the search index. Then new files uploaded will process as normal and all logs in the UX will reflect only newly processed files. |
Beta Was this translation helpful? Give feedback.
-
What has changed in 0.4-DeltaThe overall pattern of how we index content has changed a lot in 0.4-Delta. In 0.3-Gamma, we simply wrote out JSON files to Azure Blob storage and then depended on an Azure AI Search Indexer, Skillset, Datasource, and Index combination to detect the files in blob storage and enrich the index with OOTB skillsets. This had some useful features like the ability to detect deleted files in the blob storage container and clean up the index, but also came with a lot of uncontrollable errors in the OOTB skillsets. In 0.4-Delta, we have removed the Azure AI Search Indexer, Skillsets, and Datasource leaving only a single index. And rather than depending on the Indexer to come around on a schedule and detect new files, we are directly inserting data into the index via a REST API call. We have also replaced the OOTB skillsets with enrichment REST API calls in our Azure Functions and Azure App services. What does this mean for deletion?Removing some file(s) from the indexAs we don't yet have an automated way to remove files from the system, some manual steps are required:
Removing ALL data from Azure Storage and Search Index only (leaving logs intact)
This process cleans up the files and clears and recreates the index. Then once you upload new files they will be processed and indexed normally. Remove ALL Data from Azure Storage, Search Index, and LogsTo do this follow Steps 1 and 2 above, then also complete these additional steps manually:
This will recreate both the Azure CosmosDB container and the search index. Then new files uploaded will process as normal and all logs in the UX will reflect only newly processed files. |
Beta Was this translation helpful? Give feedback.
-
What about updating content? If I re-upload a document with updated contents, will it overwrite all chunks and index-documents using the filename as key? |
Beta Was this translation helpful? Give feedback.
-
@dayland can you please share steps to clear out index/uploaded doc data from an installation?
Beta Was this translation helpful? Give feedback.
All reactions