Application Accessing Deleted Blob Storage Documents #61

Open
usmantariq-odl opened this issue Dec 10, 2024 · 3 comments

Comments

@usmantariq-odl

I am encountering an issue where the application continues to access and provide answers referencing documents that have already been deleted from the Blob Storage. Despite removing these documents, it seems the system is still using them as part of its data source.

@pamelafox, could you please investigate why this is happening and suggest a way to ensure deleted documents are no longer accessible to or referenced by the application?

@pamelafox
Collaborator

@usmantariq-odl Are you using the integrated vectorization approach, the default for this repo, or are you bringing your own index? Each indexing approach handles blob processing differently.

If you're using integrated vectorization, have you re-run the indexer after removing the documents? It's not currently run on a schedule.

@usmantariq-odl
Author

usmantariq-odl commented Dec 10, 2024

@pamelafox, yes, I'm using the integrated vectorization approach. I recreated the indexer by deleting it and deploying the app again, and it worked fine after that. But I couldn't find a way to clean or reset the indexer directly from Azure. If you know the best approach, please let me know.

Kind regards.

@pamelafox
Collaborator

Here's documentation about running an indexer:
https://learn.microsoft.com/en-us/azure/search/search-howto-run-reset-indexers?tabs=portal#how-to-reset-and-run-indexers
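For anyone who prefers a script to the portal, here is a minimal sketch of the reset-then-run sequence against the Azure AI Search REST API, using only the Python standard library. The service name, indexer name, admin key, and the `2024-07-01` API version are assumptions for illustration, and the helper names are hypothetical, not part of this repo. It only triggers reindexing; whether index entries for deleted blobs are removed depends on how deletion detection is configured on the data source, which this sketch does not cover.

```python
import urllib.parse
import urllib.request

# Assumption: a stable Azure AI Search REST API version; adjust to your service.
API_VERSION = "2024-07-01"


def build_indexer_request(service: str, indexer: str, action: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for an indexer 'reset' or 'run'."""
    if action not in ("reset", "run"):
        raise ValueError(f"unsupported action: {action!r}")
    url = (
        f"https://{service}.search.windows.net/indexers/"
        f"{urllib.parse.quote(indexer, safe='')}/{action}?api-version={API_VERSION}"
    )
    headers = {"api-key": api_key, "Content-Type": "application/json"}
    # Both operations take an empty body.
    return urllib.request.Request(url, data=b"", method="POST", headers=headers)


def reset_and_run(service: str, indexer: str, api_key: str) -> None:
    """Reset the indexer's change tracking, then start a new run."""
    for action in ("reset", "run"):
        request = build_indexer_request(service, indexer, action, api_key)
        with urllib.request.urlopen(request) as response:
            print(action, "->", response.status)
```

Calling `reset_and_run("my-search-service", "my-indexer", "<admin-key>")` with placeholder values replaced by your own would issue the two requests in order.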

It sounds like you may want to run it on a schedule:
https://learn.microsoft.com/en-us/azure/search/search-howto-schedule-indexers?tabs=portal
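As a rough illustration of what a schedule looks like outside the portal: an indexer definition carries a `schedule` object whose `interval` is an ISO 8601 duration (for example `PT2H`), between 5 minutes and 1 day. The sketch below only prepares that definition; the helper names are hypothetical, and sending the updated definition back to the service (a PUT on the indexer) is left out.

```python
import copy
from datetime import timedelta


def iso8601_duration(interval: timedelta) -> str:
    """Format a timedelta as an ISO 8601 duration such as PT30M or PT2H."""
    total_minutes = int(interval.total_seconds() // 60)
    # Azure AI Search documents a 5-minute minimum and a 1-day maximum.
    if not 5 <= total_minutes <= 24 * 60:
        raise ValueError("interval must be between 5 minutes and 1 day")
    hours, minutes = divmod(total_minutes, 60)
    duration = "PT"
    if hours:
        duration += f"{hours}H"
    if minutes:
        duration += f"{minutes}M"
    return duration


def with_schedule(indexer_definition: dict, interval: timedelta) -> dict:
    """Return a copy of an indexer definition with a schedule attached."""
    updated = copy.deepcopy(indexer_definition)
    updated["schedule"] = {"interval": iso8601_duration(interval)}
    return updated
```

For example, `with_schedule(definition, timedelta(hours=2))` leaves `definition` untouched and returns a copy whose `schedule` is `{"interval": "PT2H"}`.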

We can add these links to the readme somewhere.
