You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have a need to know what is indexed and saw there is a pipeline.
i don’t write rust so can’t change the code easily so I was hoping that the system can through events about what documents it is indexing.
It could just make an event and payload that describes:
path
File nane
fike ext
event type
The event type could be:
new . File that it has not seen before
modified. Fike that has changes.
Deleted. Fike that was deleted.
I don’t know if it’s able to know these evebt types?
The reason for this is that then I can integrate it with other processes.
A classics example is links and back links. We can use html as an example by the logic can apply to many document types.
A web psge x.html has an image element. We don’t know the state of that actual image though .
the opposite is also true. When indexing an image we want to know all html pages that use that image.
By throwing events on every document it allows custom logic to be written to keep track of what references what. For example a web page is using image A , but what else is using imsge A ?
An event system can then parse all events and built a view of what is linked to what !!
This is really useful for use cases where you are editing an image but you are not sure how many other documents are using that image.
you can then bring up all documents using that image using search and then decide if you should copy the image of not.
Also when you copy the image you might want 20 documents that use that image to start to use the new image.
It’s really similar to in code where you are refactoring code and you want to find all references to a function.
The text was updated successfully, but these errors were encountered:
Conceptually the pipeline is meant to provide the ability to customize processing of content including collection, parsing and tagging. The pipeline code is still in its early stages and does not yet provide the ability to customize nor does it provide the information you are looking for.
For your use case I can see the need for notification on change, but would the contents be needed also? We provide the ability to custom tag documents, in your example would the documents associated with the same image be tagged with that information for easy search?
I have a need to know what is indexed and saw there is a pipeline.
i don’t write rust so can’t change the code easily so I was hoping that the system can through events about what documents it is indexing.
It could just make an event and payload that describes:
The event type could be:
I don’t know if it’s able to know these evebt types?
The reason for this is that then I can integrate it with other processes.
A classics example is links and back links. We can use html as an example by the logic can apply to many document types.
A web psge x.html has an image element. We don’t know the state of that actual image though .
the opposite is also true. When indexing an image we want to know all html pages that use that image.
By throwing events on every document it allows custom logic to be written to keep track of what references what. For example a web page is using image A , but what else is using imsge A ?
An event system can then parse all events and built a view of what is linked to what !!
This is really useful for use cases where you are editing an image but you are not sure how many other documents are using that image.
you can then bring up all documents using that image using search and then decide if you should copy the image of not.
Also when you copy the image you might want 20 documents that use that image to start to use the new image.
It’s really similar to in code where you are refactoring code and you want to find all references to a function.
The text was updated successfully, but these errors were encountered: