Initial draft of search API. #2868

kentonv · 2017-02-11T03:37:14Z

DO NOT MERGE

There are some TODO(now)s in there regarding parts I'm still thinking about, but wanted to get some initial feedback on this.

dwrensha · 2017-02-13T02:12:05Z

src/sandstorm/indexer.capnp

+# Indexer implementation
+
+interface IndexerSession {
+  # A session type specifically implemented by the indexer app. The app's UiView's newSession()


How many indexer app grains are there? Just one? One per user? Maybe the are created on demand when the user manually sets up some grains as indexable?

dwrensha · 2017-02-13T02:15:05Z

src/sandstorm/indexer.capnp

+interface GrainIndexer(Metadata) {
+  # Capability used to index the content of a grain.
+  #
+  # This is a one-way capability. Although GrainIndexer is implemented by the indexer and is called


I only vaguely understand what you're intending to protect against by making this a "one-way" capability. Should we be worried that capabilities could be smuggled through the metadata field of IndexableContent?

dwrensha · 2017-02-13T02:18:29Z

src/sandstorm/indexer.capnp

+  # Note that, as an optimization, Sandstorm may start out assuming that all users see exactly
+  # the same content, and may do all indexing as an anonymous user, perhaps with no permissions.
+  # However, if the app logs an ActivityEvent that specifies that it requires specific permissions
+  # or is visible only to certain users, Sandstorm uses that as a hint to index the path associated


Does this mean that Sandstorm tries to index any grain that calls SessionContext.activity()?

dwrensha · 2017-02-13T02:19:22Z

src/sandstorm/indexer.capnp

+using Grain = import "grain.capnp";
+
+interface IndexingSession(Metadata) {
+  # This is a UiView session type, created by calling UiView.newSession().


So I guess this should have extends(UiSession)?

kentonv · 2017-02-13T23:36:43Z

I added comments answering all of @dwrensha's questions.

ocdtrekkie · 2017-02-14T16:09:16Z

How well do you expect this to scale? Global full-text search of say... documents... seems reasonable to expect Sandstorm servers to do, but what happens if I have an eBook library with a few hundred books of text in my Sandstorm?

Will it be possible to define what type of content is being indexed? To extend the above example, if I have an eBook library, and a lot of personal documents, it may be hard to find personal documents (data I generated) I am looking for if it also returns a lot of results from books (data I have).

kentonv · 2017-02-14T21:49:30Z

@ocdtrekkie I would guess that the tech can handle indexing your ebooks. Lucene is designed to be able to index millions or even billions of documents, and I assume Lucy is similar. As a rule of thumb, the index ought to be similar in size to the text it is indexing.

As for usability issues, that's a good point. I think we would handle this by adding search operators and such. This initial design doesn't really get into how those will work, but it can be extended in that direction in the future.

zenhack · 2020-01-17T19:39:20Z

src/sandstorm/indexer.capnp

+  # metadata format.
+  #
+  # TODO(someday): Spec out how this works. Not needed for MVP of search.
+}


It's not obvious to me how an indexer would be able to use "generic" operators like this while treating the metadata as completely opaque.

If the plan is to not implement this right away anyway, is there a good reason to even include this method? We can always add more later...

Initial draft of search API.

057235b

kentonv assigned dwrensha Feb 11, 2017

dwrensha reviewed Feb 13, 2017

View reviewed changes

Answer @dwrensha's questions.

20a6f3d

ocdtrekkie added the app-platform App/Sandstorm integration features label Jan 15, 2020

ocdtrekkie mentioned this pull request Jan 17, 2020

Search across all sandstorm documents #1234

Open

zenhack reviewed Jan 17, 2020

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial draft of search API. #2868

Initial draft of search API. #2868

kentonv commented Feb 11, 2017

dwrensha Feb 13, 2017

dwrensha Feb 13, 2017

dwrensha Feb 13, 2017

dwrensha Feb 13, 2017

kentonv commented Feb 13, 2017

ocdtrekkie commented Feb 14, 2017

kentonv commented Feb 14, 2017

zenhack Jan 17, 2020

Initial draft of search API. #2868

Are you sure you want to change the base?

Initial draft of search API. #2868

Conversation

kentonv commented Feb 11, 2017

dwrensha Feb 13, 2017

Choose a reason for hiding this comment

dwrensha Feb 13, 2017

Choose a reason for hiding this comment

dwrensha Feb 13, 2017

Choose a reason for hiding this comment

dwrensha Feb 13, 2017

Choose a reason for hiding this comment

kentonv commented Feb 13, 2017

ocdtrekkie commented Feb 14, 2017

kentonv commented Feb 14, 2017

zenhack Jan 17, 2020

Choose a reason for hiding this comment