Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DOCS0-2] Fill out missing details in docs and check consistency #1950

Open
20 tasks
Tracked by #1756
blythed opened this issue Apr 5, 2024 · 15 comments
Open
20 tasks
Tracked by #1756

[DOCS0-2] Fill out missing details in docs and check consistency #1950

blythed opened this issue Apr 5, 2024 · 15 comments
Assignees

Comments

@blythed
Copy link
Collaborator

blythed commented Apr 5, 2024

Read through structure, give live feedback, suggestions, and push improvements.

  • Getting started
  • Core API
  • Apply API
  • Execute API
  • Models
  • Data integrations
    • MongoDB
    • SQL
  • AI Integrations
    • Anthropic
    • Cohere
    • OpenAI
    • Jina
    • Scikit-learn
    • Torch
    • Transformers
    • vLLM
    • LlamaCpp
  • Fundamentals
  • Production features

For each AI integration:

  • How to instantiate
  • (How to train)
@fnikolai
Copy link
Collaborator

The Oracle tab is wrong. It refers toMSSQL

https://docs.superduperdb.com/docs/docs/reusable_snippets/connect_to_superduperdb

@fnikolai
Copy link
Collaborator

fnikolai commented May 14, 2024

The Get useful sample data can be renamed to Fetch Dataset

@fnikolai
Copy link
Collaborator

Tabs on Compute features have some spaces at the beginning

@fnikolai
Copy link
Collaborator

data integrations seems redundant and misleading.

If we want to keep it, then at least it should be similar to reusable snippets

@fnikolai
Copy link
Collaborator

Create Vector-Index seems to have an issue with the tabs

@fnikolai
Copy link
Collaborator

The SQL statement on Perform a vector search seems a bit odd.

select = query_table_or_collection.like(item, vector_index=vector_index_name, n=10).limit(10)        

Why do we need limit(10) when we have n=10 ?

@fnikolai
Copy link
Collaborator

Connecting Listeners is empty

@fnikolai
Copy link
Collaborator

On Postgresql of Connect to SuperDuperDB the user and password should be superduper in order to be in compliance with the credentials of the Docker databases.

@fnikolai
Copy link
Collaborator

Change Mongo Connection to:

from superduperdb import superduper

user = 'superduper'
password = 'superduper'
port = 27017
host = 'localhost'
database = 'test_db'

db = superduper(f"mongodb://{user}:{password}@{host}:{port}/{database}")

@fnikolai
Copy link
Collaborator

Also explain how do to pre-filtering and post-filtering of the data.

In general, whatever questions have raised on Slack/Issues, should be answered somewhere on the docs.

@fnikolai
Copy link
Collaborator

fnikolai commented May 17, 2024

from superduperdb import dtype is shown everywhere on Get Useful Samples but it returns

ImportError: cannot import name 'dtype' from 'superduperdb' (/home/superduper/superduperdb/superduperdb/__init__.py)

replace it with:

from superduperdb.backends.ibis.field_types import dtype

@fnikolai
Copy link
Collaborator

fnikolai commented May 17, 2024

curl -O s3://superduperdb-public-demo/images.zip returns curl: (1) Unsupported protocol or curl: (1) Protocol "s3" not supported or disabled in libcurl.

You need to replace it with https://superduperdb-public-demo.s3.amazonaws.com/images.zip

The same for video and audio.

@fnikolai
Copy link
Collaborator

fnikolai commented May 17, 2024

On Multimodal Vector Search in Define the embedding model datatype the SQL and Mongo are swapped compared to others.

@fnikolai
Copy link
Collaborator

On Multimodal Vector Search the Image chunker is missing and therefore the Listener cannot be used.

@fnikolai
Copy link
Collaborator

On Multimodal Vector Search when choose Text chunker the listener blocks forever

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

No branches or pull requests

6 participants