ML 10906 image001
Amazon Kendra is a highly accurate and simple-to-use intelligent search service powered by machine learning (ML). Amazon Kendra offers a suite of data source connectors to simplify the process of ingesting and indexing your content, wherever it resides.
Valuable data in organizations is stored in both structured and unstructured repositories. An enterprise search solution should be able to pull together data across several structured and unstructured repositories to index and search on.
One such data repository is Dropbox. Enterprise users use Dropbox to upload, transfer, and store documents to the cloud. Along with the ability to store documents, Dropbox offers Dropbox Paper, a coediting tool that lets users collaborate and create content in one place. Dropbox Paper can optionally use templates to add structure to documents. In addition to files and paper, Dropbox also allows you to store shortcuts to webpages in your folders.
We’re excited to announce that you can now use the Amazon Kendra connector for Dropbox to search information stored in your Dropbox account. In this post, we show how to index information stored in Dropbox and use the Amazon Kendra intelligent search function. In addition, Amazon Kendra’s ML powered intelligent search can accurately find information from unstructured documents having natural language narrative content, for which keyword search is not very effective.
With Amazon Kendra, you can configure multiple data sources to provide a central place to search across your document repository. For our solution, we demonstrate how to index a Dropbox repository or folder using the Amazon Kendra connector for Dropbox. The solution consists of the following steps:
To try out the Amazon Kendra connector for Dropbox, you need the following:
Before we set up the Dropbox data source, we need a few details about your Dropbox repository. Let’s gather those in advance.
Store these values in a safe place—we need to refer to these later.
The session token is valid for up to 4 hours. You have to generate a new session token each time you index the content.
To store your Dropbox credentials in Secrets Manager, compete the following steps:
appKey
, appSecret
, and refreshToken
and enter the values saved from Dropbox.AmazonKendra-dropbox-secret
).To configure the Amazon Kendra connector, complete the following steps:
my-dropbox-index
).my-dropbox-connector
).AmazonKendra-dropbox-role
).Now that you have ingested the content from your Dropbox account into your Amazon Kendra index, you can test some queries.
Go to your index and choose Search indexed content. Enter a sample search query and test out your search results (your query will vary based on the contents of your account).
The Dropbox connector also crawls local identity information from Dropbox. For users, it sets user email id as principal. For groups, it sets group id as principal. To filter search results by users/groups, go to the Search Console.
Click on “Test query with user name or groups” to expand it and click on the button that says “apply user name or groups”.
Enter the user and/or group names and click Apply. Next, enter the search query and hit enter. This brings you a filtered set of results based on your criteria.
Congratulations! You have successfully used Amazon Kendra to surface answers and insights based on the content indexed from your Dropbox account.
The instructions in this post walk you through creating, configuring, and using a temporary access token. Apps can also get long-term access by requesting offline access, in which case the app receives a refresh token that can be used to retrieve new short-lived access tokens as needed, without further manual user intervention. You can find more information in the Dropbox OAuth Guide and Dropbox authorization documentation. Use the following steps to create a permanent refresh token (for example to set the sync to trigger on a schedule):
https://www.dropbox.com/oauth2/authorize?token_access_type=offline&response_type=code&client_id=<appkey>
.You can store this refresh token along with the app key and app secret to configure a permanent token in the data source configuration for Amazon Kendra. Amazon Kendra generates the access token and uses it as needed for access.
This solution has the following limitations:
With the Dropbox connector for Amazon Kendra, organizations can tap into the repository of information stored in their account securely using intelligent search powered by Amazon Kendra.
In this post, we introduced you to the basics, but there are many additional features that we didn’t cover. For example:
allowedUsersColumn
and allowedGroupsColumn
so you can apply access controls based on users and groups, respectivelyTo learn about these possibilities and more, refer to the Amazon Kendra Developer Guide.
Language models — often known for the acronym LLM for Large Language Models, their large-scale…
This post is in two parts; they are: • Understanding the Encoder-Decoder Architecture • Evaluating…
Investment professionals face the mounting challenge of processing vast amounts of data to make timely,…
GenLayer is betting that AI-driven contracts, enforced on the blockchain, will be the foundation for…
The acting inspector general says the Office of Personnel Management is investigating whether any “emerging…
AI models often rely on "spurious correlations," making decisions based on unimportant and potentially misleading…