feat: Add new Atlassian Confluence Component for document loading and vector database integration #2718

danielgines · 2024-07-16T00:39:54Z

Adds new Atlassian Confluence Component

Implements ConfluenceComponent to load documents from the Atlassian Confluence platform.
Adds necessary inputs, including URL, username, API key, space key, and more.
Supports configuration of max_pages for pagination control.
Allows documents to be loaded into a vector database for queries.

This new module facilitates integration with the Atlassian Confluence platform.
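As a rough illustration of the inputs listed above, the sketch below groups them into a small validated settings object. `ConfluenceSettings`, `to_loader_kwargs`, the validation rules, and the default of 1000 for `max_pages` are all hypothetical and are not taken from the PR's code; only the input names (URL, username, API key, space key, `max_pages`) come from the description.

```python
from dataclasses import dataclass
from typing import Any, Dict


@dataclass(frozen=True)
class ConfluenceSettings:
    """Hypothetical container for the inputs named in the PR description."""

    url: str
    username: str
    api_key: str
    space_key: str
    max_pages: int = 1000  # assumed default, not taken from the PR

    def __post_init__(self) -> None:
        # Reject obviously unusable values before any request is made.
        if not self.url.startswith(("http://", "https://")):
            raise ValueError("url must start with http:// or https://")
        if not self.space_key:
            raise ValueError("space_key must not be empty")
        if self.max_pages < 1:
            raise ValueError("max_pages must be a positive integer")

    def to_loader_kwargs(self) -> Dict[str, Any]:
        """Return the settings as keyword arguments for a document loader."""
        return {
            "url": self.url.rstrip("/"),  # normalize a trailing slash
            "username": self.username,
            "api_key": self.api_key,
            "space_key": self.space_key,
            "max_pages": self.max_pages,
        }


settings = ConfluenceSettings(
    url="https://example.atlassian.net/wiki/",
    username="user@example.com",
    api_key="dummy-token",
    space_key="DOCS",
    max_pages=50,
)
kwargs = settings.to_loader_kwargs()
```

Validating up front means a bad `max_pages` or an empty space key fails at configuration time rather than partway through a load.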

Add gemma2 to groq_constants.py

- Implements ConfluenceComponent to load documents from the Confluence platform.
- Adds necessary inputs, including URL, username, API key, space_key, and more.
- Supports configuration of max_pages for pagination control.
- Implements lazy loading in the load_documents method for incremental document processing.
- Allows immediate processing of documents as they are loaded.

This new module facilitates integration with the Confluence platform and enables efficient handling of large volumes of data.
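The lazy loading and `max_pages` cap mentioned in this commit message can be pictured with a generator. This is only a sketch of the general idea, not the component's code: `fake_space`, `lazy_load_pages`, and `process_incrementally` are hypothetical names, and the page source is a stand-in rather than a real Confluence API call.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List


def fake_space() -> Iterator[str]:
    """Stand-in for a Confluence space; a real loader would call the API."""
    n = 0
    while True:  # deliberately unbounded to show that the cap matters
        n += 1
        yield f"page-{n}"


def lazy_load_pages(source: Iterable[str], max_pages: int) -> Iterator[str]:
    """Yield at most max_pages items from source, one at a time."""
    yield from islice(source, max_pages)


def process_incrementally(pages: Iterable[str], handle: Callable[[str], None]) -> int:
    """Hand each page to `handle` as soon as it is produced; return the count."""
    count = 0
    for page in pages:
        handle(page)
        count += 1
    return count


seen: List[str] = []
processed = process_incrementally(lazy_load_pages(fake_space(), max_pages=3), seen.append)
```

Because nothing is materialized into a list before processing, memory use stays flat no matter how many pages the space contains.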

- Implements ConfluenceComponent to load documents from the Confluence platform.
- Adds necessary inputs, including URL, username, API key, space key, and more.
- Supports configuration of max_pages for pagination control.

This new module facilitates integration with the Confluence platform.

github-actions · 2024-07-16T00:40:42Z

Pull Request Validation Report

This comment is automatically generated by Conventional PR

Whitelist Report

Whitelist	Active	Result
Pull request is a draft and should be ignored	✅	❌
Pull request is made by a whitelisted user and should be ignored	❌	❌
Pull request is submitted by a bot and should be ignored	✅	❌
Pull request is submitted by administrators and should be ignored	❌	❌

Result

Pull request does not satisfy any enabled whitelist criteria. Pull request will be validated.

Validation Report

Validation	Active	Result
All commits in this pull request have valid messages	❌	✅
Pull request does not introduce too many changes	❌	✅
Pull request has mentioned issues	❌	✅
Pull request has valid branch name	❌	✅
Pull request should have a non-empty body	✅	✅
Pull request has a valid title	✅	❌

Result

Pull request is invalid.

Reason

Pull request title does not follow the desired pattern

Last modified at 16 Jul 24 00:40 UTC

aws-amplify-sa-east-1 · 2024-07-16T00:43:16Z

This pull request is automatically being deployed by Amplify Hosting (learn more).

Access this pull request here: https://pr-2718.dmtpw4p5recq1.amplifyapp.com

ogabrielluiz

Hey @danielgines

This looks awesome! Thank you.

LGTM.

By the way, the component has a to_data method that you could use instead of the docs_to_data function.

danielgines · 2024-07-16T16:53:40Z

> Hey @danielgines
>
> This looks awesome! Thank you.
>
> LGTM.
>
> By the way, the component has a to_data method that you could use instead of the docs_to_data function.

@ogabrielluiz Thank you for the feedback and the suggestion!

Is the Data module's from_document method the one you were referring to? It would be something like this:

def load_documents(self) -> List[Data]:
    confluence = self.build_confluence()
    documents = confluence.load()
    data = [Data.from_document(doc) for doc in documents]  # Using the from_document method of Data
    self.status = data
    return data
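To make the proposed conversion concrete without depending on Langflow, here is a self-contained sketch. The `Document` and `Data` classes below are simplified stand-ins, not Langflow's or LangChain's real classes, and the layout of the resulting payload (metadata plus a `"text"` key) is an assumption made for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Document:
    """Stand-in for a LangChain-style document: text plus metadata."""

    page_content: str
    metadata: Dict[str, Any] = field(default_factory=dict)


@dataclass
class Data:
    """Stand-in for Langflow's Data class; only the shape needed here."""

    data: Dict[str, Any] = field(default_factory=dict)

    @classmethod
    def from_document(cls, document: Document) -> "Data":
        # Copy the metadata so the source document is not mutated, then
        # store the page text under a "text" key (assumed layout).
        payload = dict(document.metadata)
        payload["text"] = document.page_content
        return cls(data=payload)


def documents_to_data(documents: List[Document]) -> List[Data]:
    """Convert every document with the classmethod, as in the snippet above."""
    return [Data.from_document(doc) for doc in documents]


docs = [Document("Release notes", {"title": "v1.0"})]
records = documents_to_data(docs)
```

Putting the conversion on the `Data` class itself keeps the loader free of a separate helper such as `docs_to_data`.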

- Changed load_documents method to convert documents using Data.from_document instead of docs_to_data for better integration with Data module.
- Updated trace_type to "tool" because the LangSmith API only supports one of the following types: ["tool", "chain", "llm", "retriever", "embedding", "prompt", "parser"].
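The trace-type constraint in this commit message can be expressed as a small guard. The tuple of allowed values is copied from the message; `normalize_trace_type` and its fall-back-to-`"tool"` behavior are hypothetical and only illustrate one way to avoid sending an unsupported type.

```python
# Allowed values as quoted in the commit message.
ALLOWED_TRACE_TYPES = ("tool", "chain", "llm", "retriever", "embedding", "prompt", "parser")


def normalize_trace_type(trace_type: str, fallback: str = "tool") -> str:
    """Return trace_type if it is supported, otherwise the fallback.

    The fallback behavior is an assumption for this sketch, not what the PR does.
    """
    candidate = trace_type.strip().lower()
    return candidate if candidate in ALLOWED_TRACE_TYPES else fallback


supported = normalize_trace_type("Chain")
unsupported = normalize_trace_type("document_loader")
```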

… into feat/documentloaders/confluence

… vector database integration (langflow-ai#2718)

* feat: Add Gemma 2 to Groq model list (langflow-ai#2586)

  Add gemma2 to groq_constants.py

* Adds new ConfluenceComponent module with lazy loading support

  - Implements ConfluenceComponent to load documents from the Confluence platform.
  - Adds necessary inputs, including URL, username, API key, space_key, and more.
  - Supports configuration of max_pages for pagination control.
  - Implements lazy loading in the load_documents method for incremental document processing.
  - Allows immediate processing of documents as they are loaded.

  This new module facilitates integration with the Confluence platform and enables efficient handling of large volumes of data.

* Adds new ConfluenceComponent module

  - Implements ConfluenceComponent to load documents from the Confluence platform.
  - Adds necessary inputs, including URL, username, API key, space key, and more.
  - Supports configuration of max_pages for pagination control.

  This new module facilitates integration with the Confluence platform.

* Updated load_documents method to use Data.from_document

  - Changed load_documents method to convert documents using Data.from_document instead of docs_to_data for better integration with Data module.
  - Updated trace_type to "tool" because the LangSmith API only supports one of the following types: ["tool", "chain", "llm", "retriever", "embedding", "prompt", "parser"].

* [autofix.ci] apply automated fixes

---------

Co-authored-by: Gordon Stein <[email protected]>
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>

(cherry picked from commit 114cdb9)

gsteinLTU and others added 6 commits July 8, 2024 12:21

feat: Add Gemma 2 to Groq model list (langflow-ai#2586)

d793341

Add gemma2 to groq_constants.py

Merge branch 'langflow-ai:main' into dev

e336c3f

Merge branch 'langflow-ai:main' into dev

3b46aa8

Merge branch 'langflow-ai:main' into dev

773d226

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. enhancement New feature or request labels Jul 16, 2024

danielgines changed the title ~~Adds new Atlassian Confluence Component~~ feat: Add new Atlassian Confluence Component for document loading and vector database integration Jul 16, 2024

github-actions bot added enhancement New feature or request and removed enhancement New feature or request labels Jul 16, 2024

Merge branch 'langflow-ai:main' into feat/documentloaders/confluence

6f6e589

ogabrielluiz approved these changes Jul 16, 2024


dosubot bot added the lgtm This PR has been approved by a maintainer label Jul 16, 2024

danielgines and others added 6 commits July 16, 2024 15:18

Merge branch 'langflow-ai:main' into feat/documentloaders/confluence

4a9526f

Merge remote-tracking branch 'origin/feat/documentloaders/confluence'…

d1e8f36

… into feat/documentloaders/confluence

Merge branch 'langflow-ai:main' into feat/documentloaders/confluence

22f032d

[autofix.ci] apply automated fixes

8926418

Merge branch 'main' into feat/documentloaders/confluence

c88c4cf

ogabrielluiz enabled auto-merge (squash) July 17, 2024 17:29

ogabrielluiz merged commit 114cdb9 into langflow-ai:main Jul 17, 2024
46 checks passed
