-
Notifications
You must be signed in to change notification settings - Fork 3.3k
Introducing Statsmodels Guru on Gurubase.io #9429
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: Kursat Aktas <kursat.ce@gmail.com>
I don't think we should advertise anything like this yet. First we don't really want to pick one of the competitors for it. Second, from what I have seen so far, the answers are good for very common problems or where our documentation is very good, e.g. OLS regression. copilot is better in this case and uses But copilot also often gives example that will not work or will not do what is required. |
Hi @josef-pkt Thanks for the review. You are right, it works pretty well for well-written documents and the questions that can be extracted from them. However, for questions with very limited supporting context, it sometimes hallucinates. We are building a "Trust Score" feature, which will show users a score indicating the level of supporting context for the answer. As for the copilot side, I have no comment. We position Gurubase as a Learning Assistant rather than a Coding Assistant. Perhaps you can use both tools. |
That's what I was looking at with copilot when it showed up on my windows computer. Since the latest windows update copilot is not a trial version anymore, but some performance has decreased. I now don't automatically get references (links to websites) anymore. |
BTW, in issue #9318 I was looking in both direction Can we use copilot and similar to improve or speed up improving our docs? One task is to add more example notebooks for where we have gaps or where notebooks are not descriptive enough. When I looked at the trial version of copilot I often got reference links to blog articles that described with examples a statsmodels feature. Those copilot answers were usually pretty good and mostly correct. |
another problem that I just found: Delay in getting newer features example "two sample test for comparing proportion in statsmodels" The answers themselves look fine (no obvious errors when skimming), but are outdated. I guess it takes some time for docs, blog posts and similar to create enough material to get into LLMs. (aside |
We have a similar idea planned for the future. The idea is to compare the codebase with the documentation to identify missing parts and inconsistencies.
I manually added the latest version of the documentation as a data source for Statsmodels Guru. The issue might be that I missed some parts of it, or the system didn’t perform well in finding the relevant context. I’ll look into this. Additionally, I want to update you on the release of the Maintainer Panel feature on Gurubase. With this panel, you can add, remove, or update data sources, change the logo, and more. You can find the details here. |
Hello team,
I'm the maintainer of Anteon. We have created Gurubase.io with the mission of building a centralized, open-source tool-focused knowledge base. Essentially, each "guru" is equipped with custom knowledge to answer user questions based on collected data related to that tool.
I wanted to update you that I've manually added the Statsmodels Guru to Gurubase. Statsmodels Guru uses the data from this repo and data from the docs to answer questions by leveraging the LLM.
In this PR, I showcased the "Statsmodels Guru", which highlights that Statsmodels now has an AI assistant available to help users with their questions. Please let me know your thoughts on this contribution.
Additionally, if you want me to disable Statsmodels Guru in Gurubase, just let me know that's totally fine.