-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EPIC AI products #30297
Comments
(also Tim and I are very very happy to invest very seriously in this area, we could add lots of engineers as we need, but obviously that can make things way slower) |
I am a big fan of the idea, and we should invest all resources. No matter the cost. |
Also: I don't think I was invited to Paris. |
Some random feedback. New ModesCurrent IDEs only have SWE modes: chat (user <-> LLM), implement (agent: plan and code), and fix (code patch). Let's bring a new mode–Product Manager. LLMs are good at planning and implementing, so we can build a multi-agent system where the PM agent orchestrates the work of SWE agents based on the product-level data. Session ReplaySession replay is extremely useful for this product and the most complex for LLMs at the same time. Replay uses rrweb, so the payload is huge. This makes it challenging to convert to natural language, and no existing solution can transcribe replay on a scale with the appropriate cost. At the same time, users have real problems to solve with AI there:
$0.005/session = 50k tokens on Mistral Small - impossible to feed the entire session. I've been thinking about clustering sessions based on the features so we can later process samples: collected events, sessions dropped from key metrics (such as a funnel for the checkout flow), ML-based feature extraction from payloads (no ideas yet), some algorithmic extraction from the JSON (such as pre-processing to transcribe first to consume fewer tokens). Based on those features (tags), we can partially solve the volume problem so similar sessions can be grouped. More advanced processing can be done to group sessions based on their transcripts and extract actionable feedback: using a multimodal LLM to find visual problems, detecting errors affecting key metrics the most, understanding user intent and behavior, and more. This is something we must start doing ASAP, as session replay seems to me to be the "glue" for Max and for the Editor. Other products are in a better position to be LLM-ready. Product/Company MemoryThere is a potential for product/company memory:
We could make the information consumable by the agents by collecting a memory bank, which will greatly help with the decision-making, such as proposing features to build, retrieving user requests, etc. Suggestions of what to buildLLMs are really bad at decision-making. I would start by indexing the customer requests, operating on top of the roadmap, or key metrics. Those resources are more likely to contain actionable feedback, so LLMs won't struggle. Otherwise 👀 MVPI think points 1 and 2 are realistic for the hackathon. We're ready to go with taxonomy query runners and the RAG pipeline for the event instrumentation. Technical Questions
We can render insights and filters into a Webview component (or actually any UI we have). The UI won't be perfect in some cases but should work. |
love the combination of code + app/product/behavioral data, that's how future products will be built and ai agents can do improve products autonomously based on user feedback and app usage. to put the same idea as in the desc. visually, i see ide extension, desktop app (and even mcp), as a new interfaces we are introducing for posthog customers (first focusing on it as a new ui/ux, and then ai to boost it and make it 100x more useful) currently posthog insights are available outside of where our core icp hangs out the most, i.e. their code editor. which means we need to:
the process 👆🏼 is time consuming, error prone and manual. imagine a scenario where:
to summarize, i think it makes a lot of sense to move closer to where our icp spend 90% of their time, and with the extension or ide, we can make it to 99%, helping them to only focus on big picture and core functionality and we help them to:
p.s. in the future scenario, a potential "extension" to the platform can be the deployment infra, download posthog ide, build product, ship it, sell it (with our crm), measure it, improve it, all in one platform 🚀. |
i love this @skoob13. do you want this to be ai-first from get go, or bring in the data (similar to our web app) for human in the loop feedback, and automate more and more. main reason i'm thinking bringing in data can be done first is that:
hence, we can focus on:
p.s. a random idea: another "interface" for posthog can be github app and actions, similar to graptile comments |
The Replay team has done some (not so successful work) with clustering in the past and we let it rest a bit, but we now have two super AI keen people in the team, and while we still have to do our planning, this will likely be one of our key objectives for next quarter. We also have an offsite next week, where we were planning to speak about this. Since Replay is already integrated into Max AI, we can chip away at making Replay more intelligent over time, and if Max AI is available in a code editor, Replay insights will be available as well. But yeah, the key thing for Replay in an AI-first world is to answer: paging @pauldambra @veryayskiy @sortafreel |
yep, i'd be amazed if we didn't end up with something around the "watch my" or "find my" recordings for Q2 |
What do we think about the marketability/distributability of an extension? I've always found it odd that extensions aren't more popular (besides the typical ones you'd need such as language support). Or maybe, I'm totally unaware of the popularity. But especially with recent trends, it seems like people are more ready to try new/forked IDEs than extensions |
This looks great, I'm excited to see how this progresses! Couple of things I would personally love to see in terms of actionable feedback in a PostHog oriented IDE experience:
![]()
Some things that could help us achieve this faster:
Some things that don't feel that useful:
|
@HamedMP I think this is well worth us exploring - it'd be nice to have an experience in Max AI where we upload the diff from the PR and it gives you a bunch of actionable comments based on the analytics of the things you are changing |
@joshsny oh, you sparked the idea that just like autocapture in web, we can do autocapture for components (similar to react dev component view) to measure each components performance/actions/bugs (connected with recording json data) re. the pr diff idea, 💯%, i think that'd be really cool to hack together! |
We have this concept in feature flags, well at least theoretically. The idea was for developers to highlight components in the code, that then would be tracked in PostHog automatically as a “feature”. TBH I haven’t looked into it much whether this is used at all, it lacked strong engineer ownership when we built it, so we never really got somewhere with it. I am not sure I can directly link to it (on my phone), but it’s the “Method 2: Using the PostHogFeature component” section in our React docs: https://posthog.com/docs/libraries/react Anything we can do to make this easier/automatic to set up in an AI first world would be awesome, I think it was a hard concept to teach to engineers and learn how to use, at the right point in time. |
@EDsCODE's
💯 We could start with an extension, but I'm certain that's not where the VC scale win is. We should fork VS Code from the get go and go against Curs0r et al. This is significantly more ambitious – to be competitive, we'll need not just a great AI agent, but also excellent AI autocomplete – but that's how we build the ultimate Product OS for the AI-powered future of building. Extremely ambitious to be honest, but our tagline is "how developers build successful products". We've already put a surprising amount of thought into the IDE UX with @corywatilo's IDE-like designs for the web app and we don't actually need to worry about distribution, especially with our brand. But we must 100% create unprecedented value. In coding, we're fortunate to have a late mover's advantage. We can see what works out there. Still a major investment, but I'm confident we can ship a great experience there and embed into the builder's relationship with code. Then, when we own the relationship, we get into novel territory. In-IDE integration with analytics, flags, error tracking, etc. becomes 100% natural, and you can ask the AI agent to tackle any of these problems right away. Imagine finding the most common backend error, and solving it in one click without ever leaving the IDE. Then you see a funnel drop-off problem – the agent addresses it right away, and after checking that the event is instrumented correctly, it drafts a proposed UX change and the right experiment to A/B test it. Short framing: |
The only reason i've said not to do this is i've herad it's hassle, if you're up for it let's do this. it'd be much better positioning wise. Can we build it into a fork of vscode as an extension so we can also distribute into curs0r etc too? or does this limit us? |
we could just sample and assume we'll catch most stuff this way anyway that affects the most users |
Love this. I can excitingly see a whole world where IDE becomes what people think of as our main product and fully cements the Product OS vision. |
A lot of super exciting stuff in here! Sharing some raw high-level thoughts about key considerations and trade-offs involved between areas I'd consider important here - ICPs, interfaces, and business models. ICP vs interface
Interface trade-offs
Business model vs interface
With this in mind, I'd lean toward starting with a lighter interface that can integrate well into existing tools (extensions, MCPs)
One area that I might be underestimating is the limitations that certain interfaces will impose on the UX we can deliver. But there seem to be so many low-hanging fruits in integrating product analytics/sessions/error tracking (and many others) into the development experience that even with the more basic interfaces, we should be able to deliver a lot of unique value. |
Could we do both? VS Code fork team vs. VS Code extension team and see which can win the greatest adoption? |
Context / Goal
AI editors now exist that do everything inside your code.
PostHog does everything outside the code.
Bring them together - and you have products that make themselves successful
Behold - the PostHog Editor.
Proposal
I propose we build everything into a VSCode extension.
I think there are 4 products within this:
Longer term
We fork vs code and create a desktop app.
PMs use it to see suggestions of what to work on and to dig deeply into product data. Myabe even Designers use it to tweak the designs
Engineers use it to code the things the team should work on
Salespeople use it to figure out how customers are using products/trials and so on, and to create deals/operate it as a CRM
Marketing people use it to configure messaging campaigns, do conversion rate optimziation or to ask deep data questions and to configure ad campaigns
Support people use it to handle tickets, with AI powering the answers
Whereas teams used to have 10 developers, 10 salespeople, 10 marketing people, etc, they will just have 1 of each at the same revenue. This person will "configure" how the function operates, then AI will do all of it.
How do we launch this
Pretty much any of these distinct "products" in the extension adds enough value i'd argue we can launch with just one of them.
We have a huge launch thing happening in early May. This is the goal to promote the PostHog Editor.
Technical questions
Max displays data. How painful is it to do this in an extension? I think if we push editing via max / chat then we can avoid having to create ie interactive filters with the escape path being - click it, view the insight on the web to edit
How do we preview suggested new features to a user? is text enough? does any IDE do a good job of this?
Notably missing
after chatting with Hamed I don't think MCPs make a lot of sense here, which feels odd given the industry seems to want to go that way. The extension just means we can do way more as far as i can tell.
What it would look like
product features aren't listing well yet but you get the idea
The text was updated successfully, but these errors were encountered: