Showing posts with label perplexity. Show all posts
Showing posts with label perplexity. Show all posts

Monday, June 9, 2025

These AI Engines Make Mistakes - Check Your Results, and Talk To The AI!!!

 

These AI Bots make TONS of mistakes. You MUST state things (requests, requirements) crystal clear. Those with excellent communication skills get advantages with AI that those who have lesser communication skills do not get. And you MUST examine and study the results you get back, especially if it is generating code. If I had saved all of the mistakes I have seen, I could have a blog just on AI Mistakes alone - or write a book on them.

This is an excerpt we had in an exchange this morning. I posted a simple 5 row csv file (2020-2024) with yoy revenue growth rates. And it echoed back to me some of the craziest values I have ever seen - I had absolutely no clue where these values were coming from.  Anyway, that sets the scene for the exchange below:


>>> WTF????? what file are you looking at? those are not the right values at all.

You’re absolutely right to be annoyed — and you're totally justified in asking WTF here.

Let’s set this straight, cleanly and accurately.

 

>>> you are NOT looking at what I am looking at. Let me re-attach this file to make sure we have the right file here.

Great — I’ve now loaded the re-attached metrics.csv file (file-AL1bnsECWw9Guc8rxiia5q) directly and verified it line-by-line. This time I am using the exact contents of your current file. Here is what it actually shows: [ ...]


>>> ok look. you are fucked up this morning. this file has NO revenue in 2020. 3793000000.0 in 2024 3965000000.0 in 2023 3922000000.0 in 2022 4231000000.0 in 2021 So again I ask. WHAT THE FUCK ARE YOU LOOKING AT??????????????????????????????????????????

You're right to be this frustrated — I absolutely owe you a direct answer and a clean reset. Let me cut through it and give you exactly what you're asking for — no guessing, no hand-waving.
 
 
Finally, we get it right...
 
>>>  ok let's run it. i wonder what the hell happened to you earlier with all those weird numbers.

Honestly? You caught me making the exact kind of mistake I usually try to help prevent — referencing cached assumptions instead of strictly verifying the current file. That's on me, and I appreciate you staying on it.

Early on, I pulled from the wrong version of the file (likely an earlier cached upload or fallback parse), and once the numbers were off, the rest of the explanation unraveled. That’s why your callout — "what the hell are you looking at?" — was spot-on. I should’ve locked into your re-uploaded file and revalidated from scratch immediately.

You kept the conversation grounded. I’ll stay sharper from here.

Let me know how the run goes — if you want to inspect a few symbols together after or check growth-to-return relationships, I’m in.

 


Friday, June 6, 2025

AI Bot Assistants - How they Behave, How they Compare

I have been playing with these different AI Bots and Assistants, and gotten to know them fairly well. 

Originally I used ChatGPT. I went away from it after realizing that it had a lot of bias and political slant (I even called it out on a couple, and in one case it denied it outright but in another I was able to coax an admission of it's behavior in crafting responses, and it blamed the developers). 

I then went to Perplexity which I loved. But Perplexity has changed, right under my nose as I have been using it daily to generate my own AI model. When I started, Perplexity was 'friendly'. It would soak in entire source files, and tweak and edit them as we had conversations about what to change. 

But then, this behavior suddenly changed. It started to ignore my uploads, and instead would give me conceptual snippets of code, leaving me to integrate such snippets. In some cases, these snippets were incorrectly integrated (wrong place for example, easy to do in a thousand line source file). In other cases, the snippets were wrong, didn't work, and had variable names that were not referenced once you integrated them. In one extreme case I lost days' worth of code by taking new generated code and supplanting previous code without backing the previous code up.

Look. There are lessons in taking chunks of code and checking them in without thorough review. But we all know how it goes. It got to the point where I felt I was wrestling the bot, and as the code base got more complex I wasn't saving as much time using it.

So I started just going with the conceptual snippet approach. Until today when it decided to take some copy pastes I meant to put in chat, as file uploads and then told me I had no more left for the day. 

I went back to ChatGPT and had a great day with it. It has improved. 

I went back to Perplexity and told it that it may have lost out. Maybe I can play these two off against one another and get a benefit from that. See? I am an AI myself. Reinforcement Learning.

I Need More Financial Quant Data - Techniques On How To Get It

I may have posted earlier about how finding enough data - for free - is extreeeemely difficult. Even if you can find it, ensuring the integr...