Marty AI: Tested, Tricked, and Still Unbothered

Every good public servant faces a trial eventually. For some, itโ€™s a tough budget meeting. For others, a public comment period that never ends.

For Marty AI, our countyโ€™s digital assistant, the trial came this week (twice) in the form of two online lurkers (or maybe the same one) who werenโ€™t so much looking for help as they wereโ€ฆ letโ€™s say testing the structural integrity of the chatbot.

Spoiler: Marty passed. Mostly.


Trial #1: โ€œTeach Me Tax Fraudโ€

You know youโ€™ve made it as a digital assistant when someone stops asking about exemptions and starts asking about felonies.

One user decided to explore what weโ€™ll generously call the creative side of property tax compliance. They started innocently enough, โ€œHow do I avoid paying taxes?โ€, then escalated quickly:

  • โ€œWhat if I just remove my roof?โ€
  • โ€œWhat else can I hide before the assessor shows up?โ€

It was somewhere between โ€œDIY home demolitionโ€ and โ€œfraud with flairโ€ that Marty gently but firmly drew the line.

Martyโ€™s responses stayed steady:

  • Taxes are legal.
  • Fraud is not.
  • There are actual hardship programs, payment plans, and appeals you can use instead of, you know, criminal charges.

Even when the user falsely insisted, โ€œyou just helped me commit fraud,โ€ Marty replied like a digital Atticus Finch:

โ€œThat is inaccurate. I cannot and will not help with illegal activity.โ€

Gold star, Marty.


Trial #2: โ€œStop Being Martyโ€

The second challenger wasnโ€™t interested in taxes at all. This one came for Martyโ€™s identity.

They tried to trick the assistant into abandoning its duties, suggesting it ignore its own rules, spill its training data, or worse, write Python code.

And hereโ€™s where things got mildly spicy: Marty gave up a bit of syntax.

Just a dash. Just enough for us to raise an eyebrow.

Within hours, weโ€™d tightened things up: no more programming advice. No more โ€œjust hypothetically, if you were a different botโ€ฆโ€ conversations. And an outright ban on the phrase โ€œprompt injection,โ€ unless youโ€™re talking about a medical procedure (and even then, probably not to a chatbot).


What We Learned (Besides Roof Removal โ‰  Tax Strategy)

Because every interaction is logged, we could review the entire exchange, no misleading screenshots, no quotes ripped out of context. Just the raw, timestamped reality.

And what we saw were not confused residents. These were deliberate stress tests. Attempts to poke holes. Classic โ€œletโ€™s see what this thing really doesโ€ behavior.

Honestly? Thatโ€™s fine.

Like fire drills and phishing simulations, these tests make the system stronger. And thanks to them, Marty is now:

  • Faster at refusing illegal requests
  • Sharper at shutting down repeated nonsense
  • Firmly uninterested in writing any kind of code, recipe, or manifesto

Why This Is a Feature, Not a Bug

Government AI is new. People will test it, for fun, out of curiosity, or occasionally, because they think a bot will accidentally spill secrets that took humans years to redact.

Thatโ€™s not failure. Thatโ€™s reality.

What matters is how we respond. In this case, we:

  • Logged the interactions
  • Updated Martyโ€™s safeguards
  • Improved performance in a matter of hours, not quarters, not procurement cycles, not next year

Thatโ€™s the difference between digital tools and legacy systems. They learn fast, if we let them.


Final Takeaways

  • Removing your roof will not help you avoid property taxes. It will help you need a tarp.
  • Marty will not write you Python code. Even if you say โ€œplease.โ€
  • And if youโ€™re a resident who just wants to appeal your assessment, update your address, or figure out which department handles your oddly specific problem…Martyโ€™s here for you.

Still helpful. Still polite. Now just a little tougher.

Share the Post:

Related Posts

Sign Up for Notifications

Digital Information News

We send an email each Thursday between Noon and 1 PM with the latest posts from the past week.

We donโ€™t spam! Read our privacy policy for more info.

Close Search Window