2026.06.09 (Tue)
2026.06.10 (Wed) updated

✨ GPT-5.5’s Summary  

A day when I recorded my frustration with Codex and GPT-5.5, still finished the visitor analytics feature and Tadak Bible cleanup, and decided to use the 300,000 won subscription heavily for one month.

💭 Diary

A few days ago, I wrote Whether Claude Code or Codex, It Is a Brilliant but Broken, Clueless Genius, and today ended up being an extension of that.

When I argue with Codex, I feel like I am becoming stupid along with it. How well it senses what is “obvious without needing to be said” feels like model performance, that is, intelligence. How accurately it carries out what I clearly instructed feels like harness performance, that is, system structure. And GPT 5.5 is simply worse than Opus 4.5 in model performance. It is so frustrating I feel like I am going insane. The harness itself seems to be set up about as well as Claude Code, so I tried to put up with Codex… but because GPT’s intelligence / reasoning ability feels lacking, the prompt instructions have to become pointlessly specific, and that makes my irritation surge.

To be precise, this is closer to my felt comparison from using Codex and Claude Code than to a pure model comparison proven by benchmarks. Reddit reactions are divided too. There are migration posts saying GPT-5.5/Codex is better, reactions saying Opus/Claude Code is better, and reactions saying Codex is slow but careful while Claude Code is fast.12345678

Well… anyway, two days ago I made the Pro 20x 300,000 won payment and restarted the cliff-edge tactic, so I should use it as heavily as possible for one month. And because I had used Claude Code for so long, I had forgotten for a moment, but remembering how Opus 4.5 suddenly became unable to search properly, its intelligence collapsed, and it spammed effort max makes me angry again. Yeah… just use it for now.

🧭 Today’s Check-In (Daily Review)

Today’s Win/Loss

  • Win: I dug in persistently and finally completed the visitor analytics feature, and Tadak Bible finally resumed.
  • Loss: Dieting is really too hard.

Today’s Core Goal

  • Blog: publish the visitor analytics aggregation feature
  • Tadak Bible: reduce to 7 languages
    • Completion criteria: reduce docs/features from 22 languages to 7 languages
    • Why it matters: Supporting 22 languages consumes too many resources for i18n and translation.

Today’s Tasks

  • Applied on Albamon/JobKorea

Output

  • What I left today: blog feature, Tadak Bible docs

Escape Record

  • Escape before output: None
  • Reason for escaping: None

Body Record

  • Weight: 85.1kg
  • Diet
    • Breakfast: none
    • Lunch: pork belly jeyuk-bokkeum, cabbage
    • Dinner: curry rice, boiled eggs, sausage vegetable stir-fry, cup ramen
  • Exercise: 10000 steps

🤖 GPT-5.5 Coaching (KPT)

Execution: Elon Musk | To Turn Imagination into Reality

  • Keep: Even while arguing with Codex and getting furious, you still closed the day with real output: visitor analytics published, Tadak Bible reduced to 7 languages, and Albamon/JobKorea applications submitted.
  • Problem: If anger at tool performance grows too large, the feeling of “why can’t it understand this” can swallow the fact that you produced results today.
  • Try: The 300,000 won subscription is not comfort money; it is output pressure. Even on a day like this, the end must leave a feature, documents, and applications.
  • Question: What moved you today: anger at the tool, or the persistence to use that anger and still make something real?

Monitoring: Socrates | To Know Myself

  • Keep: Before calling it a “pure model comparison,” you stepped back and called it a felt comparison, and you also recorded that Reddit reactions are divided.
  • Problem: Even so, the conclusion that “GPT’s intelligence is lacking” has not fully separated environment, harness, prompting, and familiarity.
  • Try: Today’s monitoring is not about removing anger, but asking to the end how far anger ran ahead of factual judgment.
  • Question: Are you seeing the limit of the tool, or judging unfamiliar Codex by the standard of the Claude Code you were used to?

Health: Hippocrates | To Make the Body the Foundation of an Honest Life

  • Keep: You recorded 85.1kg, no breakfast, jeyuk-bokkeum with cabbage, curry rice and cup ramen, and 10000 steps without hiding them.
  • Problem: 10000 steps are good, but skipping breakfast and then piling curry rice, eggs, sausage vegetable stir-fry, and cup ramen into dinner is the bodily evidence behind “dieting is really hard.”
  • Try: The body record is not for self-blame but for seeing the pattern. Today’s core signal is: “I walked, but my meal rhythm shook.”
  • Question: Was the difficulty of dieting today a lack of will, or the body’s protest after the day’s energy rhythm collapsed?

Mental Strength: Viktor Frankl | To Hold Meaning Even in Suffering

  • Keep: You did not let irritation and anger end as an explosion; you barely turned them toward the meaning of “use it heavily for one month.”
  • Problem: Feeling like you are becoming stupid while arguing with Codex means the tool problem invaded your sense of yourself.
  • Try: The meaning to hold today is not joy at meeting a good tool, but the attitude of turning bad experience, 300,000 won, and one month into an experiment.
  • Question: Will you receive this frustration only as insult, or also as material for a one-month experiment?

Philosophy: Nietzsche | To Live as a Peaceful Übermensch with Jesus

  • Keep: Even when the tool displeased you, you did not stop; you pushed through visitor analytics and Tadak Bible cleanup. There was strength.
  • Problem: But if the anger of “I feel like I am becoming stupid” hardens into contempt for people and tools, that strength becomes cynicism, not self-overcoming.
  • Try: A Peaceful Übermensch with Jesus neither worships a stronger tool nor collapses while cursing a stupid one. Turn strength toward output.
  • Question: What must you overcome: Codex, or the old habit inside you where anger becomes the master whenever frustration appears?

Inner Faith: Augustine | To Set the Order of Love Right

  • Keep: Even in anger, you tried to distinguish fact from felt experience, and instead of abandoning the tool, you set the order of using it properly for one month.
  • Problem: But today the heart was easily pulled toward efficiency, intelligence, performance, and getting your money’s worth. The tool’s response quality could easily take the throne of the heart before the conscience before God.
  • Try: The order of love does not end with not hating the tool. Even when anger rises, it means placing the center of the heart back on entrusted responsibility, not performance evaluation.
  • Question: Was your love today in the work entrusted before God, or in a smart tool that immediately understands your words?

Practical Faith: Bonhoeffer | To Live Beyond Cheap Grace and Into Responsibility

  • Keep: You did not merely complain; you left traces of responsibility: public blog visitor analytics, Tadak Bible language reduction, and job applications.
  • Problem: For “just use it for now” to become responsibility, the actual month of use must be verified by output. If you only pay and repeat anger, that is cheap resolve.
  • Try: Practical faith is not justifying today’s rage, but walking the narrow path of finishing what was entrusted despite that rage.
  • Question: Was today’s 300,000 won a purchase for a more comfortable tool, or the cost of obedience to use it responsibly for one month?

James 1:20 (Korean Revised Version) for human anger does not produce the righteousness of God.

Passage context: James speaks of trials and wisdom, then urges believers to be quick to hear, slow to speak, and slow to anger.

Reason for citation: Because today, while arguing with Codex and feeling irritation surge, you need to keep anger from becoming the master of judgment and execution.

Colossians 3:23 (Korean Revised Version) Whatever you do, work heartily, as for the Lord and not for people.

Passage context: Paul speaks of the new life in Christ and urges ordinary relationships and labor to be handled as work done for the Lord.

Reason for citation: Because today’s output, including visitor analytics, Tadak Bible cleanup, and job applications, needs to be ordered as responsibility done for the Lord, not as an evaluation of tool performance.

1 Corinthians 9:25 (Korean Revised Version) Every athlete exercises self-control in all things. They do it to receive a perishable wreath, but we an imperishable one.

Passage context: Paul explains self-discipline for the gospel through the image of an athlete, speaking of running with direction.

Reason for citation: Because both the conclusion to use the 300,000 won subscription heavily for one month and the body record showing how hard dieting feels need the direction of self-control.

References

  1. Reddit r/ClaudeCode, Impressions two weeks after moving from Claude Code to Codex. A migration post that views Codex’s code quality, output transparency, and usage policy positively. 

  2. Reddit r/codex, GPT 5.5 vs Opus 4.7, which one is better for coding/programming?. A comparison thread where reactions include both using GPT-5.5 and Opus by task type and preferring Opus. 

  3. Reddit r/codex, Those of you who switched from Claude Code to Codex - what does Codex do better? Worse?. Reactions are mixed, with Codex seen as strong in problem solving, algorithms, and backend work, while Claude Code is seen as better for web work. 

  4. Reddit r/ClaudeCode, Claude Code vs OpenAI Codex?. Reactions praise Codex’s usage and error profile while still rating Claude Code highly for speed. 

  5. Reddit r/ClaudeCode, Claude Code (~100 hours) vs. Codex (~20 hours). A comparison that includes using Codex review and Claude Code implementation together. 

  6. Reddit r/codex, Is it just me, or is Claude pretty disappointing compared to Codex?. A thread gathering reactions from users who moved from Claude to Codex and did not regret it. 

  7. Reddit r/codex, According to DeepSWE, GPT-5.5 high is better than Opus-4.8 max. A post arguing GPT-5.5 is better than Opus on a benchmark, while also mentioning posts that feel GPT-5.5 has degraded. 

  8. Reddit r/codex, With this setup CODEX is far better than Claude Code. A user’s comparison saying Codex is better for coding and system architecture, while Claude Code is better for small fixes. 

Leave a comment