Dear author, I encountered some issues while running the training for the four circuits using your code. The TSA circuit training is working normally; it reached a success rate of around 90% after 12,000 steps, which is consistent with the paper. However, I am observing performance drops in the other circuits: LDO: It only achieved a success rate of around 60% after 12,000 steps. CMA: It only reached a success rate of around 10% after 24,000 steps, which is a significant gap compared to the results reported in the paper. Could you please advise on what might be causing these discrepancies? Thank you for your help.
Dear author, I encountered some issues while running the training for the four circuits using your code. The TSA circuit training is working normally; it reached a success rate of around 90% after 12,000 steps, which is consistent with the paper. However, I am observing performance drops in the other circuits: LDO: It only achieved a success rate of around 60% after 12,000 steps. CMA: It only reached a success rate of around 10% after 24,000 steps, which is a significant gap compared to the results reported in the paper. Could you please advise on what might be causing these discrepancies? Thank you for your help.