Skip to content

issues Search Results · repo:sfujim/TD3_BC language:Python

Filter by

5 results
 (72 ms)

5 results

insfujim/TD3_BC (press backspace or delete to remove)

Hello, want to know if this loss function -lmbda * Q.mean() + F.mse_loss(pi, action) setup is reasonable, because when Q is greater than 0, the TD3 loss term becomes a constant, which essentially degenerates ...
  • stvsd1314
  • Opened 
    on Feb 13
  • #5

Hi! I have the same question as a previously closed issue. I wasn t able to reproduce results for Antmaze tasks in Table 8. I made the following adjustments in run_experiments.sh, 1. change envs to Antmaze; ...
  • Penguin0007
  • Opened 
    on Sep 5, 2024
  • #4

Couldn t find the usage of expl_noise in actual td3 implementation
  • kentwhf
  • 1
  • Opened 
    on May 14, 2023
  • #3

Hi, I would like to ask the setting about the experiments in Antmaze. Should I need to tune the hyparameters for mujoco locomotion? I find I cannot reproduce the results about Antmaze in the paper. ...
  • lucasliunju
  • 1
  • Opened 
    on Nov 23, 2022
  • #2

I run the code in halfcheetah-expert-v0, and it seems to work well, but its performance metric d4rl_score is only about 1.1-1.2, and the result of the paper is about 110-120, I am confused. (my mujoco ...
  • TianQi-777
  • 2
  • Opened 
    on Nov 13, 2021
  • #1
Issue origami icon

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues
ProTip! 
Press the
/
key to activate the search input again and adjust your query.
Issue origami icon

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues
ProTip! 
Press the
/
key to activate the search input again and adjust your query.
Issue search results · GitHub