← index #321Issue #316
Related · high · value 3.738
QUERY · ISSUE

Umqtt.robust: subscription vanished after reconnect

openby JDchauhanopened 2018-11-27updated 2023-08-12

Board used
ESP8266

What is the problem:
Umqtt.robust() on reconnection will forget the previously subscribed topics i.e, it do not fetch messages of subscribed topics after reconnection.

When the problem occurs (how to mimic it)
after successful subscription of topics and verifying that everything works fine just turn off your internet connection. Then after 5 minutes restart internet connection.

(Note: try to use publish-message with try-catch before check_msg() otherwise u might get stuck with #192 )

Then you will observe that no message is being fetched by the check_msg() on that topic.

How to solve this
This might not be a good solution but for now, I am just subscribing all the topics again whenever the connection lost

12 comments
kevinkk525 · 2018-12-03

That's the only working solution and the best one. I'm doing so as well with another mqtt client.
The only other solution would be to connect without a clean connect but that can result in getting all missed messages between connection loss and reconnection and is therefore not advised to do as it would likely crash your small microcontroller.

turutupa · 2020-02-11

How to you re-subscribe to all topics on connection lost??

kevinkk525 · 2020-02-11

You have all your subscriptions stored and on reconnect you just subscribe all of them.

turutupa · 2020-02-11

But I mean. If on my main while loop I only have client.wait_msg(). Where exactly do I execute the re-subscribe? Is there an onconnect / onreconnect function or something like this? (thanks for the quick response btw!)

kevinkk525 · 2020-02-11

Hmm you are right, the umqtt.robust doesn't really offer a convenient way of recognizing a reconnect.. I haven't used this code in years. I use an async mqtt client https://github.com/peterhinch/micropython-mqtt

It looks like you need to create a subclass so you can change the way the reconnect works so you can subscribe after a reconnect.
But maybe someone else who actually uses this code has a better solution.

JDchauhan · 2020-02-12

How to you re-subscribe to all topics on connection lost??

I have maintained list of subscribed topic in my code itself, Whenever It gots to know about disconnection and gets reconnected I will again subscribing them, for prompt of disconnect I am running the publish event (just for checking) in the try block and checking problems in catch if any comes

turutupa · 2020-02-12

UUuuuh, nice. Hadn´t thought about publishing just to check if it is subscribed to a topic, nice approach! Mainly cause I have only used umqtt to receive messages.

Now I fully understand your note

try to use publish-message with try-catch before check_msg() otherwise u might get stuck with #192 )

Thanks!

turutupa · 2020-02-27

Hey guys, I am back and not with good news hehe.. I was wondering, @JDchauhan could you provide code to your publish-subscribe? I haven't spent too much time on this, but I must admit I haven't managed to make it work. I have tried to try-catch publish and then check_msg but it gives me mqtt: OSError(-1,)

Hoping to find a robust way to make it work

IMPORTANT EDIT: forgot to mention. If I only publish every 5 ~ 10 seconds its fine. It does publish. Gives me error when I check_msg() after publishing

I have to add I am using Adafruit IO to publish-subscribe... could that be issue? What service are you using? Perhaps I should run my own mosquitto on a pi? I rather use Adafruit for IFTTT...

alessionossa · 2023-06-14

Hey guys, I am back and not with good news hehe.. I was wondering, @JDchauhan could you provide code to your publish-subscribe? I haven't spent too much time on this, but I must admit I haven't managed to make it work. I have tried to try-catch publish and then check_msg but it gives me mqtt: OSError(-1,)

Hoping to find a robust way to make it work

IMPORTANT EDIT: forgot to mention. If I only publish every 5 ~ 10 seconds its fine. It does publish. Gives me error when I check_msg() after publishing

I have to add I am using Adafruit IO to publish-subscribe... could that be issue? What service are you using? Perhaps I should run my own mosquitto on a pi? I rather use Adafruit for IFTTT...

Seeing the exact same issue, with Adafruit IO too.

JDchauhan · 2023-07-22

Hi @turutupa & @alessionossa
I was using my own MQTT broker at that time.

Below is the part of the code I used back then that worked for me. I don't remember exactly but from the code, it seems like I am expecting the error to be thrown by check_msg() instead of publish() and then subscribing again (if it helps)

            try:
                client.publish(topic="test", msg="testing")
                if not isBroker:
                    client.subscribe(topic= data["id"] + "/#")                     
                # print("connected")
                client.check_msg()
                time.sleep(0.5)
            except:
                isBroker = False
                print("broker disconnected")
                time.sleep(2)
                break
javiergmarcos · 2023-08-12

Same on my side. I'm using micropython on ESP32 C3. Next the workaround that works for me.
Main loop blocking:
In the main() loop, I've replaced the mqtt.robust check_msg() and wait_msg() by my own, because if there is no mqtt server, check_msg() is waiting forever. That happens because when it calls to wait_msg(), it waits forever instead of generate an exception that try to reconnect and then continue the main() loop
The problem arise because check_msg() set self.mqtt_client.sock.setblocking(False), and then calls to wait_msg()
On their side wait_msg() change self.mqtt_client.sock.setblocking(True) at its beginning, and then allowing to wait for a mqtt server connection.
My workaround is using my_check_msg(), that is an exact copy of the check_msg() except it calls my_wait_msg().
my_wait_msg() is an exact copy of wait_msg() except that the initial #self.mqtt_client.sock.setblocking(True) is commented.
I've tried this with class inheritance and monkey patching without success. May be side effects
Regenerate subscription mqtt:
All my appliances (sensor, switch, light,...) connected to mqtt have a _register_mqtt(self) procedure.
In the initial setup of every appliance, I add each appliance to a device_appliances{} dictionary (key=appl_id, value=appliance) and also call _register_mqtt(self)
To reconnect after a mqtt server (or Wifi) fault, the procedure reinitialize the mqtt_client and then calls the _register_mqtt() for each appliance in the device_appliances{} dictionary.
Finally also republish the availability_topic.

andrewleech · 2023-08-12

Subclass is the current recommended solution here, however it won't really work until this is merged. Take a look for more information & example about re-subscription
https://github.com/micropython/micropython-lib/pull/669

CANDIDATE · ISSUE

umqtt.robust sends the NodeMCU Esp8266 board in some freezing state

closedby JDchauhanopened 2018-10-31updated 2024-08-27

I have observed some strange problem in my NodeMCU Esp8266 board after running umqtt.robust that the board will stop silently without any warning or throwing errors after 10-15 minutes of no communication with the broker (i.e, if the client does not publish or receive any message of subscribed topics for 15 minutes approx.).

It seems like the library might be taking the board to some blocking state.

Please check and fix that issue.

1 comment
jonnor · 2024-08-25

Hi. There is unfortunately not enough information here to reproduce or debug this problem.

Multiple issues with mqtt on esp8266 have been fixed over the last 5 years. And some are still open - search the issue tracker for more. Please re-test on the latest versions and if there are still problems which are not described in already open issues, provide details such as a minimal code example for how to reproduce - along with details about the MQTT setup, such as broker version.

Keyboard

j / / n
next pair
k / / p
previous pair
1 / / h
show query pane
2 / / l
show candidate pane
c
copy suggested comment
r
toggle reasoning
g i
go to index
?
show this help
esc
close overlays

press ? or esc to close

copied