← index #102Issue #316
Related · medium · value 1.130
QUERY · ISSUE

umqtt.robust dies when MQTT broker gets restarted

openby phieberopened 2016-09-14updated 2024-08-25

Hi,

the umqtt.robust module is pretty reliable but I found a case when it stops sending:

screenshot from 2016-09-14 13 42 10

As soon as I restart the MQTT broker, I have to connect via WebREPL and press CTRL+C.

I use the following minimal test code:
https://github.com/phieber/uPython-ESP8266-01-umqtt

Can you reproduce this issue when restarting the broker?

br
Patrick

16 comments
phieber · 2016-09-14

In the screenshot above, I have restarted the broker after two successful MQTT publish messages (two temperature values in this case)

Actpohomoc · 2016-11-09

I use code to avoid such problem:

try: retries = 0 while (retries < 20): retries += 1; client.check_msg() time.sleep(1); except OSError: connect_and_subscribe()

def connect_and_subscribe():
global client
client = MQTTClient(CONFIG['client_id'], CONFIG['broker'], CONFIG['port'], "user", "pass", 120)
client.set_callback(callback)
client.connect(False)
print("Conn to {}".format(CONFIG['broker']))
client.subscribe(b"MBI/CURRENT_DATETIME")
time.sleep(1);
client.check_msg()
.....
So I always check if there any OsError and reconnect to MQTT.

You need to upgrade the robust by this PR: [https://github.com/micropython/micropython-lib/pull/117]

craftyguy · 2017-03-17

Is there a better way to go about handling this situation where the broker 'disappears' and then 're-appears' at some later time? The current implementation of umqtt.robust is not robust at all, even with the reconnect. check_msg never works even though it seems the client reconnected.

Should the umqtt.robust object include a list of subscribed topics to auto-resubscribe in the reconnect() function?

scargill · 2017-06-01

That last message as a microPython newby worries me... I don't want to start coding in microPython on the basis of unreliable MQTT as we have reliable MQTT in C......

Can someone give an example of this "robust" code which WILL stay connected and which will resubscribe on reconnection - i.e. so that it just works in the background.

Is this possible?

dpgeorge · 2017-06-02

Is there a better way to go about handling this situation where the broker 'disappears' and then 're-appears' at some later time?

Yes, the current implementation of umqtt.robust does not handle the case when the broker is restarted and forgets all of its state (at least the state related to your client).

Should the umqtt.robust object include a list of subscribed topics to auto-resubscribe in the reconnect() function?

Prehaps. This is indeed how other libraries work (eg https://github.com/fusesource/mqtt-client). The robust MQTTClient class would need to override the "subscribe" method to record the topics (and qos), and then in the reconnect() method it would call subscribe() again after reconnecting.

dpgeorge · 2017-06-02

See #186 for a fix which will resubscribe to all existing topics if a reconnect is made.

craftyguy · 2017-06-02

Damn, I was in the process of writing a fix for this. You win the day, sir!

dpgeorge · 2017-06-02

@craftyguy I'd be interested to see your solution. And also if you want to test my solution and give feedback that would be great.

craftyguy · 2017-06-02

I literally started about less than an hour ago, but my approach was pretty much the same as yours. Your solution looks to be more elegant/robust. I'll give this a shot possibly as early as tomorrow, since I'm tired of hacking together a more robust robust mqtt :smile:

craftyguy · 2017-06-03

@dpgeorge

I tried your patch (#186), and it doesn't seem to work with this simple test program:

import machine
from umqtt.robust import MQTTClient
import utime

MQTT_SERVER = '1.1.1.1'
IN = 'in'
OUT = 'out'

# mqtt subscription callback
def sub_cb(topic, msg):
    t = topic.decode('ASCII')
    m = msg.decode('ASCII')
    print("received new topic/msg: %s / %s" % (t, m))
    if t == IN:
        print("IN: %s" % m)

umqtt_client = MQTTClient("test_client", MQTT_SERVER)
umqtt_client.DEBUG = True
umqtt_client.set_callback(sub_cb)
umqtt_client.connect(clean_session=False)
umqtt_client.subscribe(IN)
print("Connected to MQTT broker: %s" % MQTT_SERVER)


def main():
    global umqtt_client
    while True:
        utime.sleep(1)
        umqtt_client.check_msg()
        umqtt_client.publish(OUT, b'hi!')

I should note that I am invoking the main() function here from main.py.

When I restart the mqtt broker, I get an mqtt: OSError(-1,) printed to console. I can see the client reconnects to the broker since there's a message in the broker log about this, but the client doesn't respond to messages published to the IN topic, nor does it publish anything else to OUT topic.

If my test is an invalid use of umqtt, please let me know since I am relatively new to using mqtt!

dpgeorge · 2017-06-06

@craftyguy for your example to work I think you need to connect with clean_session=True, because you'll be explicitly resubscribing upon reconnection.

craftyguy · 2017-06-07

@dpgeorge I see, thank you for pointing that out. I also see the PR was merged :)
I will give it another try!

dpgeorge · 2017-06-07

I also see the PR was merged

@craftyguy No it wasn't, so you'll need to pull the PR explicitly to test it.

craftyguy · 2017-06-07

Ugh, I guess I misinterpreted a notification I received, now I see it
was you just modifying the PR. Sorry!

On Tue, Jun 06, 2017 at 09:53:05PM -0700, Damien George wrote:

I also see the PR was merged

@craftyguy No it wasn't, so you'll need to pull the PR explicitly to test it.

--
You are receiving this because you were mentioned.
Reply to this email directly or view it on GitHub:
https://github.com/micropython/micropython-lib/issues/102#issuecomment-306686107

curiouswala · 2018-06-22

I tried the PR and it does recover from a broker restart but when the broker loses power and comes back, it doesn't recover. It is completely reproducible, happens every time I take the power out from my raspberry pi zero running the mosquitto broker. Does anyone else face this behaviour?
The problem seems to be that umqtt.robust does raise an error when killing my mqtt broker from the terminal but doesn't raise any error when broker dies from a power outage.

jonnor · 2024-08-25

Is this still an issue on latest MicroPython and mqtt.robut? If so, we need minimal example code on how to reproduce.

CANDIDATE · ISSUE

umqtt.robust sends the NodeMCU Esp8266 board in some freezing state

closedby JDchauhanopened 2018-10-31updated 2024-08-27

I have observed some strange problem in my NodeMCU Esp8266 board after running umqtt.robust that the board will stop silently without any warning or throwing errors after 10-15 minutes of no communication with the broker (i.e, if the client does not publish or receive any message of subscribed topics for 15 minutes approx.).

It seems like the library might be taking the board to some blocking state.

Please check and fix that issue.

1 comment
jonnor · 2024-08-25

Hi. There is unfortunately not enough information here to reproduce or debug this problem.

Multiple issues with mqtt on esp8266 have been fixed over the last 5 years. And some are still open - search the issue tracker for more. Please re-test on the latest versions and if there are still problems which are not described in already open issues, provide details such as a minimal code example for how to reproduce - along with details about the MQTT setup, such as broker version.

Keyboard

j / / n
next pair
k / / p
previous pair
1 / / h
show query pane
2 / / l
show candidate pane
c
copy suggested comment
r
toggle reasoning
g i
go to index
?
show this help
esc
close overlays

press ? or esc to close

copied